A system creates real-time music based on tabletop role-playing game narratives.
Felipe Marra, Lucas N. Ferreira
― 7 min read
Cutting edge science explained simply
A system creates real-time music based on tabletop role-playing game narratives.
Felipe Marra, Lucas N. Ferreira
― 7 min read
Latest Articles
Shashi Kumar, Iuliia Thorbecke, Sergio Burdisso
― 5 min read
Risako Tanigawa, Kenji Ishikawa, Noboru Harada
― 7 min read
Leena G Pillai, Kavya Manohar, Basil K Raju
― 5 min read
Zitong Lan, Chenhao Zheng, Zhiwei Zheng
― 7 min read
Tito Spadini, Kenji Nose-Filho, Ricardo Suyama
― 5 min read
A new model improves identifying and locating sounds effectively.
Jinbo Hu, Yin Cao, Ming Wu
― 7 min read
AuscultaBase enhances accuracy in diagnosing health conditions using diverse body sound data.
Pingjie Wang, Zihan Zhao, Liudan Zhao
― 4 min read
ArPA helps Arabic-speaking kids improve their pronunciation through interactive activities.
Lamia Berriche, Maha Driss, Areej Ahmed Almuntashri
― 5 min read
A new dataset helps find music through friendly dialogue.
SeungHeon Doh, Keunwoo Choi, Daeyong Kwon
― 7 min read
Combining audio recordings with sheet music for better practice.
Irmak Bukey, Michael Feffer, Chris Donahue
― 6 min read
AEROMamba enhances low-quality audio into rich, high-fidelity sound.
Wallace Abreu, Luiz Wagner Pereira Biscainho
― 5 min read
A groundbreaking audio-language model aids in studying animal sounds and behaviors.
David Robinson, Marius Miron, Masato Hagiwara
― 7 min read
Creating an AI model for natural conversations in Taiwanese Mandarin.
Chih-Kai Yang, Yu-Kuan Fu, Chen-An Li
― 5 min read
Mamba enhances speech recognition with speed and accuracy, reshaping interaction with devices.
Yoshiki Masuyama, Koichi Miyazaki, Masato Murata
― 4 min read
New method enhances speech clarity using visual information from surroundings.
Xinyuan Qian, Jiaran Gao, Yaodan Zhang
― 5 min read
Exploring the challenges and implications of deepfake technology in today’s media landscape.
Ammarah Hashmi, Sahibzada Adil Shahzad, Chia-Wen Lin
― 6 min read
Research reveals how brain waves can aid silent communication.
Soowon Kim, Ha-Na Jo, Eunyeong Ko
― 6 min read
Research seeks to translate brain signals into various types of speech.
Jung-Sun Lee, Ha-Na Jo, Seo-Hyun Lee
― 6 min read
New models improve detection of fake voices in speech technology.
Yang Xiao, Rohan Kumar Das
― 5 min read
This project aims to standardize Bangla dialects for clearer communication.
Md. Nazmus Sadat Samin, Jawad Ibn Ahad, Tanjila Ahmed Medha
― 6 min read
SAMOS offers a new way to measure speech quality, enhancing naturalness.
Yu-Fei Shi, Yang Ai, Ye-Xin Lu
― 6 min read
Explore the fascinating science behind the sounds of pouring drinks.
Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek
― 5 min read
A new system evaluates singing voices using pitch and spectrum.
Yu-Fei Shi, Yang Ai, Ye-Xin Lu
― 6 min read
Discover how deep learning shapes music recommendations.
Aditya Sridhar
― 7 min read
Learn how machines classify sounds using spectrogram images.
Satvik Dixit, Laurie M. Heller, Chris Donahue
― 5 min read
Discover innovative methods for audio compression and their impact on immersive sound.
Toni Hirvonen, Mahmoud Namazi
― 5 min read
Voice analysis may help detect early signs of depression in young people.
Klaus R. Scherer, Felix Burkhardt, Uwe D. Reichel
― 6 min read
New tests aim to improve fairness in TTS voice ratings.
Praveen Srinivasa Varadhan, Amogh Gulati, Ashwin Sankar
― 6 min read
Research focuses on teaching computers to grasp music conversations.
Daeyong Kwon, SeungHeon Doh, Juhan Nam
― 5 min read
Learn how technology interprets our voices through sound wave analysis.
Nirmal Joshua Kapu, Raghav Karan
― 6 min read
Tiny-Align enhances voice assistants for better personal interaction on small devices.
Ruiyang Qin, Dancheng Liu, Gelei Xu
― 6 min read
FabuLight-ASD improves speaker detection by combining audio, visual, and body movement data.
Hugo Carneiro, Stefan Wermter
― 5 min read
A fresh sound system identifies sound directions, improving detection in noisy environments.
Erik Tegler, Magnus Oskarsson, Kalle Åström
― 4 min read
Discover how communication enhances teamwork and performance in esports.
Aymeric Vinot, Nicolas Perez
― 8 min read
HARP dataset transforms how we experience sound in virtual environments.
Shivam Saini, Jürgen Peissig
― 5 min read
Learn how new tech transforms images into immersive sound experiences.
Wei Guo, Heng Wang, Jianbo Ma
― 7 min read
A new method achieves high accuracy in voice recognition using minimal data.
Irfan Nafiz Shahan, Pulok Ahmed Auvi
― 6 min read
Revolutionizing sound creation for musicians with endless audio effects options.
Alec Wright, Alistair Carson, Lauri Juvela
― 6 min read
A tool connecting AI and human insights in music analysis.
Prashanth Thattai Ravikumar
― 6 min read
Exploring how audio tricks confuse language models.
Wanqi Yang, Yanda Li, Meng Fang
― 7 min read
Discover how DiM-Gestor enhances virtual character gestures in real-time.
Fan Zhang, Siyuan Zhao, Naye Ji
― 4 min read
An overview of deepfakes, their risks, and a new Hindi dataset.
Sukhandeep Kaur, Mubashir Buhari, Naman Khandelwal
― 6 min read
Research reveals how emotions shape our memories through innovative technology.
Joonwoo Kwon, Heehwan Wang, Jinwoo Lee
― 7 min read
A new ASR system enhances medical speech recognition for accurate patient care.
Sourav Banerjee, Ayushi Agarwal, Promila Ghosh
― 6 min read
Discover how music style transfer brings new life to your favorite tunes.
Sooyoung Kim, Joonwoo Kwon, Heehwan Wang
― 5 min read