Shinnosuke Takamichi

Core-set selection improves text-to-speech models by focusing on diverse data.

2025-09-12T08:19:30+00:00 ― 5 min read

This study examines if learned speech symbols mimic word frequency patterns.

2025-09-09T04:12:40+00:00 ― 5 min read

Coco-Nut offers diverse Japanese voice samples for advanced text-to-speech applications.

2025-09-05T11:57:05+00:00 ― 10 min read

A study on improving TTS systems with diverse voice samples.

2025-08-16T12:35:45+00:00 ― 4 min read

RALL-E enhances text-to-speech synthesis for clearer, more natural speech.

2025-08-13T01:11:40+00:00 ― 5 min read

Introducing spatial voice conversion to enhance audio realism and immersion.

2025-07-27T01:54:15+00:00 ― 6 min read

This study examines how voice preferences vary among different listeners.

2025-07-21T00:57:50+00:00 ― 4 min read

Researchers explore textless approaches for better understanding of spoken language.

2025-07-13T18:11:30+00:00 ― 6 min read

J-CHAT provides a large, open-source dataset for enhancing spoken dialogue systems.

2025-07-12T12:06:15+00:00 ― 5 min read

Researchers develop SaSLaW to enhance machine speech adaptation in various environments.

2025-07-01T16:11:00+00:00 ― 5 min read

BigCodec improves sound quality in low-bitrate audio transmission.

2025-06-15T19:36:50+00:00 ― 4 min read

A new method enhances synthesized ensemble singing by modeling singer interactions.

2025-06-09T11:23:10+00:00 ― 5 min read