A new system helps blind viewers understand short videos better.
― 4 min read
Cutting-edge science explained simply
Innovative approaches are improving access to education for underserved communities.
― 8 min read
New techniques improve how automatic speech recognition (ASR) systems handle long speech.
― 5 min read
Text simplification helps improve access to information for diverse readers.
― 6 min read
A study on making scientific images accessible for those with color vision deficiency.
― 6 min read
This project enhances real-time speech translation and automatic subtitling systems.
― 4 min read
MAIDR helps blind users access data visualizations through sound, touch, and text.
― 7 min read
Exploring advancements in automated audio captioning and its impact on accessibility.
― 5 min read
Chart4Blind transforms complex charts into formats accessible for visually impaired users.
― 7 min read
A look into how audio-visual question answering (AVQA) technology answers questions using video and audio.
― 6 min read
Research reveals the preferences of blind and low-vision (BLV) users for video access.
― 5 min read
A method to help the visually impaired recognize sounds in mixed reality.
― 5 min read
A new model improves speech-to-text efficiency in real-time applications.
― 6 min read
Our model generates hint-text to improve usability for visually impaired users.
― 4 min read
New methods improve accessibility and accuracy in audio captioning.
― 6 min read
RASSAR app improves home safety and accessibility using advanced technology.
― 4 min read
New methods aim to improve communication for the deaf community.
― 5 min read
A method for enhancing speech recognition accuracy in Kannada and Telugu languages.
― 7 min read
A new approach to generate more informative captions for images.
― 7 min read
RALL-E enhances text-to-speech synthesis for clearer, more natural speech.
― 5 min read
A new method enhances clarity and expressiveness in sign language.
― 6 min read
Introducing a new approach to improve text layout analysis in images.
― 5 min read
Learn how enhancing UI agents can create better user experiences.
― 7 min read
A new method directly creates subtitles, improving accessibility for diverse audiences.
― 8 min read
Examining how technology can better express emotions in communication.
― 7 min read
New methods improve how AI connects text and images for better results.
― 8 min read
A study reveals user frustrations and preferences regarding CAPTCHAs on websites.
― 7 min read
Introducing a model that generates synchronized audio and video with mixed noise levels.
― 6 min read
This system helps visually impaired individuals shop more independently using a robotic cane.
― 6 min read
A new method enhances how machines convey visual information to humans.
― 6 min read
Seed-TTS creates lifelike speech from text for various applications.
― 5 min read
A new method creates better video captions by focusing on narratives and causality.
― 5 min read
A new approach to audio captioning reduces reliance on paired data.
― 5 min read
A new approach to predicting mobile app UI changes based on user actions.
― 5 min read
Using sound to make astronomical data more accessible and engaging for all.
― 8 min read
A project blends dance and technology for creative expression.
― 6 min read
ReadCtrl allows language models to better match text complexity to reader abilities.
― 5 min read
GigaSpeech 2 offers a vast dataset for low-resource languages to improve speech recognition.
― 5 min read
Examining the need for context in accurate sign language translation.
― 5 min read
A system combines audio and video to enhance speaker detection accuracy.
― 5 min read