Computer Science - Multimedia

RSS

Multimedia Addressing Misinformation on Social Media

A new model combines logic and neural networks to detect misinformation more effectively.

2025-11-18T04:01:06+00:00 ― 6 min read

Latest Articles

Image and Video Processing Understanding HDR-VDP-3: A Guide to Image Quality Assessment

Learn how HDR-VDP-3 improves image quality evaluation for various applications.

2025-11-16T08:39:35+00:00 ― 4 min read

Multimedia Advancements in Multimodal Sentiment Analysis

New methods improve sentiment analysis with limited labeled data.

2025-11-15T10:26:54+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancements in Video Question Answering via Game Theory

A new model enhances video question answering using game theory principles.

2025-11-15T07:56:48+00:00 ― 6 min read

Sound LORIS: A New Approach to Video Music Generation

LORIS generates high-quality music that syncs perfectly with video movements.

2025-11-14T05:38:50+00:00 ― 5 min read

Image and Video Processing GAMIVAL: A New Tool for Gaming Video Quality

GAMIVAL evaluates streaming quality for mobile cloud gaming without reference videos.

2025-11-13T21:33:00+00:00 ― 4 min read

Multimedia Advancing Video Character Search with SoCoSearch

SoCoSearch improves how we find characters in video content using social context.

2025-11-13T14:20:24+00:00 ― 5 min read

Computation and Language Addressing Disinformation with FACTIFY 3M

A dataset aimed at improving fact-checking by combining text and images.

2025-11-13T10:55:00+00:00 ― 5 min read

Computer Vision and Pattern Recognition Integrity Encryptor: A New Approach to Deepfake Detection

A proactive method to safeguard images against deepfake manipulations.

2025-11-12T20:18:06+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancements in Video Quality Assessment Methods

Research enhances video quality evaluation using advanced methods and comprehensive databases.

2025-11-12T17:32:12+00:00 ― 5 min read

Computer Vision and Pattern Recognition The Rise of Text-to-Image Generation

This article reviews the current state of text-to-image generation technology.

2025-11-12T07:16:00+00:00 ― 5 min read

Computer Vision and Pattern Recognition Improving Semantic Segmentation with Depth Data

A new method enhances segmentation accuracy by integrating depth information without source data.

2025-11-12T00:01:30+00:00 ― 6 min read

Computer Vision and Pattern Recognition New Framework Transforms Video Generation from Text

A new method improves video creation from text with added control and quality.

2025-11-11T16:15:24+00:00 ― 6 min read

Audio and Speech Processing Advancements in Speech-to-Singing Technology

Research presents a method to convert spoken words into singing efficiently.

2025-11-11T12:52:10+00:00 ― 7 min read

Computer Vision and Pattern Recognition Advancing Machine Learning with Integrated Multimodal Perception

A look at how Integrated Multimodal Perception enhances machine learning capabilities.

2025-11-10T19:51:55+00:00 ― 6 min read

Sound Advancements in Speech Synthesis with CoMoSpeech

CoMoSpeech improves speech synthesis speed and quality with a one-step process.

2025-11-10T05:17:25+00:00 ― 4 min read

Human-Computer Interaction Addressing Hate Raids in Live Streaming Communities

A look into hate raids and their impact on marginalized streamers.

2025-11-09T22:07:24+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advancing Image Compression for Human Perception

A new method improves image compression by prioritizing human-friendly features.

2025-11-09T19:34:25+00:00 ― 5 min read

Computation and Language Understanding Memes Through Contextual Analysis

This study highlights the importance of context in interpreting memes.

2025-11-09T18:10:24+00:00 ― 5 min read

Sound Innovative Approaches to Music Rearrangement

A new method for creating unique music versions by rearranging existing pieces.

2025-11-09T15:31:30+00:00 ― 6 min read

Information Retrieval Introducing the SURE Dataset for Shopping Dialogues

A dataset designed to improve interactions between customers and salespeople in stores.

2025-11-09T10:24:18+00:00 ― 6 min read

Computer Vision and Pattern Recognition A New Approach to Visual Question Answering

Introducing a modular method for zero-shot visual question answering.

2025-11-08T19:07:54+00:00 ― 4 min read

Computation and Language Revising Task Steps Using Video Analysis

A new method to better organize task steps with video insights.

2025-11-08T18:04:42+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancements in Deblurring Quality Measurement

Improving metrics for assessing deblurring methods using a new dataset.

2025-11-08T16:14:06+00:00 ― 5 min read

Computer Vision and Pattern Recognition Improving Vision-Language Models with CLIP Feedback

A new method enhances vision-language models through real-time feedback for better performance.

2025-11-08T04:38:54+00:00 ― 6 min read

Computation and Language Advancing Fake News Detection Models

New models enhance the detection of fake news using diverse data techniques.

2025-11-08T01:13:30+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advancements in Multi-Camera Systems for Autonomous Vehicles

Occ-BEV enhances vehicle perception through multi-camera 3D modeling and data integration.

2025-11-07T14:57:18+00:00 ― 6 min read

Cryptography and Security Analyzing the J-UNIWARD Method and Its Error

A look into J-UNIWARD's message hiding technique and its minor calculation error.

2025-11-06T17:05:54+00:00 ― 4 min read

Computer Vision and Pattern Recognition Addressing Bias in Visual Question Answering

A new approach tackles language and vision biases in VQA systems.

2025-11-06T14:27:54+00:00 ― 6 min read

Computer Vision and Pattern Recognition Improving Compression Quality of 3D Point Clouds

A method to enhance compressed 3D point cloud data using advanced neural networks.

2025-11-06T06:33:54+00:00 ― 6 min read

Machine Learning Advancing Multi-modal Learning with C-MCR

C-MCR simplifies multi-modal learning by connecting existing knowledge efficiently.

2025-11-05T03:49:55+00:00 ― 6 min read

Sound Simplifying Sound Synthesis with NAS-FM

A new method for creating synthesizers that benefits musicians.

2025-11-04T17:18:20+00:00 ― 6 min read

Computer Vision and Pattern Recognition Do-GOOD Benchmark: Enhancing Document Understanding Models

New benchmark reveals performance gaps in document processing models.

2025-11-04T02:17:36+00:00 ― 7 min read

Computer Vision and Pattern Recognition Advancements in Panoramic Semantic Segmentation

New model improves panoramic image analysis for real-world applications.

2025-11-04T00:19:06+00:00 ― 4 min read

Human-Computer Interaction LoopBoxes: A New Way to Make Music

LoopBoxes helps children create music easily and collaboratively.

2025-11-03T08:55:00+00:00 ― 5 min read

Computer Vision and Pattern Recognition Challenges in Text-Video Retrieval and Solutions

A look at biases in text-video retrieval and ways to enhance accuracy.

2025-11-03T00:45:00+00:00 ― 6 min read

Sound Advancements in Audio Classification Techniques

A novel method enhances audio classification by learning new sounds efficiently.

2025-10-31T22:37:00+00:00 ― 4 min read

Multimedia 360TripleView: Enhancing 360-Degree Video Experience

A new system improves viewing direction selection in 360-degree videos.

2025-10-31T20:44:30+00:00 ― 6 min read

Computer Vision and Pattern Recognition GeneCIS: Advancing Conditional Image Similarity in Computer Vision

A benchmark for assessing image similarity based on user-defined conditions.

2025-10-31T19:09:42+00:00 ― 6 min read

Sound Advancing Audio Question Answering with MWAFM Model

A new model improves how machines understand and respond to audio questions.

2025-10-31T18:34:05+00:00 ― 5 min read

Multimedia Balancing Active Learning in Multimodal Data

A new strategy ensures equal representation of data types in machine learning.

2025-10-31T02:02:42+00:00 ― 6 min read