Computer Science - Multimedia

Computer Vision and Pattern Recognition Advancements in Language and Visual Models

New model efficiently links language understanding with image processing.

2025-10-25T06:20:48+00:00 ― 5 min read

Multimedia A New System for Music and Video Matching

This research introduces a system for matching music to video content effectively.

2025-10-24T07:37:10+00:00 ― 6 min read

Multimedia The Metaverse: A New Digital Landscape

Discover the evolving Metaverse and its impact on communication and economy.

2025-10-24T03:21:18+00:00 ― 6 min read

Computers and Society The Role of Transcripts in Educational Videos

Transcripts enhance understanding of educational videos, addressing audio quality issues.

2025-10-24T02:33:54+00:00 ― 6 min read

Signal Processing Advancements in 3D Point Cloud Transmission with SEPT

SEPT improves wireless transmission of 3D point clouds using deep learning.

2025-10-23T03:16:45+00:00 ― 5 min read

Information Retrieval A New Multilingual Dataset for Video News

This dataset aims to improve video news retrieval across five languages.

2025-10-23T01:32:54+00:00 ― 6 min read

Computer Vision and Pattern Recognition Improving Frame Selection for Video Question Answering

New methods enhance how models select frames for answering questions from videos.

2025-10-22T05:40:00+00:00 ― 7 min read

Computer Vision and Pattern Recognition Improving Video Calls with Predictive Coding

A new method enhances video call quality while saving bandwidth.

2025-10-22T03:02:00+00:00 ― 5 min read

Computer Vision and Pattern Recognition Transforming Photos into Character Line Drawings

A method for creating artistic line drawings from photographs with user control.

2025-10-19T23:25:12+00:00 ― 6 min read

Multimedia Advancing Video-Text Tasks in Indonesian Language

New dataset enhances video-text tasks for Indonesian speakers.

2025-10-19T21:32:45+00:00 ― 7 min read

Sound Advancements in Measuring Music Similarity

Research aims to combine audio and symbolic data for music similarity analysis.

2025-10-19T11:49:45+00:00 ― 7 min read

Multimedia Advancements in Watermark Attack Techniques Using Diffusion Models

New methods improve watermark removal while preserving image quality.

2025-10-18T23:41:00+00:00 ― 5 min read

Computation and Language Improving Hate Speech Detection with mDT

A new method enhances hate speech detection by combining text, images, and discussion context.

2025-10-18T18:50:54+00:00 ― 6 min read

Networking and Internet Architecture AI-Driven Predictions Boost XR Service Efficiency

AI predictions improve service for extended reality users on advanced networks.

2025-10-18T09:22:06+00:00 ― 4 min read

Multimedia Improving Target Speaker Extraction with Visual Cues

A new model enhances speech extraction using audio and visual information.

2025-10-17T12:51:55+00:00 ― 5 min read

Computer Vision and Pattern Recognition New Dataset Aims to Detect Altered Faces

RetouchingFFHQ dataset enhances face retouching detection methods.

2025-10-17T11:46:30+00:00 ― 6 min read

Multimedia Revolutionizing Infant Sleep Monitoring with LittleBeats

Study uses a device that collects multiple types of data to track infant sleep patterns more accurately.

2025-10-16T17:25:55+00:00 ― 4 min read

Computer Vision and Pattern Recognition Improving Image Annotation with vTelos Method

A new approach to enhance image labeling accuracy in machine learning.

2025-10-15T08:57:06+00:00 ― 6 min read

Computer Vision and Pattern Recognition Efficient Video Action Recognition with Fewer Frames

A new method improves action recognition by using fewer frames without losing important context.

2025-10-14T23:36:12+00:00 ― 8 min read

Computer Vision and Pattern Recognition Improving Image Generation from Text Descriptions

A new method enhances how images match text inputs.

2025-10-14T14:00:56+00:00 ― 6 min read

Databases The Impact of Blockchain on Copyright Management

Exploring how blockchain technology can reshape copyright management for creators.

2025-10-14T07:24:30+00:00 ― 5 min read

Computer Vision and Pattern Recognition Innovative Smartphone Method to Monitor Malnutrition

A new way to assess health using just a smartphone image.

2025-10-13T21:47:48+00:00 ― 7 min read

Computer Vision and Pattern Recognition Simplifying Video Labeling with Visual Analytics

A new tool streamlines the process of labeling video data.

2025-10-13T11:00:00+00:00 ― 7 min read

Computer Vision and Pattern Recognition Understanding Emotions in Images with StyleEDL

A new method combines image style and content to interpret emotions accurately.

2025-10-12T03:24:00+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advancements in Scene Text Editing with FAST

FAST revolutionizes scene text editing with natural modifications and flexibility.

2025-10-12T01:17:36+00:00 ― 6 min read

Computer Vision and Pattern Recognition Advancements in 3D Shape Generation Using Sketches and Text

A new method combines sketches and text to improve 3D shape generation.

2025-10-12T00:46:00+00:00 ― 7 min read

Multimedia Protecting Copyrights in Prompt Services

A new framework for safeguarding prompt creators' rights in AI tools.

2025-10-11T23:42:48+00:00 ― 5 min read

Multimedia Advancements in Vision-Language Pre-training Methods

A new approach improves efficiency in Vision-Language Pre-training tasks.

2025-10-11T17:07:48+00:00 ― 6 min read

Computer Vision and Pattern Recognition Improving Video Creation with DiffSynth

DiffSynth enhances video quality by reducing flickering and improving frame blending.

2025-10-11T07:46:54+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advances in Spiking Neural Networks: Model Compression with Minimax Optimization

A look at how minimax optimization improves the efficiency of Spiking Neural Networks.

2025-10-11T03:18:18+00:00 ― 6 min read

Multimedia Jade: A New Approach to Video Streaming Quality

Jade improves video quality through user feedback and adaptive streaming techniques.

2025-10-10T17:57:24+00:00 ― 5 min read

Computer Vision and Pattern Recognition Innovative Model for Color Selection in Design

A new model recommends colors based on design elements and text.

2025-10-10T17:49:30+00:00 ― 5 min read

Computer Vision and Pattern Recognition Transferring Hand Movements Between Avatars

A new method enhances gesture communication for avatars with unique hand shapes.

2025-10-10T04:39:30+00:00 ― 5 min read

Computer Vision and Pattern Recognition Audio-Visual Question Answering: Bridging Sound and Sight

AVQA connects audio and visual elements in videos to answer questions.

2025-10-09T23:47:12+00:00 ― 6 min read

Computer Vision and Pattern Recognition Introducing the Versatile Face Animator for 3D Animation

A new method for creating realistic 3D facial animations quickly and efficiently.

2025-10-09T16:32:42+00:00 ― 5 min read

Cryptography and Security Advances in Video Steganography and Detection

New methods improve the detection of hidden messages in video files.

2025-10-09T09:34:00+00:00 ― 5 min read

Computer Vision and Pattern Recognition Transforming Skulls into Living Animal Images

A method to translate skull images into realistic animal representations using text prompts.

2025-10-08T21:43:00+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advancements in Real-Time Video Analysis

New methods improve event detection in streaming videos using language and historical data.

2025-10-08T18:57:06+00:00 ― 5 min read

Computer Vision and Pattern Recognition New Method for Detecting Hateful Memes

A novel approach improves detection of harmful memes using targeted questioning.

2025-10-08T12:22:06+00:00 ― 8 min read

Multimedia EMID: A New Approach to Music and Images

Explore the emotional ties between music and images with the EMID dataset.

2025-10-08T07:45:36+00:00 ― 5 min read