Latest Articles for Video

Computer Vision and Pattern Recognition New Dataset Transforms Video Generation Research

A large dataset of prompts and videos advances text-to-video technology.

2025-08-30T19:51:18+00:00 ― 6 min read

Computer Vision and Pattern Recognition Guiding Attention in Image and Video Creation

Learn how saliency maps enhance image and video generation.

2025-08-28T18:20:54+00:00 ― 5 min read

Computer Vision and Pattern Recognition SV3D: Transforming 2D Images into 3D Reality

SV3D creates stunning 3D visuals from single 2D images.

2025-08-28T07:48:54+00:00 ― 6 min read

Multimedia Virbo: Simplifying Video Production with Avatars

Create talking avatar videos easily with Virbo's innovative system.

2025-08-28T05:34:36+00:00 ― 6 min read

Computer Vision and Pattern Recognition Innovative Method for Video Depth Estimation

A new model improves depth estimation by combining predictions and multi-frame analysis.

2025-08-27T22:59:36+00:00 ― 5 min read

Computer Vision and Pattern Recognition New Dataset Captures Learning Through Observation

Researchers create a dataset to study how people learn by mimicking others.

2025-08-26T05:31:06+00:00 ― 7 min read

Hardware Architecture New AI System Inspired by the Brain

A new AI approach aims to improve image and video generation speed and efficiency.

2025-08-21T10:50:12+00:00 ― 4 min read

Computers and Society Understanding Media-Based Misinformation: A Deep Dive

This study sheds light on how media fuels misinformation online.

2025-08-10T02:48:48+00:00 ― 4 min read

Computer Vision and Pattern Recognition Simplifying Video Editing with Automatic Narratives

A new system streamlines video editing through automated descriptions.

2025-08-05T21:49:30+00:00 ― 6 min read

Bioinformatics ExoDeepFinder: A New Tool for Detecting Exocytosis Events

ExoDeepFinder efficiently detects rare exocytosis events in video data using deep learning.

2025-08-01T02:19:13+00:00 ― 4 min read

Audio and Speech Processing Using Audio Technology for Pedestrian Tracking

This study examines audio methods for tracking pedestrian movement in urban areas.

2025-07-29T17:52:20+00:00 ― 7 min read

Computer Vision and Pattern Recognition GenMM: A New Way to Insert 3D Objects in Videos

GenMM improves realistic insertion of 3D objects in videos and LiDAR scans.

2025-07-28T13:33:36+00:00 ― 6 min read

Computers and Society TikTok's Influence on Health Behaviors

How TikTok shapes user habits around vaping and drinking.

2025-07-25T13:48:06+00:00 ― 5 min read

Computer Vision and Pattern Recognition New Method for Creating Sound from Video and Text

This article presents a method to generate accurate sound from videos and text.

2025-07-20T16:03:25+00:00 ― 7 min read

Computer Vision and Pattern Recognition New Method for Early Detection of Autism

This study proposes a video-based approach to assess autism severity in children.

2025-07-14T09:35:48+00:00 ― 6 min read

Computation and Language YouTube-SL-25: Advancing Sign Language Research

A substantial dataset to enhance sign language technology and research.

2025-07-13T11:44:24+00:00 ― 4 min read

Computer Vision and Pattern Recognition Innovative Method for Video and Depth Generation

New approach generates high-quality human action videos with depth information.

2025-07-13T09:45:54+00:00 ― 8 min read

Computer Vision and Pattern Recognition Advancements in Digital Face Creation

Researchers develop PAV for realistic digital avatars from video clips.

2025-07-09T08:04:06+00:00 ― 5 min read

Computer Vision and Pattern Recognition New Benchmark Enhances Video-Language Understanding

A new benchmark improves models' understanding of long videos and language.

2025-07-09T01:29:06+00:00 ― 5 min read

Computer Vision and Pattern Recognition Introducing the PIV3CAMS Dataset for Computer Vision

A new dataset featuring image pairs from three camera types for computer vision research.

2025-07-06T09:29:42+00:00 ― 5 min read

Computer Vision and Pattern Recognition Innovative Model for Diagnosing Depression

A new approach merges audio, video, and text data for effective depression diagnosis.

2025-07-06T04:53:12+00:00 ― 8 min read

Multimedia Addressing Hate Speech in Videos with MultiHateClip Dataset

New dataset provides insights on hate speech across languages and formats.

2025-07-06T02:31:00+00:00 ― 6 min read

Computer Vision and Pattern Recognition A New Way to Measure Pain

This framework combines videos and brain data for better pain assessment.

2025-07-05T08:44:30+00:00 ― 6 min read

Image and Video Processing SAM-2: Advancements in Surgical Video Segmentation

SAM-2 improves surgical video analysis, handling challenges like smoke and low lighting.

2025-07-04T09:46:15+00:00 ― 5 min read

Computer Vision and Pattern Recognition Introducing VidGen-1M: A New Dataset for Video Generation

VidGen-1M improves video generation from text with high-quality data.

2025-07-02T03:58:48+00:00 ― 5 min read

Computer Vision and Pattern Recognition Improving Deepfake Detection with Fine Details

A new approach focuses on subtle inconsistencies in deepfake detection.

2025-07-01T04:02:15+00:00 ― 6 min read

Animal Behavior and Cognition AnimalMotionViz: A New Tool for Cow Behavior Analysis

A software tool to track and analyze cow movement and space use.

2025-06-28T08:15:25+00:00 ― 6 min read

Robotics RoboMNIST: A New Dataset for Robot Activity Recognition

RoboMNIST aids robots in recognizing various activities using WiFi, video, and audio.

2025-06-22T09:30:35+00:00 ― 6 min read

Computer Vision and Pattern Recognition Kangaroo: A New Approach to Video Understanding

Kangaroo improves video analysis by integrating visuals, sounds, and text effectively.

2025-06-20T14:33:30+00:00 ― 5 min read

Computer Vision and Pattern Recognition Improving Human Motion Tracking with New Techniques

A new method enhances accuracy in tracking human movement from video.

2025-06-14T09:46:30+00:00 ― 5 min read

Multimedia New Method for Detecting Human Emotions

A study reveals a new way to identify emotions using video, sound, and text.

2025-06-12T23:24:36+00:00 ― 5 min read

Computer Vision and Pattern Recognition Advancements in Active Speaker Detection Technology

New model improves real-time speaker detection and efficiency in communication.

2025-06-12T14:43:12+00:00 ― 5 min read

Sound Advancements in Video-to-Audio Generation

New methods improve audio synchronization with changing video scenes.

2025-06-10T20:35:05+00:00 ― 4 min read

Robotics Robots Learning to Cook from Online Data

This article covers how robots learn cooking skills using internet information.

2025-06-07T13:58:54+00:00 ― 7 min read

Computer Vision and Pattern Recognition V-AURA: Advancing Video-to-Audio Integration

A new model creates audio that matches video, enhancing media experiences.

2025-06-05T23:59:05+00:00 ― 4 min read

Computation and Language New Dataset Sheds Light on Climate Change Opinions

MultiClimate dataset reveals public stances on climate change through videos.

2025-06-05T09:34:42+00:00 ― 6 min read

Robotics Teaching Robots to Imitate Human Actions

New method helps robots learn tasks by watching human demonstrations.

2025-06-05T05:53:30+00:00 ― 5 min read

Human-Computer Interaction Can Nudges Help Fight Fake Videos?

A study shows nudges work for headlines but not for cute deepfake videos.

2025-06-02T17:35:18+00:00 ― 5 min read

Sound Integrating Audio-Visual Data for Speech Processing

This study analyzes how audio, video, and text work together in speech recognition.

2025-05-30T15:13:22+00:00 ― 7 min read

Computer Vision and Pattern Recognition ReCapture: New Video Angle Tool

Change how you see videos with ReCapture's innovative angle shifting technology.

2025-05-28T15:45:00+00:00 ― 6 min read