Papers
Sign in to view your remaining parses.
Tag Filter
Pre-trained Video Generation Models
Mitty: Diffusion-based Human-to-Robot Video Generation
Published:12/19/2025
Human-to-Robot Video GenerationDiffusion TransformerUnlabeled LearningPre-trained Video Generation ModelsHuman-Robot Video Synthesis
The paper presents Mitty, a diffusionbased framework for endtoend humantorobot video generation that learns directly from human demonstrations, overcoming information loss and errors from intermediate representations. Leveraging a pretrained diffusion model, it generates hig
011
Señorita-2M: A High-Quality Instruction-based Dataset for General
Video Editing by Video Specialists
Published:2/11/2025
Instruction-based Video Editing DatasetHigh-Quality Video Editing PairsEnd-to-End Video Editing MethodsPre-trained Video Generation ModelsVideo Editing Filtering Pipeline
Señorita2M offers 2M highquality video editing pairs from four specialized models, with a filtering pipeline improving data quality, advancing endtoend video editing with faster inference and superior results.
05