Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing Paper • 2603.03143 • Published 8 days ago • 109
Enhancing Spatial Understanding in Image Generation via Reward Modeling Paper • 2602.24233 • Published 12 days ago • 52
Mode Seeking meets Mean Seeking for Fast Long Video Generation Paper • 2602.24289 • Published 12 days ago • 39
From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors Paper • 2602.21778 • Published 14 days ago • 14
tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction Paper • 2602.20160 • Published 16 days ago • 10
Rolling Sink: Bridging Limited-Horizon Training and Open-Ended Testing in Autoregressive Video Diffusion Paper • 2602.07775 • Published Feb 8 • 8
Optimizing Few-Step Generation with Adaptive Matching Distillation Paper • 2602.07345 • Published Feb 7 • 9
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper • 2602.18422 • Published 19 days ago • 30
InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions Paper • 2602.06035 • Published Feb 5 • 23
DuoGen: Towards General Purpose Interleaved Multimodal Generation Paper • 2602.00508 • Published Jan 31 • 4
DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation Paper • 2601.22153 • Published Jan 29 • 72
SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer Paper • 2601.16515 • Published Jan 23 • 15
ActionMesh: Animated 3D Mesh Generation with Temporal 3D Diffusion Paper • 2601.16148 • Published Jan 22 • 12
Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning Paper • 2601.16163 • Published Jan 22 • 14
PhysRVG: Physics-Aware Unified Reinforcement Learning for Video Generative Models Paper • 2601.11087 • Published Jan 16 • 11
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published Jan 14 • 33