EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots Paper β’ 2602.18071 β’ Published 11 days ago β’ 22
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper β’ 2602.18422 β’ Published 11 days ago β’ 30
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper β’ 2602.08354 β’ Published 22 days ago β’ 255
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework Paper β’ 2512.03041 β’ Published Dec 2, 2025 β’ 66
GENIUS: Generative Fluid Intelligence Evaluation Suite Paper β’ 2602.11144 β’ Published 19 days ago β’ 53
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper β’ 2602.05400 β’ Published 26 days ago β’ 342
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos Paper β’ 2602.06949 β’ Published 24 days ago β’ 35
Running on CPU Upgrade 1.1k Omni Image Editor πΌ 1.1k Image edit, text to image, image upscale, remove watermark
Running on Zero MCP 1.05k Wan2.2 14B Preview π 1.05k generate a video from an image with a text prompt
Running on Zero MCP Featured 952 Qwen-Image-Edit-2511-LoRAs-Fast π 952 Demo of the Collection of Qwen Image Edit LoRAs
Running on Zero Featured 1.6k Qwen3-TTS Demo π 1.6k Generate custom speech from text, voice descriptions, or samples
Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper β’ 2602.00919 β’ Published about 1 month ago β’ 307