Article Getting More from Your Test-Time Compute Budget with Portfolio Beam Search 12 days ago • 7
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated 10 days ago • 86
EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots Paper • 2602.18071 • Published 16 days ago • 22
Masking Teacher and Reinforcing Student for Distilling Vision-Language Models Paper • 2512.22238 • Published Dec 23, 2025 • 30
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents Paper • 2602.14234 • Published 21 days ago • 26
GLM-5: from Vibe Coding to Agentic Engineering Paper • 2602.15763 • Published 19 days ago • 105
NVIDIA Cosmos 2 Collection The latest open, multimodal generation models for world generation and reasoning for Physical AI. • 3 items • Updated 1 day ago • 15
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning Paper • 2602.12099 • Published 24 days ago • 57
PhyCritic: Multimodal Critic Models for Physical AI Paper • 2602.11124 • Published 25 days ago • 52
RADIO Collection A collection of Foundation Vision Models that combine multiple models (CLIP, DINOv2, SAM, etc.). • 19 items • Updated 1 day ago • 32
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning Paper • 2601.21468 • Published Jan 29 • 25
DocReward: A Document Reward Model for Structuring and Stylizing Paper • 2510.11391 • Published Oct 13, 2025 • 27
TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers Paper • 2601.14133 • Published Jan 20 • 61
Endless Terminals: Scaling RL Environments for Terminal Agents Paper • 2601.16443 • Published Jan 23 • 18
Behavior Knowledge Merge in Reinforced Agentic Models Paper • 2601.13572 • Published Jan 20 • 26
Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning Paper • 2601.16163 • Published Jan 22 • 14