Sema Code: Decoupling AI Coding Agents into Programmable, Embeddable Infrastructure Paper • 2604.11045 • Published 12 days ago • 24
FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios Paper • 2604.07413 • Published 17 days ago • 94
Communicating about Space: Language-Mediated Spatial Integration Across Partial Views Paper • 2603.27183 • Published 27 days ago • 20
CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents Paper • 2603.24440 • Published about 1 month ago • 98
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings Paper • 2603.13594 • Published Mar 13 • 148
LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs Paper • 2602.00462 • Published Jan 31 • 19
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10, 2025 • 107
REARANK: Reasoning Re-ranking Agent via Reinforcement Learning Paper • 2505.20046 • Published May 26, 2025 • 18
LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces Paper • 2503.01894 • Published Feb 27, 2025 • 2
Societal Alignment Frameworks Can Improve LLM Alignment Paper • 2503.00069 • Published Feb 27, 2025 • 17
Language Models' Factuality Depends on the Language of Inquiry Paper • 2502.17955 • Published Feb 25, 2025 • 32
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding Paper • 2502.01341 • Published Feb 3, 2025 • 39