Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations Paper • 2602.05885 • Published 3 days ago • 22
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning Paper • 2602.04634 • Published 4 days ago • 87
LatentMem: Customizing Latent Memory for Multi-Agent Systems Paper • 2602.03036 • Published 6 days ago • 12
Scaling Embeddings Outperforms Scaling Experts in Language Models Paper • 2601.21204 • Published 11 days ago • 97
AgentDoG Collection A Diagnostic Guardrail Framework for AI Agent Safety and Security • 11 items • Updated 11 days ago • 96
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model Paper • 2601.15892 • Published 17 days ago • 53
Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey Paper • 2601.11655 • Published 24 days ago • 60
Toward Efficient Agents: Memory, Tool learning, and Planning Paper • 2601.14192 • Published 19 days ago • 54
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning Paper • 2601.09667 • Published 25 days ago • 87
OpenNovelty: An LLM-powered Agentic System for Verifiable Scholarly Novelty Assessment Paper • 2601.01576 • Published Jan 4 • 18
VA-π: Variational Policy Alignment for Pixel-Aware Autoregressive Generation Paper • 2512.19680 • Published Dec 22, 2025 • 11
GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators Paper • 2512.19682 • Published Dec 22, 2025 • 17