SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training Paper • 2602.03411 • Published 6 days ago • 35
SWE-World: Building Software Engineering Agents in Docker-Free Environments Paper • 2602.03419 • Published 6 days ago • 39
Adaptive Ability Decomposing for Unlocking Large Reasoning Model Effective Reinforcement Learning Paper • 2602.00759 • Published 9 days ago • 5
FlowRL: Matching Reward Distributions for LLM Reasoning Paper • 2509.15207 • Published Sep 18, 2025 • 116