Daixuan Cheng's picture

Daixuan Cheng

daixuancheng

·

https://github.com/cdxeve

DaixuanC45443

AI & ML interests

None yet

Recent Activity

authored a paper 5 days ago

SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training

upvoted a paper 5 days ago

SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training

upvoted a paper 5 days ago

SWE-World: Building Software Engineering Agents in Docker-Free Environments

View all activity

Organizations

None yet

upvoted 2 papers 5 days ago

SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training

Paper • 2602.03411 • Published 6 days ago • 35

SWE-World: Building Software Engineering Agents in Docker-Free Environments

Paper • 2602.03419 • Published 6 days ago • 39

upvoted a paper 6 days ago

Adaptive Ability Decomposing for Unlocking Large Reasoning Model Effective Reinforcement Learning

Paper • 2602.00759 • Published 9 days ago • 5

upvoted 6 collections 14 days ago

Agentic

14 items • Updated 17 days ago • 2

Agents

12 items • Updated 15 days ago • 1

AI-papers

5 items • Updated 16 days ago • 1

Ai-general

50 items • Updated 11 days ago • 3

Agent

94 items • Updated 17 days ago • 11

2026

174 items • Updated about 18 hours ago • 3

upvoted 5 collections 17 days ago

Coding

3 items • Updated 5 days ago • 1

Agents

11 items • Updated 5 days ago • 3

Training-Free

3 items • Updated 10 days ago • 1

LLM

11 items • Updated 10 days ago • 3

Agent

42 items • Updated 2 days ago • 3

upvoted a paper 17 days ago

LLM-in-Sandbox Elicits General Agentic Intelligence

Paper • 2601.16206 • Published 18 days ago • 84

upvoted 2 papers 4 months ago

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16, 2025 • 106

BitNet Distillation

Paper • 2510.13998 • Published Oct 15, 2025 • 59

upvoted a paper 5 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18, 2025 • 116

upvoted a paper 7 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 158

upvoted a collection 8 months ago

Models

297 items • Updated 18 days ago • 6