arxiv:2505.19897
Oran Feng
xiachongfeng
AI & ML interests
Large Language Models
Recent Activity
upvoted
a
paper
about 1 month ago
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models