tongjingqi(SII)
tongjingqi
AI & ML interests
NLP
Recent Activity
upvoted
a
paper
3 days ago
TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization
upvoted
a
paper
4 days ago
Learning to Discover at Test Time