Learning to Self-Verify Makes Language Models Better Reasoners Paper • 2602.07594 • Published 25 days ago • 1
UniT: Unified Multimodal Chain-of-Thought Test-time Scaling Paper • 2602.12279 • Published 20 days ago • 19
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval Paper • 2407.12883 • Published Jul 16, 2024 • 13
Improving Data and Reward Design for Scientific Reasoning in Large Language Models Paper • 2602.08321 • Published 24 days ago • 41
Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards Paper • 2601.06021 • Published Jan 9 • 47
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published Jan 30 • 108
Cognitive Foundations for Reasoning and Their Manifestation in LLMs Paper • 2511.16660 • Published Nov 20, 2025 • 11
PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR Paper • 2601.18207 • Published Jan 26 • 19
Innovator-VL: A Multimodal Large Language Model for Scientific Discovery Paper • 2601.19325 • Published Jan 27 • 79
PaperSearchQA Collection Data and corpora for the paper "PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR". The main dataset is `PaperSearchQA`. • 5 items • Updated 29 days ago • 4
End-to-End Agentic RAG System Training for Traceable Diagnostic Reasoning Paper • 2508.15746 • Published Aug 21, 2025 • 14
Preserving Privacy, Increasing Accessibility, and Reducing Cost: An On-Device Artificial Intelligence Model for Medical Transcription and Note Generation Paper • 2507.03033 • Published Jul 3, 2025 • 10
SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning Paper • 2506.21355 • Published Jun 26, 2025 • 10
Unifying Segment Anything in Microscopy with Multimodal Large Language Model Paper • 2505.10769 • Published May 16, 2025 • 2
ZeroSearch: Incentivize the Search Capability of LLMs without Searching Paper • 2505.04588 • Published May 7, 2025 • 65
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7, 2025 • 205