17 43 13

James Burgess

jmhb

https://jmhb0.github.io/

jmhb0
jmhb0

AI & ML interests

Vision-language models, evaluation, biology applications

Recent Activity

upvoted a paper 13 days ago

Learning to Self-Verify Makes Language Models Better Reasoners

upvoted a paper 14 days ago

UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

upvoted a paper 19 days ago

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

View all activity

Organizations

upvoted a paper 13 days ago

Learning to Self-Verify Makes Language Models Better Reasoners

Paper • 2602.07594 • Published 25 days ago • 1

upvoted a paper 14 days ago

UniT: Unified Multimodal Chain-of-Thought Test-time Scaling

Paper • 2602.12279 • Published 20 days ago • 19

upvoted a paper 19 days ago

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Paper • 2407.12883 • Published Jul 16, 2024 • 13

upvoted a paper 21 days ago

Improving Data and Reward Design for Scientific Reasoning in Large Language Models

Paper • 2602.08321 • Published 24 days ago • 41

upvoted 2 papers 23 days ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published Jan 9 • 47

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 108

upvoted 2 papers 26 days ago

Cognitive Foundations for Reasoning and Their Manifestation in LLMs

Paper • 2511.16660 • Published Nov 20, 2025 • 11

BABE: Biology Arena BEnchmark

Paper • 2602.05857 • Published 27 days ago • 10

upvoted a paper 29 days ago

PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR

Paper • 2601.18207 • Published Jan 26 • 19

upvoted a paper about 1 month ago

Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

Paper • 2601.19325 • Published Jan 27 • 79

upvoted a collection about 1 month ago

PaperSearchQA

Collection

Data and corpora for the paper "PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR". The main dataset is `PaperSearchQA`. • 5 items • Updated 29 days ago • 4

upvoted a paper 6 months ago

End-to-End Agentic RAG System Training for Traceable Diagnostic Reasoning

Paper • 2508.15746 • Published Aug 21, 2025 • 14

upvoted a paper 7 months ago

Prompt Orchestration Markup Language

Paper • 2508.13948 • Published Aug 19, 2025 • 48

upvoted 2 papers 8 months ago

Preserving Privacy, Increasing Accessibility, and Reducing Cost: An On-Device Artificial Intelligence Model for Medical Transcription and Note Generation

Paper • 2507.03033 • Published Jul 3, 2025 • 10

SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning

Paper • 2506.21355 • Published Jun 26, 2025 • 10

upvoted 3 papers 10 months ago

upvoted 2 papers 11 months ago

Self-Steering Language Models

Paper • 2504.07081 • Published Apr 9, 2025 • 19

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7, 2025 • 205

James Burgess

AI & ML interests

Recent Activity

Organizations

jmhb's activity