ToolRM: Towards Agentic Tool-Use Reward Modeling
-
One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning
Paper • 2510.26167 • Published • 1 -
RioLee/ToolRM-Gen-Qwen3-4B-Thinking-2507
Text Generation • 4B • Updated • 11 -
RioLee/ToolPref-Pairwise-30K
Viewer • Updated • 89.5k • 84 • 2 -
RioLee/TRBench-BFCL
Viewer • Updated • 11.9k • 24 • 1