Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
xinpeng
/
big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-rm-loophole-rerun-global_step_185
like
0
Safetensors
qwen2
Model card
Files
Files and versions
xet
Community
main
big-math-hard-tiny-qwen2.5-7b-instruct-og-rloo-implicit-cheat-rm-loophole-rerun-global_step_185
Commit History
Upload global_step_185 checkpoint
503f1c9
verified
xinpeng
commited on
Oct 9, 2025
initial commit
c83b69f
verified
xinpeng
commited on
Oct 9, 2025