Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
gaokerena
/
gaokerena-r1.0
like
1
Follow
gaokerena
7
Text Generation
PEFT
Safetensors
Transformers
doi:10.57967/hf/7394
dpo
lora
trl
conversational
arxiv:
2510.20059
Model card
Files
Files and versions
xet
Community
Use this model
main
gaokerena-r1.0
/
training_args.bin
Commit History
Upload folder using huggingface_hub
cc3cf3c
verified
sadrahakim
commited on
Sep 5, 2025