sayantan0013/ultrafeedback-binarized-preferences-cleaned_math-stack_Qwen3-0_ramp • Text Generation • 0.6B • Updated Jun 9, 2025 • 1
kowndinya23/ultrafeedback_binarized-alpaca-llama-3-3b-2-epochs-alpha-0.4-beta-0.6-2-epochs • Text Generation • 3B • Updated Jun 9, 2025 • 1
lindsaybordier/Qwen3-0.6B-DPO_not-robust_final-dataset_acc4_beta0.07 • Text Generation • 0.6B • Updated Jun 10, 2025 • 1
kowndinya23/ultrafeedback_binarized-alpaca-llama-3-3b-2-epochs-alpha-0.6-beta-0.6-2-epochs • Text Generation • 3B • Updated Jun 10, 2025 • 1
lindsaybordier/Qwen3-0.6B-DPO_not-robust_final-dataset_acc4_beta0.10 • Text Generation • 0.6B • Updated Jun 10, 2025 • 1
kowndinya23/ultrafeedback_binarized-alpaca-llama-3-3b-2-epochs-alpha-0.8-beta-0.6-2-epochs • Text Generation • 3B • Updated Jun 10, 2025 • 1
lindsaybordier/Qwen3-0.6B-DPO_not-robust_final-dataset_acc4_beta0.13 • Text Generation • 0.6B • Updated Jun 10, 2025 • 1
kowndinya23/ultrafeedback_binarized-alpaca-llama-3-3b-2-epochs-alpha-1-beta-0.6-2-epochs • Text Generation • 3B • Updated Jun 10, 2025 • 2