AI & ML interests
None yet
Organizations
None yet
XuHuang/inpo_iter2_bs256_lr4e-7_aug28
9B • Updated • 1
XuHuang/gemma-2-9b-it_off_policy_tdpo_stage_2_bs_256_lr_2e-7
9B • Updated • 1
XuHuang/gemma-2-9b-it_inpo_stage_1_bs_256_lr_8e-7
Updated
9B • Updated • 1
XuHuang/gemma-2-9b-it_DPO
9B • Updated • 1
9B • Updated • 1
XuHuang/simpo_onPolicy_iter1_aug23
9B • Updated • 1
XuHuang/simpo_inpo_op_iter3_aug22
9B • Updated • 1
XuHuang/simpo_inpo_op_iter2_aug22
9B • Updated • 1
XuHuang/simpo_inpo_iter3_aug_20
9B • Updated XuHuang/simpo_inpo_iter2_aug_19
9B • Updated • 1