koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_think Text Generation • 4B • Updated 2 days ago • 46
koutch/paper_llama_llama3.1-8b_train_sft_train_no_think Text Generation • 8B • Updated 2 days ago • 51
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_no_think Text Generation • 4B • Updated 2 days ago • 51
koutch/paper_qwen_qwen3-instruct-4b_train_sft_train_para Text Generation • 4B • Updated 2 days ago • 56
koutch/paper_qwen_qwen3-instruct-4b_train_sft_all_train_think Text Generation • 4B • Updated 2 days ago • 46