arxiv:2502.05171
John Kirchenbauer
jwkirchenbauer
AI & ML interests
Deep Learning in/and NLP
Organizations
models 59
jwkirchenbauer/Qwen3-4B-Inst-2507-MTP
4B • Updated • 11 • 1
jwkirchenbauer/L3-1-8B-Magpie-MTP
8B • Updated • 8
jwkirchenbauer/daint_prod_q4_32N128n_12cacdc9_latest
Text Generation • 8B • Updated • 5
jwkirchenbauer/debug_metamath_full_rand_k2-8_ex_valk_baseline_latest
Text Generation • 8B • Updated • 4
jwkirchenbauer/daint_prod_q4_128N512n_fd7261ea
Text Generation • 8B • Updated • 6
jwkirchenbauer/daint_prod_q4_32N128n_e3c83f91_latest
Text Generation • 8B • Updated • 4
jwkirchenbauer/daint_prod_q4_128N512n_fd7261ea_latest
Text Generation • 8B • Updated • 2
jwkirchenbauer/pythia-1.4b-retr-32k_w_meta-00100000-phase2
Updated
jwkirchenbauer/pythia-1.4b-retr-32k_w_meta-00120000-phase2
Updated
jwkirchenbauer/pythia-1.4b-retr-32k_w_meta-phase1
Updated
datasets 8
jwkirchenbauer/metamathqa-grouped-split-l3-magpie
Viewer • Updated • 395k • 14
jwkirchenbauer/fictionalqa_reformatted_triviaqa
Viewer • Updated • 16.4k • 18
jwkirchenbauer/fictionalqa_training_splits
Viewer • Updated • 219k • 1.14k
jwkirchenbauer/fictionalqa
Viewer • Updated • 39.2k • 213 • 2
jwkirchenbauer/metamathqa-grouped-split
Viewer • Updated • 395k • 137
jwkirchenbauer/fictional_qa_03-19-25_training_splits
Viewer • Updated • 107k • 4
jwkirchenbauer/trivia_qa_03-19-25_training_splits
Viewer • Updated • 16.4k • 5
jwkirchenbauer/fictional_qa_03-19-25_processed_flat
Viewer • Updated • 31.7k • 5