seongyun_lee

Seongyun

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams

upvoted a paper about 1 month ago

Privasis: Synthesizing the Largest "Public" Private Dataset from Scratch

upvoted a paper about 2 months ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Organizations

Seongyun's models (52)

Seongyun/qwen3-8b-thinking-rare-ckpt-100

8B • Updated Dec 14, 2025

Seongyun/qwen3-4b-thinking-rare-ckpt-109

4B • Updated Dec 11, 2025

Seongyun/qwen3-4b-thinking-rl-ckpt-109

4B • Updated Dec 11, 2025

Seongyun/qwen3-4b-thinking-rl-ckpt60

4B • Updated Dec 10, 2025 • 2

Seongyun/exaone_deep_2.4b_non_math_only_mcqa_format

Updated Apr 2, 2025

Seongyun/math_only_mcqa_format

2B • Updated Mar 29, 2025 • 1

Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_190k_6

Updated Mar 10, 2025

Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_190k_5

2B • Updated Mar 10, 2025

Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_190k_4

2B • Updated Mar 8, 2025

Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_mcqa_repetition_penalty_2

Text Generation • 2B • Updated Mar 8, 2025

Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_mcqa_repetition_penalty

2B • Updated Mar 7, 2025

Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_math_2

Text Generation • 2B • Updated Mar 5, 2025

Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_math

2B • Updated Mar 4, 2025

Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_190k_3

2B • Updated Mar 4, 2025

Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_190k_2

2B • Updated Mar 3, 2025

Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_sample_190k

2B • Updated Mar 2, 2025

Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_code_code2

Updated Mar 1, 2025

Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_code_code

Text Generation • 2B • Updated Mar 1, 2025

Seongyun/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_pref_repetition_penalty

Text Generation • 2B • Updated Mar 1, 2025 • 2

Seongyun/ds_r1_distill_qwen_1.5b_grpo_pref

Updated Feb 28, 2025

Seongyun/output

Updated Feb 7, 2025

Seongyun/OLMo300m-Fineweb-Edu-100bt-latest

Updated Nov 19, 2024