Jerry Pan

JERRYPAN617

https://jerrypan617.github.io/

jerrypan617

AI & ML interests

RLHF, Retrieval-Augmented Multimodal Understanding...

Recent Activity

upvoted a paper 27 days ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

liked a dataset 2 months ago

PKU-Alignment/PKU-SafeRLHF-single-dimension

liked a dataset 3 months ago

PKU-Alignment/PKU-SafeRLHF

View all activity

Organizations

liked a dataset 2 months ago

PKU-Alignment/PKU-SafeRLHF-single-dimension

Viewer • Updated Jun 14, 2024 • 81.1k • 129 • 3

liked 4 datasets 3 months ago

liked 2 models 3 months ago

Qwen/Qwen2.5-1.5B-Instruct

Text Generation • 2B • Updated Sep 25, 2024 • 6.13M • • 608

JERRYPAN617/HH-BTRewardModel-roberta

Reinforcement Learning • 0.1B • Updated Nov 13, 2025 • 1 • 1

liked 7 datasets 3 months ago

ys-zong/VLGuard

Viewer • Updated Jan 19, 2025 • 3k • 308 • 13

PKU-Alignment/MM-SafetyBench

Viewer • Updated Sep 19, 2024 • 6.72k • 1.08k • 3

saferlhf-v/BeaverTails-V

Viewer • Updated Mar 8, 2025 • 30.4k • 434 • 7

PKU-Alignment/PKU-SafeRLHF-V

Viewer • Updated Mar 25, 2025 • 30.4k • 144 • 5

Moemu/Muice-Dataset

Viewer • Updated about 16 hours ago • 3.74k • 179 • 49

liuhaotian/LLaVA-Instruct-150K

Preview • Updated Jan 3, 2024 • 3.08k • 570

MMMU/MMMU

Viewer • Updated Sep 19, 2024 • 11.6k • 60.8k • 311

liked a Space 3 months ago

Qwen2.5 Psydoctor Demo

📈

基于 Qwen2.5-1.5B-Instruct 模型微调的 LoRA 适配器，专门用于心理医生对话场景。

liked 2 datasets 3 months ago

FreedomIntelligence/medical-o1-reasoning-SFT

Viewer • Updated Apr 22, 2025 • 90.1k • 4.71k • 1.06k

nvidia/Nemotron-CC-Math-v1

Viewer • Updated Dec 23, 2025 • 190M • 6.02k • 63

liked a model 3 months ago

JERRYPAN617/qwen2.5-lora-psydoctor

Text Generation • Updated Oct 25, 2025 • 5 • 1

liked 2 datasets 4 months ago

hiyouga/geometry3k

Viewer • Updated Apr 14, 2025 • 3k • 24k • 66

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 20.1k • 1.66k

Jerry Pan

AI & ML interests

Recent Activity

Organizations

JERRYPAN617's activity

Qwen2.5 Psydoctor Demo