Hugging Face
Jerry Pan
JERRYPAN617
0 followers · 12 following
https://jerrypan617.github.io/
jerrypan617
AI & ML interests
RLHF, Retrieval-Augmented Multimodal Understanding...
Recent Activity
upvoted a paper 27 days ago
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
liked a dataset 2 months ago
PKU-Alignment/PKU-SafeRLHF-single-dimension
liked a dataset 3 months ago
PKU-Alignment/PKU-SafeRLHF
Organizations
JERRYPAN617's activity
liked a dataset 2 months ago
PKU-Alignment/PKU-SafeRLHF-single-dimension • Viewer • Updated Jun 14, 2024 • 81.1k • 129 • 3
liked 4 datasets 3 months ago
PKU-Alignment/PKU-SafeRLHF • Viewer • Updated Oct 18, 2024 • 164k • 7.53k • 174
HuggingFaceH4/ultrafeedback_binarized • Viewer • Updated Oct 16, 2024 • 187k • 6.41k • 320
LooksJuicy/ruozhiba • Viewer • Updated Apr 9, 2024 • 1.5k • 149 • 315
Karsh-CAI/btfChinese-DPO-small • Viewer • Updated Apr 7, 2024 • 5k • 169 • 22
liked 2 models 3 months ago
Qwen/Qwen2.5-1.5B-Instruct • Text Generation • 2B • Updated Sep 25, 2024 • 6.13M • 608
JERRYPAN617/HH-BTRewardModel-roberta • Reinforcement Learning • 0.1B • Updated Nov 13, 2025 • 1 • 1
liked 7 datasets 3 months ago
ys-zong/VLGuard • Viewer • Updated Jan 19, 2025 • 3k • 308 • 13
PKU-Alignment/MM-SafetyBench • Viewer • Updated Sep 19, 2024 • 6.72k • 1.08k • 3
saferlhf-v/BeaverTails-V • Viewer • Updated Mar 8, 2025 • 30.4k • 434 • 7
PKU-Alignment/PKU-SafeRLHF-V • Viewer • Updated Mar 25, 2025 • 30.4k • 144 • 5
Moemu/Muice-Dataset • Viewer • Updated about 16 hours ago • 3.74k • 179 • 49
liuhaotian/LLaVA-Instruct-150K • Preview • Updated Jan 3, 2024 • 3.08k • 570
MMMU/MMMU • Viewer • Updated Sep 19, 2024 • 11.6k • 60.8k • 311
liked a Space 3 months ago
Sleeping
1
Qwen2.5 Psydoctor Demo
📈
1
A LoRA adapter fine-tuned from the Qwen2.5-1.5B-Instruct model, built specifically for psychologist dialogue scenarios.
liked 2 datasets 3 months ago
FreedomIntelligence/medical-o1-reasoning-SFT • Viewer • Updated Apr 22, 2025 • 90.1k • 4.71k • 1.06k
nvidia/Nemotron-CC-Math-v1 • Viewer • Updated Dec 23, 2025 • 190M • 6.02k • 63
liked a model 3 months ago
JERRYPAN617/qwen2.5-lora-psydoctor • Text Generation • Updated Oct 25, 2025 • 5 • 1
liked 2 datasets 4 months ago
hiyouga/geometry3k • Viewer • Updated Apr 14, 2025 • 3k • 24k • 66
Anthropic/hh-rlhf • Viewer • Updated May 26, 2023 • 169k • 20.1k • 1.66k