Asaf Yehudai's picture

Asaf Yehudai

Asaf-Yehudai

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 7 hours ago

How Well Do Agentic Skills Work in the Wild: Benchmarking LLM Skill Usage in Realistic Settings

upvoted a paper 3 days ago

Self-Execution Simulation Improves Coding Models

upvoted a paper 3 days ago

MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome

View all activity

Organizations

Papers 8

arxiv:2503.16416

arxiv:2503.06573

arxiv:2502.08130

arxiv:2412.09569

spaces 1

Model Memory Utility

models 0

None public yet

datasets 3

Asaf-Yehudai/mt_bench_gpt_4_judge_single_score

Viewer • Updated May 2, 2024 • 5.44k • 8 • 1

Asaf-Yehudai/HelpSteer_prompt_per_row

Viewer • Updated Dec 3, 2023 • 10.4k • 9

Asaf-Yehudai/xnli_concatenated

Viewer • Updated Mar 30, 2023 • 6M • 14 • 1