AgentPublic/evalap-nousresearchmeta-llama-3-8b-instruct_mfs-29 Viewer • Updated about 20 hours ago • 195
AgentPublic/evalap-meta-llama_llama-4-scout-17b-16e-instruct_mfs-32 Viewer • Updated about 20 hours ago • 312
AgentPublic/evalap-mfs_with_judge_meta-llamallama-33-70b-instruct_v5-42 Viewer • Updated about 20 hours ago • 741
AgentPublic/evalap-mfs_with_judge_deepseek-aideepseek-r1-distill-qwen-32b_v5-52 Viewer • Updated about 20 hours ago • 741
AgentPublic/evalap-mfs_with_judge_mistralaimistral-small-31-24b-instruct-2503_v5-56 Viewer • Updated about 20 hours ago • 702
AgentPublic/evalap-albert-brut-mfs_questions_v01-v1-10-25-73 Viewer • Updated about 20 hours ago • 390
AgentPublic/evalap-albert-rag-mfs_questions_v01-v4-10-25-75 Viewer • Updated about 20 hours ago • 390
AgentPublic/evalap-analyse_prompt_test-prompt_20251006_120630-77 Viewer • Updated about 20 hours ago • 184
AgentPublic/evalap-analyse_prompt_assistant-ia_20251006_175310-78 Viewer • Updated about 20 hours ago • 92
AgentPublic/evalap-analyse_prompt_test-assistant-ia_20251010_171320-80 Viewer • Updated about 20 hours ago • 92
AgentPublic/evalap-compare-albert-small-with-apertus-small-models-82 Viewer • Updated about 20 hours ago • 340
AgentPublic/evalap-analyse_prompt_prompt_no_20251104_113432-84 Viewer • Updated about 20 hours ago • 138
AgentPublic/evalap-analyse_prompt_prompt_no_20251104_114039-85 Viewer • Updated about 20 hours ago • 138
AgentPublic/evalap-assistant-lasuite-tools-comparison-v2-101 Viewer • Updated about 20 hours ago • 507
AgentPublic/evalap-legalbenchrag-evaluation-norag-no-system-prompt-102 Viewer • Updated about 20 hours ago • 1k
AgentPublic/evalap-comparing-albert-api-models-v11-12-2025-with-sysprompt-108 Viewer • Updated about 20 hours ago • 673
AgentPublic/evalap-20251215_141158_launch_test_evaluation-112 Viewer • Updated about 20 hours ago • 4
AgentPublic/evalap-comparing-openweight-with-openaigpt-oss-120b-113 Viewer • Updated about 20 hours ago • 1.02k
AgentPublic/evalap-comparing-openweight-with-patronusaiglider-114 Viewer • Updated about 20 hours ago • 641
AgentPublic/evalap-comparing-openweight-with-atlaaiselene-1-mini-llama-31-8b-115 Viewer • Updated about 20 hours ago • 811
AgentPublic/evalap-mediatechs-legi-chunking-evaluation-v1-116 Viewer • Updated about 20 hours ago • 2.25k
AgentPublic/evalap-mediatechs-service-public-travail-emploi-chunking-evaluation-v1-117 Viewer • Updated about 20 hours ago • 2.98k