Adanato's Collections
qwen25_3b_instruct rank_only
llama3_8b_instruct rank_only
llama32_1b_instruct rank_only
mistral_nemo rank_only
qwen25_3b rank_only
llama3_8b_instruct BERT baseline
mistral_nemo BERT baseline
qwen25_3b_instruct BERT baseline
qwen25_3b BERT baseline
llama32_1b_instruct BERT baseline
mistral_nemo PPL baseline
llama3_8b_instruct PPL baseline
qwen25_3b_instruct PPL baseline
llama32_1b_instruct PPL baseline
qwen25_3b PPL baseline
fykorch-test
Qwen2.5-3B qwen2.5+qwen3 rank_only k=3 clusters
Llama-3.2-1B-Instruct qwen2.5+qwen3 rank_only k=3 clusters
Llama-3.2-1B-Instruct qwen2.5+qwen3 rank_only k=2 clusters
Qwen2.5-3B qwen2.5+qwen3 rank_only k=2 clusters
Qwen2.5-3B-Instruct qwen2.5+qwen3 rank_only k=2 clusters
Mistral-Nemo-Instruct-2407 qwen2.5+qwen3 diff_only clusters
Qwen2.5-3B qwen2.5+qwen3 diff_only clusters
Qwen2.5-3B-Instruct qwen2.5+qwen3 diff_only clusters
Meta-Llama-3-8B-Instruct qwen2.5+qwen3 diff_only clusters
Mistral-Nemo-Instruct-2407 qwen2.5+qwen3 rank_only clusters
Llama-3.2-1B-Instruct qwen2.5+qwen3 diff_only clusters
Qwen2.5-3B-Instruct qwen2.5+qwen3 rank_only clusters
Qwen2.5-3B qwen2.5+qwen3 rank_only clusters
Llama-3.2-1B-Instruct qwen2.5+qwen3 rank_only clusters
qwen25_qwen3_diff_only_k6
Llama-3.2-1B-Instruct qwen2.5+qwen3 rank_diff clusters
Qwen2.5-3B qwen2.5+qwen3 rank_diff clusters
Qwen2.5-3B-Instruct qwen2.5+qwen3 rank_diff clusters
Mistral-Nemo-Instruct-2407 qwen2.5+qwen3 rank_diff clusters
Meta-Llama-3-8B-Instruct e1 10model rank k5
Meta-Llama-3-8B-Instruct e1 10model rank k4
Meta-Llama-3-8B-Instruct e1 10model rank k3
Meta-Llama-3-8B-Instruct e1 10model rank k2
Meta-Llama-3-8B-Instruct qwen2.5+qwen3 rank_diff clusters
Meta-Llama-3-8B-Instruct qwen2.5+gemma rank_diff clusters
Meta-Llama-3-8B-Instruct qwen3+llama3 rank_diff clusters
Meta-Llama-3-8B-Instruct qwen3+gemma rank_diff clusters
Meta-Llama-3-8B-Instruct qwen3+llama3 sextiles
Meta-Llama-3-8B-Instruct qwen3+gemma sextiles
Meta-Llama-3-8B-Instruct qwen2.5+gemma sextiles
Meta-Llama-3-8B-Instruct qwen3+gemma clusters
Meta-Llama-3-8B-Instruct qwen2.5+gemma clusters
Meta-Llama-3-8B-Instruct e1 fykcluster rank 3d
Meta-Llama-3-8B-Instruct e1 fykclusters llama3 sextiles
Meta-Llama-3-8B-Instruct e1 fykclusters qwen3 sextiles
Meta-Llama-3-8B-Instruct e1 fykclusters qwen3
Meta-Llama-3-8B-Instruct e3 fykclusters k4
Fykclusters
Meta-Llama-3-8B-Instruct e1 fykclusters k5
Meta-Llama-3-8B-Instruct e1 fykclusters k4
Meta-Llama-3-8B-Instruct e1 fykclusters k6
Meta-Llama-3-8B-Instruct e5 fykclusters k6
FYK Baselines (BERT)
FYK Clusters (no Qwen3)
FYK Clusters (no Gemma)
FYK Dataset (log-loss)
FYKGRID
FYK Grid Dataset
FYK SFT CLUSTER
FYK Dataset
Data Val
Pretrain Similarity
FYK SFT Models
Meta-Llama-3-8B-Instruct e1 fykclusters k4
Updated Jan 14
This collection has no items.