This collection contains my GPT-3 Small implementations. All models here share the same architecture and are the same model at different training stages.
Kyryll Kochkin
k050506koch
AI & ML interests
LLMs. I am obsessed with them. I have worked with dense models so far and plan to experiment with MoEs.
Recent Activity
Updated a Space 14 days ago: k050506koch/gpt3-dev-api
Updated a model 14 days ago: k050506koch/GPT3-dev-350m-2805
Updated a model 14 days ago: k050506koch/GPT3-dev-125m-0104
Organizations
None yet