On the Optimal Reasoning Length for RL-Trained Language Models Paper • 2602.09591 • Published Feb 10 • 6
MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources Paper • 2509.25531 • Published Sep 29, 2025 • 10
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks Paper • 2508.18672 • Published Aug 26, 2025 • 10
Rewriting Pre-Training Data Boosts LLM Performance in Math and Code Paper • 2505.02881 • Published May 5, 2025 • 7
EmoNet-Face: An Expert-Annotated Benchmark for Synthetic Emotion Recognition Paper • 2505.20033 • Published May 26, 2025 • 4