entfane/math-genius-7B

entfane committed on Jul 16, 2025

Commit ec071bb (verified) · 1 parent: 625742c

Update README.md

Files changed (1)

README.md +3 -3

@@ -14,7 +14,7 @@ pipeline_tag: text-generation
 <img src="https://huggingface.co/entfane/math_genious-7B/resolve/main/math-genious.png" width="400" height="400"/>
-# Math Genious 7B
+# Math Genius 7B
 This model is a Math Chain-of-Thought fine-tuned version of Mistral 7B v0.3 Instruct model.
@@ -30,7 +30,7 @@ Model was fine-tuned on [entfane/Mixture-Of-Thoughts-Math-No-COT](https://huggin
 from transformers import AutoTokenizer, AutoModelForCausalLM
-model_name = "entfane/math-genious-7B"
+model_name = "entfane/math-genius-7B"
 tokenizer = AutoTokenizer.from_pretrained(model_name)
 model = AutoModelForCausalLM.from_pretrained(model_name)
 messages = [
@@ -48,5 +48,5 @@ The model was evaluated on a randomly sampled subset of 1,000 records from the t
 Math Genius 7B achieved an accuracy of 93.1% in producing the correct final answer under the pass@1 evaluation metric.
 #### AIME
-Math Genious 7B was evaluated on [90 problems from AIME 22, AIME 23, and AIME 24](https://huggingface.co/datasets/AI-MO/aimo-validation-aime).
+Math Genius 7B was evaluated on [90 problems from AIME 22, AIME 23, and AIME 24](https://huggingface.co/datasets/AI-MO/aimo-validation-aime).
 The model has successfully solved 3/90 of the problems.
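
For readers comparing the two figures quoted in the README, the arithmetic can be sketched as below. This is only an illustration, not the author's evaluation script: the helper name `solve_rate` is hypothetical, and the count of 931 correct answers is inferred from the stated 93.1% on 1,000 records rather than reported directly.

```python
def solve_rate(num_correct: int, num_problems: int) -> float:
    """Fraction of problems answered correctly, one attempt per problem."""
    if num_problems <= 0:
        raise ValueError("num_problems must be positive")
    if not 0 <= num_correct <= num_problems:
        raise ValueError("num_correct must be between 0 and num_problems")
    return num_correct / num_problems


# AIME: 3 of 90 problems solved, as stated in the README.
aime_rate = solve_rate(3, 90)

# Sampled test subset: 931 is an inferred count (93.1% of 1,000 records).
subset_rate = solve_rate(931, 1000)

print(f"AIME solve rate: {aime_rate:.1%}")
print(f"Sampled subset accuracy: {subset_rate:.1%}")
```

Expressed this way, 3/90 corresponds to roughly 3.3%, which makes the gap between the sampled subset and the AIME problems easier to see at a glance.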