---
license: mit
language:
- en
- ja
- ko
- vi
base_model:
- Qwen/Qwen2.5-1.5B
pipeline_tag: text-generation
library_name: transformers
---
# NxMobileLM-1.5B-SFT
## Model Description
`NxMobileLM-1.5B-SFT` is a fine-tuned version of the base model `Qwen2.5-1.5B`, optimized for mobile and edge applications. This model has been trained on proprietary instruction datasets curated to enhance performance in natural language understanding and generation tasks tailored to specific applications.
### Key Features:
- **Base Model:** Qwen2.5-1.5B
- **Parameter Count:** 1.5 billion
- **Fine-tuning Objective:** Supervised fine-tuning (SFT) on instruction datasets
- **Specialization:** Lightweight, efficient performance for mobile environments
- **Multilingual Support:** Handles multiple languages effectively, enabling robust cross-lingual capabilities for diverse applications
## Model Details
### Training Data
The model was fine-tuned using a proprietary dataset designed for diverse instruction-following tasks, including question answering, summarization, and dialogue. The dataset emphasizes:
- Multi-domain generalization
- Task-specific instruction understanding
- Multilingual coverage: samples from several major languages to enhance cross-lingual understanding
### Training Configuration
- **Framework:** PyTorch
- **Optimizer:** AdamW
- **Learning Rate:** 5e-5
- **Batch Size:** 128
- **Epochs:** 3
- **Mixed Precision:** FP16
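The exact training stack is not public, but the configuration above could be expressed with the TRL library's `SFTTrainer` roughly as follows. This is a hypothetical sketch: the dataset file, output directory, and the split of the batch size 128 into per-device batch size and gradient accumulation are all illustrative placeholders.

```python
# Hypothetical sketch of the reported SFT configuration using TRL.
# The actual training code and dataset are proprietary; paths and the
# batch-size split below are placeholders for illustration only.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

config = SFTConfig(
    output_dir="nxmobilelm-1.5b-sft",
    learning_rate=5e-5,              # reported learning rate
    num_train_epochs=3,              # reported epochs
    per_device_train_batch_size=16,
    gradient_accumulation_steps=8,   # 16 * 8 = effective batch size 128
    fp16=True,                       # reported mixed precision
    optim="adamw_torch",             # reported optimizer (AdamW)
)

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-1.5B",       # reported base model
    train_dataset=load_dataset("json", data_files="instructions.jsonl")["train"],
    args=config,
)
trainer.train()
```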
### Evaluation
The model was evaluated on a variety of benchmarks, demonstrating:
- **High Accuracy:** Achieves strong performance across general natural language tasks.
- **Efficiency:** Optimized for low-latency inference on edge devices.
- **Multilingual Competence:** Strong performance across multiple languages, making it suitable for global applications.
### Performance Comparison
#### Open-LLM Leaderboard
On January 15, 2025, NxMobileLM-1.5B-SFT ranked among the top 10 edge-device models with fewer than 3 billion parameters, and first among models with fewer than 2 billion parameters, on the [OpenLLM leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard#/?params=0%2C3).
![image/png](https://cdn-uploads.huggingface.co/production/uploads/5ee1b417636bdb3834e2da19/6877IlKJN7agg-6Zt--ZY.png)
#### P-MMEval
To evaluate the model's multilingual capabilities, we ran several benchmarks across three languages: English (en), Japanese (ja), and Vietnamese (vi). For detailed benchmark information, refer to [P-MMEval](https://huggingface.co/datasets/Qwen/P-MMEval).
| Benchmark | Llama-3.2-1B-Instruct | SmolLM2-1.7B-Instruct | Qwen2.5-1.5B-Instruct | NxMobileLM-1.5B-SFT |
|------------------|------------------------|------------------------|------------------------|---------------------|
| mifeval-en       | 44.79                  | 43.75                  | 50.00                  | **57.29**           |
| mifeval-ja       | 22.92                  | 23.96                  | 29.17                  | **30.21**           |
| mifeval-vi       | 30.21                  | 25.00                  | 28.12                  | **46.88**           |
| mmmlu-EN-US      | 35.25                  | 42.50                  | **45.50**              | 45.25               |
| mmmlu-JA-JP      | 31.50                  | 26.25                  | 36.00                  | **41.00**           |
| mmmlu-VI-VT      | 22.75                  | 22.25                  | **39.00**              | 38.00               |
| xnli-en          | 35.83                  | 35.83                  | 59.17                  | **66.67**           |
| xnli-ja          | 34.17                  | 35.83                  | 52.50                  | **57.50**           |
| xnli-vi          | 37.50                  | 34.17                  | 45.83                  | **55.83**           |
| **Average**      | 32.21                  | 31.61                  | 42.93                  | **48.07**           |
#### LightEval
The table below compares `NxMobileLM-1.5B-SFT` with other instruction-tuned models on various benchmarks. Results were obtained with the [lighteval](https://github.com/huggingface/lighteval) evaluation framework; baseline numbers are taken from the [SmolLM2-1.7B-Instruct model card](https://huggingface.co/HuggingFaceTB/SmolLM2-1.7B-Instruct):
| Metric | SmolLM2-1.7B-Instruct | Llama-1B-Instruct | Qwen2.5-1.5B-Instruct | SmolLM1-1.7B-Instruct | NxMobileLM-1.5B-SFT |
|------------------------------|-----------------------|-------------------|-----------------------|-----------------------|-----------------|
| IFEval (Average prompt/inst) | 56.7 | 53.5 | 47.4 | 23.1 | **64.2** |
| HellaSwag | **66.1** | 56.1 | 60.9 | 55.5 | 63.57 |
| ARC (Average) | **51.7** | 41.6 | 46.2 | 43.7 | 45.21 |
| PIQA | **74.4** | 72.3 | 73.2 | 71.6 | 72.91 |
| MMLU-Pro (MCF) | 19.3 | 12.7 | **24.2** | 11.7 | 15.43 |
| BBH (3-shot) | 32.2 | 27.6 | **35.3** | 25.7 | 31.44 |
| GSM8K (5-shot) | 48.2 | 26.8 | 42.8 | 4.62 | **59.51** |
| **Average** | 49.8 | 41.5 | 47.1 | 33.7 | **50.3** |
### Limitations
While `NxMobileLM-1.5B-SFT` excels in many areas, it may not perform well on tasks outside the scope of the fine-tuned dataset. Biases inherent in the training data may also affect outcomes.
## Intended Use
`NxMobileLM-1.5B-SFT` is designed for use in:
- Mobile virtual assistants
- Real-time language-based applications
- Compact edge AI solutions
- Multilingual scenarios requiring cross-lingual communication and understanding
**Misuse Warning:** The model is not intended for use in generating harmful, biased, or illegal content.
## How to Use
Here is a sample code snippet to load and use the model:
```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the model and tokenizer
model_name = "NTQAI/NxMobileLM-1.5B-SFT"
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# Example usage: cap the number of generated tokens rather than total length
inputs = tokenizer("What is the capital of Vietnam?", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```
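For conversational use, Qwen-family models typically expect a ChatML-style prompt, which `tokenizer.apply_chat_template` produces automatically. As a rough illustration of that format (an assumption based on the Qwen2.5 base model, not a documented detail of this checkpoint), the prompt can be sketched in plain Python:

```python
# Illustrative sketch of the ChatML-style prompt format used by Qwen-family
# models. In practice, prefer tokenizer.apply_chat_template(messages,
# tokenize=False, add_generation_prompt=True) over hand-building the string.
def build_chatml_prompt(messages):
    parts = []
    for m in messages:
        parts.append(f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n")
    parts.append("<|im_start|>assistant\n")  # cue the model to respond
    return "".join(parts)

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is the capital of Vietnam?"},
]
prompt = build_chatml_prompt(messages)
print(prompt)
```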
## Citation
If you use this model in your research, please cite it as:
```
@misc{NxMobileLM-1.5B-SFT,
title={NxMobileLM-1.5B-SFT},
author={NTQAI},
year={2025},
url={https://huggingface.co/NTQAI/NxMobileLM-1.5B-SFT},
}
```
## License
This model is released under the MIT License.
## Contact
For questions or issues, please contact us via our website: https://ntq.ai