Instructions to use inclusionAI/LLaDA2.0-mini-preview with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use inclusionAI/LLaDA2.0-mini-preview with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="inclusionAI/LLaDA2.0-mini-preview", trust_remote_code=True) messages = [ {"role": "user", "content": "Who are you?"}, ] pipe(messages)# Load model directly from transformers import AutoModelForCausalLM model = AutoModelForCausalLM.from_pretrained("inclusionAI/LLaDA2.0-mini-preview", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use inclusionAI/LLaDA2.0-mini-preview with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "inclusionAI/LLaDA2.0-mini-preview" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "inclusionAI/LLaDA2.0-mini-preview", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker
docker model run hf.co/inclusionAI/LLaDA2.0-mini-preview
- SGLang
How to use inclusionAI/LLaDA2.0-mini-preview with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "inclusionAI/LLaDA2.0-mini-preview" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "inclusionAI/LLaDA2.0-mini-preview", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "inclusionAI/LLaDA2.0-mini-preview" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/chat/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "inclusionAI/LLaDA2.0-mini-preview", "messages": [ { "role": "user", "content": "What is the capital of France?" } ] }' - Docker Model Runner
How to use inclusionAI/LLaDA2.0-mini-preview with Docker Model Runner:
docker model run hf.co/inclusionAI/LLaDA2.0-mini-preview
Unofficial LLaDA2 Evaluation based on lm-eval
Hi everyone, I'm also doing research on dLLM.
Here is an unofficial LLaDA2 Evaluation based on lm-eval, which has been tested on the A100. I hope it can be helpful.
Open source address: https://github.com/preordinary/LLaDA2.
Issues and discoveries:
- Parameter changes: The
stepsparameter definition in version 2.0 is different from 1.0; it refers to the number of steps within a block. Please pay attention when reproducing this issue. - Length sensitivity: I tested lengths of 256/512/1024 and found that the accuracy dropped significantly at length 256 (HumanEval was only 5.5). I suspect this is because the thought chain in version 2.0 is longer, and a short window can easily lead to truncated answers.
The code is relatively simple. Welcome everyone to try it out, submit issues or pull requests, and feel free to share it! If you find it useful, please give it a star βοΈ! Thank you!
Thanks so much for this detailed evaluation and for sharing your findings!
Dear Official Author,
Could I request to have this GitHub link added to the readme file? I hope more people can access it.
If there's anything I can help you with (such as merging it to your repository), I will do my best to assist~ Thank you for your time!
Dear Official Author,
Could I request to have this GitHub link added to the readme file? I hope more people can access it.
If there's anything I can help you with (such as merging it to your repository), I will do my best to assist~ Thank you for your time!
Yes, any pull request is welcome~ thank you for contributing.
Dear Official Author,
Could I request to have this GitHub link added to the readme file? I hope more people can access it.
If there's anything I can help you with (such as merging it to your repository), I will do my best to assist~ Thank you for your time!
Yes, any pull request is welcome~ thank you for contributing.
Dear Official Author,
Can you add me as a member of inclusionAI? So that I can transfer this repository to the organization~
If there is any other way to make a pull request, please let me know. I am willing to PR this repository. Thank you for your patience!