Instructions to use inclusionAI/LLaDA2.0-mini-preview with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use inclusionAI/LLaDA2.0-mini-preview with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="inclusionAI/LLaDA2.0-mini-preview", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoModelForCausalLM
model = AutoModelForCausalLM.from_pretrained("inclusionAI/LLaDA2.0-mini-preview", trust_remote_code=True, dtype="auto")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use inclusionAI/LLaDA2.0-mini-preview with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "inclusionAI/LLaDA2.0-mini-preview"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "inclusionAI/LLaDA2.0-mini-preview",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/inclusionAI/LLaDA2.0-mini-preview

SGLang

How to use inclusionAI/LLaDA2.0-mini-preview with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "inclusionAI/LLaDA2.0-mini-preview" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "inclusionAI/LLaDA2.0-mini-preview",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "inclusionAI/LLaDA2.0-mini-preview" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "inclusionAI/LLaDA2.0-mini-preview",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use inclusionAI/LLaDA2.0-mini-preview with Docker Model Runner:
```
docker model run hf.co/inclusionAI/LLaDA2.0-mini-preview
```

Unofficial LLaDA2 Evaluation based on lm-eval

by Lucasoppem - opened Nov 24, 2025

Discussion

Lucasoppem

Nov 24, 2025

Hi everyone, I'm also doing research on dLLM.

Here is an unofficial LLaDA2 Evaluation based on lm-eval, which has been tested on the A100. I hope it can be helpful.
Open source address: https://github.com/preordinary/LLaDA2.

Issues and discoveries:

Parameter changes: The steps parameter definition in version 2.0 is different from 1.0; it refers to the number of steps within a block. Please pay attention when reproducing this issue.
Length sensitivity: I tested lengths of 256/512/1024 and found that the accuracy dropped significantly at length 256 (HumanEval was only 5.5). I suspect this is because the thought chain in version 2.0 is longer, and a short window can easily lead to truncated answers.

The code is relatively simple. Welcome everyone to try it out, submit issues or pull requests, and feel free to share it! If you find it useful, please give it a star ⭐️! Thank you!

utdawn

inclusionAI org Nov 25, 2025

Thanks so much for this detailed evaluation and for sharing your findings!

Lucasoppem

Nov 26, 2025

Dear Official Author,

Could I request to have this GitHub link added to the readme file? I hope more people can access it.

If there's anything I can help you with (such as merging it to your repository), I will do my best to assist~ Thank you for your time!

m1ngcheng

inclusionAI org Dec 1, 2025

Dear Official Author,

Could I request to have this GitHub link added to the readme file? I hope more people can access it.

If there's anything I can help you with (such as merging it to your repository), I will do my best to assist~ Thank you for your time!

Yes, any pull request is welcome~ thank you for contributing.

Lucasoppem

Dec 7, 2025

Dear Official Author,

Could I request to have this GitHub link added to the readme file? I hope more people can access it.

If there's anything I can help you with (such as merging it to your repository), I will do my best to assist~ Thank you for your time!

Yes, any pull request is welcome~ thank you for contributing.

Dear Official Author,

Can you add me as a member of inclusionAI? So that I can transfer this repository to the organization~

If there is any other way to make a pull request, please let me know. I am willing to PR this repository. Thank you for your patience!

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment