Instructions for using MiniMaxAI/MiniMax-M2 with libraries, inference providers, notebooks, and local apps.

Libraries

How to use MiniMaxAI/MiniMax-M2 with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="MiniMaxAI/MiniMax-M2", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)
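
The pipeline call above returns a nested structure rather than a plain string. As a minimal sketch, assuming the usual chat-style output of a `text-generation` pipeline (a list with one dict whose `generated_text` holds the conversation with the new assistant turn appended), the reply can be pulled out as follows. The helper name `last_assistant_reply` and the sample data are hypothetical; no model is loaded here.

```python
from typing import Any


def last_assistant_reply(result: list[dict[str, Any]]) -> str:
    """Return the newest reply from a text-generation pipeline result."""
    generated = result[0]["generated_text"]
    # Chat input: generated_text is the message list with the new turn at the end.
    if isinstance(generated, list):
        return generated[-1]["content"]
    # Plain-string prompt: generated_text is already a string.
    return generated


# Hand-written stand-in for the value of pipe(messages); not real model output.
sample_result = [
    {
        "generated_text": [
            {"role": "user", "content": "Who are you?"},
            {"role": "assistant", "content": "(model reply)"},
        ]
    }
]
print(last_assistant_reply(sample_result))
```

With a loaded pipeline, the same helper would be called as `last_assistant_reply(pipe(messages))`.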

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("MiniMaxAI/MiniMax-M2", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("MiniMaxAI/MiniMax-M2", trust_remote_code=True)
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    tokenize=True,
    return_dict=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Inference

HuggingChat

Notebooks

Google Colab, Kaggle

Local Apps

vLLM

How to use MiniMaxAI/MiniMax-M2 with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "MiniMaxAI/MiniMax-M2"
# Call the server using curl (OpenAI-compatible API).
# Run this from a second terminal once the server has finished loading:
curl -X POST "http://localhost:8000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "MiniMaxAI/MiniMax-M2",
        "messages": [
            {
                "role": "user",
                "content": "What is the capital of France?"
            }
        ]
    }'
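
The same request can be sent from Python instead of curl. The following is a minimal sketch using only the standard library; it assumes the server from the example above is listening on `localhost:8000`. The helper names `build_chat_request` and `send` are hypothetical and not part of vLLM.

```python
import json
import urllib.request


def build_chat_request(
    prompt: str,
    base_url: str = "http://localhost:8000",
    model: str = "MiniMaxAI/MiniMax-M2",
) -> urllib.request.Request:
    """Build the same POST request that the curl example sends."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        url=f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 60.0) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)


# Usage, once the server is running (not executed here):
#   reply = send(build_chat_request("What is the capital of France?"))
#   print(reply["choices"][0]["message"]["content"])
```

Building the request separately from sending it keeps the payload easy to inspect without a running server.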

Use Docker

docker model run hf.co/MiniMaxAI/MiniMax-M2

SGLang

How to use MiniMaxAI/MiniMax-M2 with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "MiniMaxAI/MiniMax-M2" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API).
# Run this from a second terminal once the server has finished loading:
curl -X POST "http://localhost:30000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "MiniMaxAI/MiniMax-M2",
        "messages": [
            {
                "role": "user",
                "content": "What is the capital of France?"
            }
        ]
    }'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "MiniMaxAI/MiniMax-M2" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API).
# Run this from a second terminal once the container has finished loading the model:
curl -X POST "http://localhost:30000/v1/chat/completions" \
    -H "Content-Type: application/json" \
    --data '{
        "model": "MiniMaxAI/MiniMax-M2",
        "messages": [
            {
                "role": "user",
                "content": "What is the capital of France?"
            }
        ]
    }'
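
The curl calls above print a JSON document rather than the bare answer. As a rough sketch, assuming the server follows the standard OpenAI chat-completions response shape (`choices[0].message.content`), the reply text can be extracted as shown below. The function name `extract_reply` and the sample body are hypothetical; the sample is hand-written, not captured server output.

```python
import json


def extract_reply(response_body: str) -> str:
    """Return the assistant text from a chat-completions JSON response body."""
    data = json.loads(response_body)
    return data["choices"][0]["message"]["content"]


# Hand-written stand-in for a response body; real responses carry more fields.
sample_body = json.dumps(
    {
        "model": "MiniMaxAI/MiniMax-M2",
        "choices": [
            {
                "index": 0,
                "message": {"role": "assistant", "content": "(model reply)"},
            }
        ],
    }
)
print(extract_reply(sample_body))
```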

Docker Model Runner

How to use MiniMaxAI/MiniMax-M2 with Docker Model Runner:

```
docker model run hf.co/MiniMaxAI/MiniMax-M2
```

MiniMax-M2/docs/sglang_deploy_guide_cn.md

Commit History

c2b7e11: update guides (sriting committed on Oct 29, 2025)
7584a2e: update wechat qrcode (sriting committed on Oct 29, 2025)
3af90e1: update guide (sriting committed on Oct 29, 2025)
a970723: update guides (sriting committed on Oct 27, 2025)
4167c37: update sglang guide (sriting committed on Oct 27, 2025)
ffec31a, verified: Fix SGLang version in CN deployment guide (#7) (sriting and kzhou10 committed on Oct 27, 2025)
a5ec157: update sglang guide cn (sriting committed on Oct 26, 2025)
c4075e3: update README (jiaxin committed on Oct 26, 2025)
8abcad7: update README (jiaxin committed on Oct 26, 2025)
7496e98: update REDME and guides (jiaxin committed on Oct 26, 2025)
