Back to catalog
🌊
Local LLM Deployments
Mistral Large 2
Mistral Large 2 is Mistral AI's flagship model — 70 billion parameters with exceptional multilingual support (12+ languages), strong code generation, and advanced reasoning. It's optimized for production workloads where quality and commercial licensing matter.
Advanced+ requiredfrom $599/mo
10 min provisioning
OpenAI-compatible APIMade by Mistral AI
License: Mistral Research License
Technical Specifications
Tap the icon next to any term for a plain-language explanation.
Model size70B parameters
Memory required64 GB
Speed (M4 Pro)~11 tok/s
QuantizationQ4_K_M
Context window33K tokens
Disk space40 GB
RuntimeOllama + MLX
Use Cases
- Enterprise AI applications
- Multilingual content creation
- Code review and generation
- Complex reasoning chains
- Production chatbots
What you get
- Ollama runtime with Mistral Large 2 pre-loaded
- MLX backend for optimized inference
- OpenAI-compatible API endpoint
- Prometheus metrics
Start using it
curl
curl https://dep-<id>.macyou.cloud/v1/chat/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer mcy_live_<your-key>" \
-d '{
"model": "mistral-large-2",
"messages": [{"role": "user", "content": "Hello!"}],
"stream": true
}'Python (OpenAI SDK)
from openai import OpenAI
client = OpenAI(
api_key="mcy_live_<your-key>",
base_url="https://dep-<id>.macyou.cloud/v1"
)
response = client.chat.completions.create(
model="mistral-large-2",
messages=[{"role": "user", "content": "Hello!"}],
stream=True
)
for chunk in response:
print(chunk.choices[0].delta.content or "", end="")Tags
LLMMultilingualCommercial