Question 1

What’s in the catalog?

Accepted Answer

Pre-configured templates for the most popular AI workloads: Local LLM deployments (Llama, Qwen, Mistral, DeepSeek), Agent Hosting (OpenClaw, AutoGPT, CrewAI, LangGraph), AI Development Environments (MLX + Jupyter, Core ML, VSCode Server), Multi-Model Orchestration (LiteLLM), and Custom (blank Apple Silicon machine). Each template deploys in 5 minutes or less.

Question 2

How is this different from running OpenAI or Anthropic’s API?

Accepted Answer

With Macyou, the model runs locally on dedicated Apple Silicon hardware that only you can access — we don’t send your data to OpenAI or any third party. Your deployment exposes an API that uses the same format as OpenAI (same endpoints, same request/response structure), so existing code works with just a base_url change. But behind that API is your own Ollama or llama.cpp instance running on bare metal, not a call to OpenAI’s servers. Fixed monthly price, no per-token fees, full data privacy.

Question 3

What does “OpenAI-compatible API” mean? Do you use OpenAI?

Accepted Answer

No. We don’t use OpenAI’s services at all. “OpenAI-compatible” means your Macyou deployment exposes the same API format (/v1/chat/completions, /v1/models, etc.) that OpenAI popularized. Behind it runs your own local model (Llama, Mistral, Qwen, etc.) on your dedicated Apple Silicon hardware. This means any code that works with the OpenAI SDK can switch to Macyou by changing two lines: the base URL and the API key. No data ever goes to OpenAI.

Question 4

What happens to my data?

Accepted Answer

Your data runs on a physically dedicated machine — not shared infrastructure. All data at rest is encrypted with full disk encryption. Network traffic is encrypted via TLS. Each machine has its own firewall. Full disk wipe is performed between tenants. Our data center is in a GDPR-friendly jurisdiction.

Question 5

What’s the 5-minute promise?

Accepted Answer

From clicking “Deploy” to a working endpoint: 5 minutes or less for all standard catalog templates. This includes provisioning dedicated Apple Silicon hardware, installing the software stack, and returning your endpoint URL and API key. For Max-tier frontier models, provisioning may take up to 15 minutes.

Question 6

Do I still get SSH and root access?

Accepted Answer

Yes. Every deployment gives you SSH access to your dedicated machine. Custom deployments also include VNC and a web terminal. You have full root access — install anything you want alongside the pre-configured stack.

Question 7

What AI models can I run?

Accepted Answer

It depends on your plan. Starter (16 GB) runs up to 14B parameter models like Llama 3.3 8B. Standard (32 GB) handles up to 32B models. Advanced (64 GB) runs Llama 70B quantized. Pro (96 GB) runs 70B at full precision. Max (128 GB) handles 120B+ frontier models entirely in unified memory. Need more? Any tier can scale RAM by clustering multiple devices via Thunderbolt 5.

Question 8

What is Thunderbolt clustering?

Accepted Answer

Mac Minis can be linked via Thunderbolt 5 (120 Gbps) to pool unified memory across nodes. For example, two 64 GB nodes give you 128 GB effective memory, four nodes give you 256 GB. This lets you run frontier models that exceed a single device’s RAM. Clustering is available on every tier as an add-on — you pay the same tier rate per additional node. M5 chips will bring even higher inter-node bandwidth.

Question 9

How does billing work?

Accepted Answer

Fixed monthly price per plan — no per-token, per-request, or per-hour fees. Save 20% with annual billing. All plans include catalog access, OpenAI-compatible API, 24/7 uptime, backups, and support. No setup fees.

Question 10

Can I cancel anytime?

Accepted Answer

Yes. Monthly plans cancel at the end of the current billing cycle and you keep full access until that date. Annual plans run for the committed term. See our Refund Policy for refund eligibility — refunds are issued only if we fail to provision a server or if you are charged in error.

Question 11

Do you support healthcare, legal, or finance workloads?

Accepted Answer

Yes. Our infrastructure is designed for regulated industries: physically isolated hardware, GDPR-friendly jurisdiction, HIPAA-ready architecture, and SOC 2 in progress. See our solutions pages for healthcare, legal, and finance for details on what we provide today and what’s coming.

Your AI,
deployed.

Deploy AI from the catalog

Local LLM Deployments

Agent Hosting

AI Development Environment

Up and running in 3 steps

Choose

Deploy

Use

Built for every AI workload

Deploy autonomous agents from the catalog

Why we built on Apple Silicon

Unified memory, no hard ceiling

Energy efficiency

MLX and Core ML

Privacy by architecture

Drop-in OpenAI replacement

Loved by developers worldwide

Simple, transparent pricing

Starter

Standard

Advanced

Pro

Max

Need more than self-serve?

Talk to our Solutions team

Frequently asked questions

Your AI,
deployed in 5 minutes.

Simple, transparent pricing

Starter

Standard

Advanced

Pro

Max

Your AI,deployed.

Deploy AI from the catalog

Local LLM Deployments

Agent Hosting

AI Development Environment

Up and running in 3 steps

Choose

Deploy

Use

Built for every AI workload

Deploy autonomous agents from the catalog

Why we built on Apple Silicon

Unified memory, no hard ceiling

Energy efficiency

MLX and Core ML

Privacy by architecture

Drop-in OpenAI replacement

Loved by developers worldwide

Simple, transparent pricing

Starter

Standard

Advanced

Pro

Max

Need more than self-serve?

Talk to our Solutions team

Frequently asked questions

What’s in the catalog?

How is this different from running OpenAI or Anthropic’s API?

What does “OpenAI-compatible API” mean? Do you use OpenAI?

What happens to my data?

What’s the 5-minute promise?

Do I still get SSH and root access?

What AI models can I run?

What is Thunderbolt clustering?

How does billing work?

Can I cancel anytime?

Do you support healthcare, legal, or finance workloads?

Your AI,deployed in 5 minutes.

Simple, transparent pricing

Starter

Standard

Advanced

Pro

Max

Your AI,
deployed.

Your AI,
deployed in 5 minutes.