Pricing

Self-serve and enterprise pricing for our private inference and chat.

Private Chat

$10/month

Access our premium chat interface with the latest AI models, all running in secure enclaves.

  • One week free trial
  • Access to premium models
  • No rate limits
  • Email and Slack support
DeepSeek R1DeepSeek R1
GPT-OSS 120BGPT-OSS 120B
Qwen 2.5Qwen 2.5
Mistral SmallMistral Small
Llama 3.3Llama 3.3

Private Inference

$2 per 1M tokens

Inference API access for building your AI applications with enterprise-grade models.

  • Access to all premium models
  • OpenAI-compatible API
  • Dashboard and usage metrics
  • Email and Slack support

Enterprise

Custom pricing

Custom deployment and dedicated support for your organization.

  • Dedicated inference endpoints
  • Custom models and prompts
  • Model training and fine-tuning
  • Custom API endpoints
  • SSO and Access Controls
  • Audit logs
  • On-prem integrations
  • Dedicated support

Frequently Asked Questions

What models are available?

All available models are listed on our inference page. We offer a wide range of state-of-the-art open-source models, all running in GPU-powered secure hardware enclaves.

How is billing calculated?

Private Chat is a flat monthly fee. Private Inference is pay-as-you-go based on token usage. Enterprise plans are custom quoted based on your needs.

Cancel any time?

Yes, you can cancel your subscription at any time. There are no long-term contracts or cancellation fees. Your service will continue until the end of your current billing period.

What payment methods do you accept?

All payments are handled through Stripe for our self-serve plans. Enterprise customers can discuss alternative payment arrangements with our sales team.