Private Inference API

Our API is compatible with the OpenAI Chat Completions API format, making it easy to integrate with your existing applications while adding verifiable security.

All inference requests are processed in secure enclaves, ensuring complete privacy of your data.

Example Usage

curl -N https://inference.tinfoil.sh/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama3.2:1b",
    "messages": [
      {
        "role": "system",
        "content": "You are a helpful assistant."
      },
      {
        "role": "user",
        "content": "What are the key principles of secure system design?"
      }
    ],
    "stream": true
  }'

Available Models

llama3.2:1b

A lightweight but capable model, perfect for testing and development.

Coming Soon

More models will be available soon, including larger and more specialized variants.