DeepSeek-R1-Distill-Llama-70B

A 70B reasoning model distilled from DeepSeek-R1, aimed at math, code, and multi-step reasoning tasks

Model Details

This model is part of DeepSeek's first-generation reasoning family, whose flagship, DeepSeek-R1, achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks. The 70B-parameter distilled model is derived from Llama-3.3-70B-Instruct and fine-tuned on reasoning samples generated by DeepSeek-R1.

Parameters

70.6 billion

Context Window

131k tokens

Recommended Use

Best suited to complex multi-step reasoning, mathematical problem solving, and coding tasks that depend on careful logic.

Supported Languages

Multilingual with strong performance across major languages

Usage Examples

curl -N https://inference.delta.tinfoil.sh/api/chat \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-r1:70b",
    "messages": [
      {
        "role": "system",
        "content": "You are a helpful assistant."
      },
      {
        "role": "user",
        "content": "Why is tinfoil now called aluminum foil?"
      }
    ],
    "stream": true
  }'
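
DeepSeek-R1 family models typically write their chain of reasoning between <think> and </think> tags before the final answer. The helper below is a minimal sketch for keeping only the final answer. It assumes the reasoning arrives inline in the response text, which this page does not confirm for this endpoint. The name strip_think is hypothetical and not part of any API.

```shell
#!/bin/sh
# strip_think (hypothetical helper): read model output on stdin and print it
# without the first <think>...</think> reasoning block.
# Assumption: the reasoning is returned inline in the response text.
strip_think() {
  awk '
    { buf = buf $0 "\n" }              # collect all of stdin into one string
    END {
      start = index(buf, "<think>")
      stop  = index(buf, "</think>")
      if (start > 0 && stop > start) {
        # keep the text before <think> and the text after </think>
        buf = substr(buf, 1, start - 1) substr(buf, stop + length("</think>"))
      }
      sub(/^[ \t\n]+/, "", buf)        # drop leading whitespace left behind
      printf "%s", buf
    }'
}
```

Once the assistant text has been extracted from the response, pipe it through the helper: printf '%s\n' "$reply" | strip_think. Text without a <think> block passes through unchanged apart from leading whitespace. The helper buffers all input before printing, so it fits a completed reply better than a live stream.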