High-performance speech recognition and transcription model
Advanced speech recognition model with high accuracy, fast inference, and multilingual support. Optimized for real-time transcription, with robust performance across varied audio conditions, accents, and specialized terminology.
Parameters: 809 million
Input window: 30 seconds of audio
Ideal for transcription services, captioning, voice interfaces, and multilingual audio processing applications.
Supports 90+ languages with high accuracy across diverse accents and dialects
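The 30-second figure above is the span of audio the model handles in one pass. This page does not say whether the hosted endpoint splits longer recordings itself, so the following is only a minimal sketch of how a client could plan fixed-length windows if it needed to. The function name plan_windows and its parameters are hypothetical and are not part of the tinfoil package.

```python
from typing import List, Tuple


def plan_windows(duration_s: float, window_s: float = 30.0) -> List[Tuple[float, float]]:
    """Return (start, end) offsets in seconds that cover a clip in fixed windows.

    Hypothetical helper for illustration only. The 30-second default mirrors
    the input window listed on this page.
    """
    if duration_s < 0:
        raise ValueError("duration_s must be non-negative")
    if window_s <= 0:
        raise ValueError("window_s must be positive")

    windows: List[Tuple[float, float]] = []
    start = 0.0
    while start < duration_s:
        # The final window is shortened so it never runs past the end of the clip.
        end = min(start + window_s, duration_s)
        windows.append((start, end))
        start = end
    return windows


if __name__ == "__main__":
    # A 75-second clip needs two full windows and one 15-second remainder.
    print(plan_windows(75.0))  # [(0.0, 30.0), (30.0, 60.0), (60.0, 75.0)]
```

Cutting at fixed offsets can split a word in half, so a real pipeline would probably cut at silences or overlap adjacent windows. This sketch leaves that out to keep the arithmetic visible.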
Installation:
pip install tinfoil
Inference:
from tinfoil import TinfoilAI

client = TinfoilAI(
    enclave="whisper-large-v3-turbo.model.tinfoil.sh",
    repo="tinfoilsh/confidential-whisper-large-v3",
    api_key="YOUR_API_KEY",
)

# Open the audio file in binary mode and send it for transcription
with open("audio.mp3", "rb") as audio_file:
    transcription = client.audio.transcriptions.create(
        model="whisper-large-v3-turbo",
        file=audio_file,
    )

print(transcription.text)
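For more than one file, the call above can be wrapped in a small function. The sketch below assumes only what the snippet already shows: a client object exposing audio.transcriptions.create(model=..., file=...) and a result with a .text attribute. The helper name transcribe_file is hypothetical and is not provided by the tinfoil package.

```python
from pathlib import Path
from typing import Any, Union


def transcribe_file(
    client: Any,
    path: Union[str, Path],
    model: str = "whisper-large-v3-turbo",
) -> str:
    """Send one audio file for transcription and return the recognized text.

    Hypothetical convenience wrapper: `client` is expected to behave like the
    TinfoilAI client constructed in the snippet above.
    """
    audio_path = Path(path)
    if not audio_path.is_file():
        # Fail before making a network call if the path is wrong.
        raise FileNotFoundError(f"audio file not found: {audio_path}")

    # Binary mode matters: the audio bytes are uploaded as-is.
    with audio_path.open("rb") as audio_file:
        result = client.audio.transcriptions.create(model=model, file=audio_file)
    return result.text
```

With the client from the snippet above, usage would look like print(transcribe_file(client, "audio.mp3")). Passing the client in as an argument, rather than constructing it inside the helper, keeps the API key out of the function and makes it easy to substitute a stub in tests.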