Octave text-to-speech pricing

Free

$0/ month

What's included:

  1. 10,000 characters of text to speech per month (~10 minutes)
  2. Unlimited custom voices

Starter

$3/ month

Everything in Free, plus:

  1. 30,000 characters of text to speech per month (~30 minutes)
  2. Unlimited custom voices
  3. 20 projects
  4. Commercial license
most popular

Creator

$10/ month

Everything in Starter, plus:

  1. 100,000 characters of text to speech per month (~100 minutes)
  2. Usage based pricing for additional characters ($0.20/1,000)
  3. Unlimited custom voices
  4. 1,000 projects
  5. Commercial license

Pro

$50/ month

Everything in Creator, plus:

  1. 500,000 characters of text to speech per month (~500 minutes)
  2. Usage based pricing for additional characters ($0.15/1,000)
  3. Unlimited custom voices
  4. 3,000 projects
  5. Commercial license

Scale

$150/ month

Everything in Pro, plus:

  1. 2,000,000 characters of text to speech per month (~2,000 minutes)
  2. Usage based pricing for additional characters ($0.13/1,000)
  3. Unlimited custom voices
  4. 10,000 projects
  5. Commercial license

Business

$900/ month

Everything in Scale, plus:

  1. 10,000,000 characters of text to speech per month (~10,000 minutes)
  2. Usage based pricing for additional characters ($0.10/1,000)
  3. Unlimited custom voices
  4. 20,000 projects
  5. Commercial license

Enterprise

Custom price

Everything in Business, plus:

  1. As much usage as you need
  2. Custom terms & assurance around DPA/SLAs
  3. Security questionnaires
  4. Unlimited custom voices
  5. Significantly discounted pricing at scale
  6. Priority support
  7. Commercial license

Compare our plans

BusinessGet started
EnterpriseContact sales
Price/month$0$3$10$50$150$900Custom
Monthly characters included10,000 characters (~10 minutes)30,000 characters (~30 minutes)100,000 characters (~100 minutes)500,000 characters (~500 minutes)2,000,000 characters (~2,000 minutes)10,000,000 characters (~10,000 minutes)As much as you need
Additional characters cost (usage-based)$0.20/1,000$0.15/1,000$0.13/1,000$0.10/1,000Custom
Projects201,0003,00010,00020,000As much as you need
Custom voicesUnlimitedUnlimitedUnlimitedUnlimitedUnlimitedUnlimitedUnlimited
Voice cloningComing soonComing soonComing soonComing soonComing soonComing soonComing soon
Commercial license

EVI & Expression Measurement pricing

Pay as you go

Ideal for individual developers, startups, and businesses that prefer a flexible pricing structure based on usage

  1. $20 in free credit
  2. Only pay for what you use
  3. No upfront payment or commitment
  4. Technical support in Discord

Enterprise

For businesses with high volume and advanced data control requirements

  1. High volume discounts
  2. Dataset licenses
  3. On-prem solutions
  4. Custom integrations and features
  5. Dedicated technical support

Pricing

Empathic Voice Interface API

EVI 1 (Legacy)
Price
$0.102 / min
Transcription with expression measures
Expressive TTS with prosody generation
Prosody generation
Interruptibility
Voice customizability
Low latency
900ms - 2000ms
Base voices
3 voices
Multilingual support
English only
EVI 2
Price
$0.072 / min
Transcription with expression measures
Expressive TTS with prosody generation
Prosody generation
Interruptibility
Voice customizability
Low latency
500ms - 800ms
Base voices
7 voices
Multilingual support
Multiple languages soon
Enterprise
Price
Volume discounts
Transcription with expression measures
Expressive TTS with prosody generation
Prosody generation
Interruptibility
Voice customizability
Low latency
On-premises solutions
Base voices
Custom voices
Multilingual support
Custom languages

Expression Measurement API

Pay as you go
Video with audio
Facial expression, Speech prosody, Vocal burst, Emotional language, Facemesh, Transcription
$0.0276 / min
Audio only
Speech prosody, Vocal burst, Emotional language, Transcription
$0.0213 / min
Video only
Facial expression, Facemesh
$0.015 / min
Images
Facial expression, Facemesh
$0.00068 / image
Text only
Emotional language
$0.00008 / word
Enterprise
Video with audio
Facial expression, Speech prosody, Vocal burst, Emotional language, Facemesh, Transcription
Volume discounts
Audio only
Speech prosody, Vocal burst, Emotional language, Transcription
Volume discounts
Video only
Facial expression, Facemesh
Volume discounts
Images
Facial expression, Facemesh
Volume discounts
Text only
Emotional language
Volume discounts

Custom Models API

Pay as you go
Training
Build and customize models to suit your needs
Free
Inference
Deploy and use your trained models
Same as Expression Measurement API
Enterprise
Training
Build and customize models to suit your needs
Free
Inference
Deploy and use your trained models
Volume discounts