hume.ai logo

The first LLMs for voice and emotional intelligence

Toward AI that understands and optimizes for human expression

40+

publications

3000+

citations

1 million+

participants

Speech recognition, understanding, and generation with the same core intelligence

With the first speech-language models, EVI 1 and 2, we pioneered voice AI that understands what it’s saying. Our latest model, Octave, models the multiplex of human personas.

Fine-tuned with scientifically controlled data

Traditional theories posited six discrete emotions, but we’ve discovered that emotional behavior is better explained by a high-dimensional, continuous space

speechProsody
Speech Prosody

Discover over 25 patterns of tune, rhythm, and timbre

Modalities

Speech

Emotions

Amusement, Anger, Awkwardness, Boredom, Calmness, Confusion, Contempt, Desire, Determination, Distress, Fear, Guilt, Horror, Pain, Pride, Sadness, Surprise, and Tiredness

speechProsodyvocalExpressionvocalCallTypes
Voice

Speech Prosody

1/3

Reward Modeling
Voice AI optimized for human preferences

Led by researchers at the intersection of psychology and AI, we run large-scale controlled studies to optimize our models for human preferences. In our most recent evaluation, we found that speech generated by Octave was greatly preferred over the previous state-of-the-art.

Reward Modeling
The Future of AI Interaction
Maintaining frontier language capability

Despite its diverse speech processing and generation capabilities, Octave maintains comparable performance on language understanding tasks to a similar-sized frontier LLM. That means that it is well-suited to power AI systems that follow detailed instructions, use tools, or control an interface.

Performance Summary

Publications

Discover the research foundational to our products

  • Sixteen facial expressions occur in similar contexts worldwide

    Nature.png
  • What music makes us feel: At least 13 dimensions organize subjective experiences associated with music across different cultures

    PNAs logo
  • Self-report captures 27 distinct categories of emotion bridged by continuous gradients

    Pnas logo
  • Universal facial expressions uncovered in art of the ancient Americas: A computational approach

    Science Advances logo
  • Mapping the passions: Toward a high-dimensional taxonomy of emotional experience and expression

    Trends in Cognitive Sciences logo
  • The neural representation of visually evoked emotion Is high-dimensional, categorical, and distributed across transmodal brain regions

    iScience logo
  • GoEmotions: A dataset of fine-grained emotions

    ACL logo
  • Facial movements have over twenty dimensions of perceived meaning that are only partially captured with traditional methods

    PsyArxiv logo
  • The primacy of categories in the recognition of 12 emotions in speech prosody across two cultures

    Nature human behaviour logo
  • How emotion is experienced and expressed in multiple cultures: a large-scale experiment across North America, Europe, and Japan

    Logo Frontiers Grey
  • Semantic Space Theory: Data-driven insights into basic emotions

    CDPS logo
  • Deep learning reveals what vocal bursts express in different cultures

    Nature human behaviour logo
  • The MuSe 2022 Multimodal Sentiment Analysis Challenge: Humor, Emotional Reactions, and Stress

    ACM
  • How emotions, relationships, and culture constitute each other: Advances in social functionalist theory

    Cognition and Emotion logo
  • Intersectionality in emotion signaling and recognition: The influence of gender, ethnicity, and social class

    Emotion journal logo
  • Mapping 24 emotions conveyed by brief human vocalization

    American Psychologist
  • What the face displays: Mapping 28 emotions conveyed by naturalistic expression

    American Psychologist
  • The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, generating, and personalizing vocal bursts

    ICML2022 logo
  • The ACII 2022 Affective Vocal Bursts Workshop & Competition: Understanding a critically understudied modality of emotional expression

    ACII2022
  • Emotional expression: Advances in basic emotion theory

    Jnb 01.png
  • Deep learning reveals what facial expressions mean to people in different cultures

    Iscience

Join us in building the future of empathic technology

Careers at hume