LogoCollectAI
Submit
icon of Miso One

Miso One

Miso One is an 8B-parameter open-weights text-to-speech model by Miso Labs designed for expressive English conversational speech and low-latency research.

Summary

Miso One is an 8B-parameter open-weights text-to-speech model by Miso Labs designed for expressive English speech and low-latency voice agent applications.

What is Miso One?

Miso One is an 8B-parameter text-to-speech system developed by Miso Labs. It is designed for expressive English conversational speech, voice continuation, and low-latency voice-agent research.

Key Features
  • 8B Parameter Model: An open-weights text-to-speech system available for local inference.
  • Expressive Speech: Optimized for conversational delivery, emotion, and pacing in English.
  • Audio Context Support: Capable of conditioning on prompt audio for voice continuation and one-shot cloning.
  • Low-Latency Performance: Designed for voice-agent workflows with a published 110 ms latency benchmark.
  • Discrete Audio Coding: Utilizes Mimi audio codes for speech generation.

Key Highlights

  • 8B-parameter open-weights model architecture.
  • Optimized for expressive English conversational speech.
  • Supports one-shot voice cloning and voice continuation.
  • Published 110 ms latency benchmark for agent workflows.
  • Uses discrete audio-code modeling.
  • Publicly available weights via Hugging Face and GitHub.

Ideal For

  • 1.Developers building voice agents requiring low-latency speech.
  • 2.Researchers evaluating expressive conversational speech models.
  • 3.Creators needing voice continuation or one-shot voice cloning.
  • 4.Engineers testing local deployment of large speech models.

Pros

  • Open-weights model allows for local inspection and self-hosting.
  • Supports prompted generation for consistent speaker identity.
  • Provides a public demo for evaluating voice quality and latency.

Cons

  • English-only language support.
  • Requires significant local hardware resources for 8B parameter model inference.
  • Free tier is limited to 120 characters per conversion.

Frequently Asked Questions

What is Miso One?

Miso One is the product-facing name for Miso Labs' Miso TTS 8B, an open-weights text-to-speech system built for expressive English speech.

Is Miso One open source?

Yes, the Miso TTS 8B model weights and inference code are public and available via the MisoLabs GitHub repository and Hugging Face.

How many languages does Miso One support?

The current public release of Miso TTS 8B is focused on English only.

Can I run Miso TTS 8B locally?

Yes, developers can download the 8B weights and run inference locally, though it requires a GPU-capable environment.

Does Miso One support voice cloning?

Yes, the model supports prompted generation from audio context, allowing for voice continuation and one-shot voice cloning.

Information

Traffic

Last update: N/A

Latest month
N/A

No traffic data available yet.

Categories

Newsletter

Join the Community

Subscribe to our newsletter for the latest news and updates