ChatTTS

by 2noise

ChatTTS is a text-to-speech (TTS) model designed for dialogue scenarios, supporting both Chinese and English, and capable of generating high-quality, natural-sounding speech.

What is ChatTTS?

ChatTTS is an open-source text-to-speech (TTS) model specifically designed for dialogue scenarios. It supports both Chinese and English and is trained on approximately 100,000 hours of data to produce high-quality, natural-sounding speech. The model is optimized for conversational tasks, offering fine-grained control over prosodic features like laughter and pauses, and supports multiple speakers.

Key Features

Text-to-Speech: Converts text into natural-sounding speech in real-time.
Multi-Language Support: Supports both Chinese and English.
Prosody Control: Adjusts emotional tone, speed, pitch, and pauses for more natural speech.
Voice Role Selection: Offers multiple preset voice roles for different scenarios.
Interactive Web Interface: Allows users to input text and receive speech output directly in their browser.
Real-Time Speech Interaction: Ideal for dialogue systems requiring immediate feedback.
Speech File Export: Exports synthesized speech as common audio file formats.

Getting Started

Online Demo

Experience ChatTTS through the online demo on ModelScope or Hugging Face.

Local Deployment

Install Environment: Ensure Python and Git are installed.
Download SDK: Install ModelScope and SDK model download.
Clone Source Code: Clone the ChatTTS repository from ModelScope.
Install Dependencies: Install required Python dependencies using pip.
Run WebUI: Build and run the WebUI for local use.

Use Cases

Virtual Assistants: Enhances speech output for virtual assistants and customer service bots.
Audiobooks: Converts text content into speech for audiobooks and e-books.
Social Media: Generates engaging voice content for social media platforms.
Accessibility: Provides voice assistance for visually impaired users.

Model Capabilities

Model Type

Text-to-Speech

Supported Tasks

Speech Synthesis Dialogue Generation Prosody Adjustment Multi-Language Tts

Usage & Integration

Pricing

free

API Access

Available

License

Open Source AGPL-3.0

Requirements

Python
Git
ModelScope SDK

Screenshots & Images

Primary Screenshot

Additional Images

Try Now View Demo Documentation

Stats

47 Views

0 Favorites

Community & Support

GitHub Repository

Similar Models

Ola by Tsinghua University, Tencent Hunyuan Research Team, NUS S-Lab

294

Zonos by Zyphra

274

Step-Video-T2V by Leapfrogging Star

293

ChatTTS

What is ChatTTS?

Key Features