1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
-
Updated
Dec 30, 2025 - Python
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Instant voice cloning by MIT and MyShell. Audio foundation model.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Foundational model for human-like, expressive TTS
A simple, high-quality voice conversion tool focused on ease of use and performance.
GPT-SoVITS ONNX Inference Engine & Model Converter
Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.
An Open-Sourced LLM-empowered Foundation TTS System
Turn PDFs and EPUBs into audiobooks, subtitles or videos into dubbed videos (including translation), and more. For free. Pandrator uses local models, notably XTTS, including voice-cloning (instant, RVC-enhanced, XTTS fine-tuning) and LLM processing. It aspires to be a user-friendly app with a GUI, an installer and all-in-one packages.
Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI API is used (e.g. Open WebUI, AnythingLLM, etc.)
Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.
Revolutionize Your Voice with AI Voice Cloner! Transform Your Speech into Your Favorite Celebrity's or Your Customized Voice. Our Cutting-edge Tool Converts Text or Any Audio into Your Desired Voice – Your Voice, Your Way
Automated voice dubbing for YouTube videos using Docker, OpenVoice, and FastAPI. Translates and dubs videos with original voice timbre.
ComfyUI nodes for Step Audio EditX - State-of-the-art zero-shot voice cloning and audio editing with emotion, style, speed control, and more.
Talking Head of your favorite rapper using Transformers, PyTorch, Tortoise TTS, and OpenCV 🎵
This repo is text to speech with learnable audio encoder without alignment with transcript reference
Beautiful voice app: record or upload to train a voice, generate speech from text or files, save & download voices.
[Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"
Add a description, image, and links to the voice-clone topic page so that developers can more easily learn about it.
To associate your repository with the voice-clone topic, visit your repo's landing page and select "manage topics."