Get the latest tech news

Show HN: Voice Cloning and Multilingual TTS in One Click (Windows)

Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer(RVC), zero-shot Voice Cloning (E2, F5-TTS), YouTub...

With comprehensive features for YouTube video downloading, voice separation, speech recognition, translation, and text-to-speech, it offers an all-in-one solution for content creators, researchers, and multilingual communication professionals. Voice-Pro offers a realistic alternative to ElevenLabs, catering to content creators, podcasters, researchers, and developers seeking advanced text-to-speech solutions. Dubbing Studio tab Provides integrated environment for YouTube downloader, noise removal, subtitles, translation, and TTS All video/audio formats supported by ffmpeg can be used Selectable output audio format (wav, flac, mp3) Speech recognition and subtitle creation for 100 languages Select subtitle creation options suitable for PC performance (Whisper Model & Compute Type) Translation into over 100 languages and voice generation through TTS The BGM and sound effects from the original video are maintained in the multilingual video.

Get the Android app

Or read this on Hacker News