Get the latest tech news

Show HN: Voice Cloning and Multilingual TTS in One Click (Windows)


Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer(RVC), zero-shot Voice Cloning (E2, F5-TTS), YouTub...

With comprehensive features for YouTube video downloading, voice separation, speech recognition, translation, and text-to-speech, it offers an all-in-one solution for content creators, researchers, and multilingual communication professionals. Voice-Pro offers a realistic alternative to ElevenLabs, catering to content creators, podcasters, researchers, and developers seeking advanced text-to-speech solutions. Dubbing Studio tab Provides integrated environment for YouTube downloader, noise removal, subtitles, translation, and TTS All video/audio formats supported by ffmpeg can be used Selectable output audio format (wav, flac, mp3) Speech recognition and subtitle creation for 100 languages Select subtitle creation options suitable for PC performance (Whisper Model & Compute Type) Translation into over 100 languages ​​and voice generation through TTS The BGM and sound effects from the original video are maintained in the multilingual video.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of Windows

Windows

Photo of Voice cloning

Voice cloning

Photo of multilingual tts

multilingual tts

Related news:

News photo

SQLook – A free online SQLite database manager with a Windows 2000 interface

News photo

Microsoft to force Windows 11 24H2 on Home and Pro users

News photo

How Windows got to version 3 – an illustrated history