Abhishek Joshi       Jun 19, 2023

Meta Introduces Voicebox

Voicebox is skilled at creating audio clips of the highest quality and editing previously recorded audio to remove undesirable background noise while preserving the original content and style.

By allowing AI to read textual messages in the voices of their friends, it may also help people who are blind or visually challenged.

Voicebox allows creators to easily create and edit audio tracks for videos, among other features.

Voicebox can produce text-to-speech using an audio sample as brief as two seconds, while maintaining the audio's quality.

Without having to re-record, it can substitute words that were mispronounced or reconstruct speech fragments that were interrupted.

Voicebox can create a reading of the text in any of the supported languages (English, French, German, Spanish, Polish, and Portuguese) given a speech sample and a text excerpt in several languages.

Learn more