Meta unveils Audiobox, an AI text-to-audio generator
Audiobox significantly expands the capabilities of generative AI for audio, enabling users to create custom audio content with greater ease and control.
![Audiobox](/content/images/size/w2000/2023/11/AudioBox.png)
Meta today unveiled Audiobox, its latest foundation research model for audio generation. The model builds on its predecessor, Voicebox, and extends it from speech to general audio, including sound effects and soundscapes.
Audiobox introduces several key advancements, including:
- Describe-and-generate sound: Users can provide a natural language prompt describing the desired sound, and Audiobox will generate the corresponding audio. For instance, a prompt like "a running river and birds chirping" will produce a soundscape with those elements.
- Describe-and-generate speech: Users can input a short description of the desired voice along with the transcript to be narrated, and Audiobox will generate speech in that voice.
- Dual-input vocal restyling: Users can combine a voice recording with a text style prompt to synthesize speech in that voice with a described environment or emotion. This changes how the voice sounds while preserving the speaker's identity.
- Finer controllability: According to Meta, Audiobox gives users more precise control over the generated audio than previous models.
Meta says Audiobox was designed with responsible AI development in mind. The model incorporates automatic audio watermarking so that audio created with Audiobox can be traced back to its origin, safeguarding against potential misuse. Additionally, a voice authentication feature is intended to prevent impersonation attempts.
Audiobox is currently being released to a hand-selected group of researchers and academic institutions with a proven track record in speech research. Meta seeks to foster collaboration within the research community to further develop Audiobox's capabilities and address potential ethical considerations responsibly.
In the long term, Meta envisions a future where Audiobox's capabilities empower anyone to create personalized audio content with ease. This technology holds immense potential for content creators, narrators, sound editors, game developers, and AI chatbot creators.
Meta's Audiobox marks a significant step forward in generative AI for audio, paving the way for a more accessible and creative audio landscape. With its emphasis on responsible development and collaboration, Meta says it is aiming to ensure that Audiobox's transformative power is harnessed for the benefit of all.
PPC Land is an international news publication headquartered in Frankfurt, Germany. PPC Land delivers daily articles brimming with the latest news for marketing professionals of all experience levels.