ThePawn02

Gaming and Streaming Content

This new AI can mimic human voices with only 3 seconds of training

Vall-E, being developed by a team of researchers at Microsoft, uses an all-new system for learning how to talk.
January 9, 2023 3 min read

Humanity has taken yet another step toward the inevitable war against the machines (which we will lose) with the creation of Vall-E, an AI developed by a team of researchers at Microsoft that can produce high-quality replications of a human voice from only a few seconds of sample audio.

Vall-E isn’t the first AI-powered voice tool (xVASynth, for instance, has been kicking around for a couple of years now), but it promises to exceed them all in terms of pure capability. In a paper available on arXiv, the preprint server hosted by Cornell University (via Windows Central), the Vall-E researchers say that most current text-to-speech systems are limited by their reliance on “high-quality clean data” in order to accurately synthesize high-quality speech.

“Large-scale data crawled from the Internet cannot meet the requirement, and always lead to performance degradation,” the paper states. “Because the training data is relatively small, current TTS systems still suffer from poor generalization. Speaker similarity and speech naturalness decline dramatically for unseen speakers in the zero-shot scenario.”

(“Zero-shot scenario” in this case essentially means the ability of the AI to recreate voices without being specifically trained on them.)

Vall-E, on the other hand, is trained with a much larger and more diverse data set: 60,000 hours of English-language speech drawn from more than 7,000 unique speakers, all of it transcribed by speech recognition software. The data being fed to the AI contains “more noisy speech and inaccurate transcriptions” than that used by other text-to-speech systems, but researchers believe the sheer scale and diversity of the input make it much more flexible, adaptable, and (this is the big one) natural than its predecessors.
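To put those figures in perspective, here is a quick back-of-the-envelope sketch in Python. It uses only the numbers quoted above; rounding "more than 7,000" speakers down to exactly 7,000 is an assumption, so the per-speaker average is an upper bound, and the function names are purely illustrative.

```python
# Figures quoted in the article. "More than 7,000 speakers" is treated as
# exactly 7,000 (an assumption), so the per-speaker average is an upper bound.
TRAINING_HOURS = 60_000
SPEAKERS = 7_000
PROMPT_SECONDS = 3  # length of the voice sample from the headline


def average_hours_per_speaker(hours: float, speakers: int) -> float:
    """Mean amount of training audio per speaker, in hours."""
    return hours / speakers


def training_to_prompt_ratio(hours: float, prompt_seconds: float) -> float:
    """How many times longer the training set is than one voice sample."""
    return hours * 3600 / prompt_seconds


if __name__ == "__main__":
    avg = average_hours_per_speaker(TRAINING_HOURS, SPEAKERS)
    ratio = training_to_prompt_ratio(TRAINING_HOURS, PROMPT_SECONDS)
    print(f"Average audio per speaker: at most {avg:.1f} hours")
    print(f"Training set is {ratio:,.0f} times longer than a 3-second sample")
```

In other words, the three-second clip is tiny next to what the model has already heard: on these numbers, the training set is tens of millions of times longer than the sample it is asked to imitate.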

“Experiment results show that Vall-E significantly outperforms the state-of-the-art zero-shot TTS system in terms of speech naturalness and speaker similarity,” states the paper, which is filled with numbers, equations, diagrams, and other such complexities. “In addition, we find VALL-E could preserve the speaker’s emotion and acoustic environment of the acoustic prompt in synthesis.”


You can actually hear Vall-E in action on GitHub, where the research team has shared a brief breakdown of how it all works, along with dozens of samples of inputs and outputs. The quality varies: Some of the voices are notably robotic, while others sound quite human. But as a sort of first-pass tech demo, it’s impressive. Imagine where this technology will be in a year, or two, or five, as systems improve and the voice training dataset expands even further.

Which is, of course, why it’s a problem. Dall-E, the AI art generator, is facing pushback over privacy and ownership concerns, and the ChatGPT bot is convincing enough that it was recently banned by the New York City Department of Education. Vall-E has the potential to be even more worrying because of its possible use in scam marketing calls or to reinforce deepfake videos. That may sound a bit hand-wringy, but as our executive editor Tyler Wilde said at the start of the year, this stuff isn’t going away, and it’s vital that we recognize the issues and regulate the creation and use of AI systems before potential problems turn into real (and real big) ones.

The Vall-E research team addressed those “broader impacts” in the conclusion of its paper. “Since VALL-E could synthesize speech that maintains speaker identity, it may carry potential risks in misuse of the model, such as spoofing voice identification or impersonating a specific speaker,” the team wrote. “To mitigate such risks, it is possible to build a detection model to discriminate whether an audio clip was synthesized by VALL-E. We will also put Microsoft AI Principles into practice when further developing the models.”

