Speechless Recognition: Can AI Transcribe a Language It Has Never Heard?
Breakthroughs in multilingual speech models are rapidly expanding language coverage for ASR. But what about endangered languages with little to no training data?
Breakthroughs in multilingual speech models are rapidly expanding language coverage for ASR. But what about endangered languages with little to no training data?
As the race in all things language AI heats up, Google discloses more details about its Universal Speech Model that the search giant claims supports over 1,000 languages.
Deepgram CEO, Scott Stephenson, on scaling state-of-the-art speech API for enterprise and next-gen AI platforms, response to Whisper, and future of voice AI.
Among the platforms and workflows featured in Appen’s 2022 Investor Technology Day: Ontology Studio, which “productizes core components” of Appen’s “linguistic expertise.”
New research shows that automatic speech recognition and speech translation systems also improve when training data includes multiple instances of a given name.
Technological advances are affecting the audiovisual-translation landscape, shaping a new skill set for professionals aspiring to compete in the AVT market; and these job profiles are here to stay
From increasing productivity to allowing a more flexible, translator-centered, ergonomic workflow and workspace, this is how translators and post-editors benefit from speech technologies.
Dutch transcription, speech recognition, and subtitling startup Amberscript raises USD 10m from Endeit Capital, a growth capital firm founded by former Endemol Media executives.
YouTube lifts 1,000-subscriber cap on access to live automatic captions for content creators. Video transcripts soon to be available on mobile devices as well.
Templates are to subtitles what automatic speech recognition (ASR) and machine translation (MT) are to the subtitling workflow: revolutionary. Experts test AppTek tool, share candid feedback.
Google uses own system on a chip, Tensor, to power Google Live Caption and Interpreter Mode features for its latest smartphone, Pixel 6, slated for fall release 2021.
Google researchers say Translatotron 2 outperforms original in terms of speech-to-speech translation quality and naturalness. Plus the new version prevents users from generating speech for deepfakes.
After promising results in English, researchers hope Multilingual LibriSpeech’s 50,000 hours of audio across eight languages will improve automatic speech recognition around the world.
At startup Descript, translation is one of the most requested features. Descript’s technology may offer interesting new opportunities for LSPs to expand their offering in media content.
Flitto provides multilingual corpus, speech, and image data to train AI engines in 25 different languages, covering a number of domains including conversational, colloquial, and medical.
Slator Weekly: Join over 15,500 subscribers and get the latest language industry intelligence every Friday
Tool Box: Join 10,000 subscribers for your monthly linguist technology update.
Your information will not be shared with third parties. No Spam.
This will close in 0 seconds