Meta Claims It Just Made ‘Speech Translation a Whole Lot Better’
Built on the August 2023 SeamlessM4T, the Seamless suite includes models that preserve vocal style and translate text and speech in nearly 100 languages.
Amid a deluge of research into speech translation, many of the world’s leading technology companies are teaming up with academia to accelerate progress in this challenging field of language AI.
Researchers from NVIDIA, Factored.ai, Talon Voice, and others open-source a properly licensed dataset of 1,780 hours of speech in 77 different languages, plus transcriptions.
Google's AudioPaLM fuses text-based PaLM-2 and speech-based AudioLM into a large language model, elevating speech recognition and translation while preserving authentic voice quality.
European Union supports a project for the development of AI-based language solutions for defense applications to enhance the use of AI in the defense sector.
London-based AI dubbing startup, Papercup, raises USD 20m in oversubscribed round. Head of Growth, Amir Jirbandey, credits strong position to human-in-the-loop innovation, among others.
New research shows that automatic speech recognition and speech translation systems also improve when training data includes multiple instances of a given name.
Google uses its own system on a chip, Tensor, to power Google Live Caption and Interpreter Mode features for its latest smartphone, Pixel 6, slated for release in fall 2021.
Limecraft CEO Maarten Verwaest joins SlatorPod to talk about digital workflows and AI-enabled subtitling for media companies, and why AI should not be left unattended.
New research from DeepMind and Google explores end-to-end semi-automated dubbing; cites ongoing concerns around misuse of mimicry — from deep fakes to consent issues.
Facebook AI invites developers to improve a speech translation dataset and open-sources a toolkit that evaluates simultaneous speech and text translation.
Two research scientists from Apple surveyed 30 years’ worth of research into speech translation and here’s what they found.
MT and speech recognition company AppTek expands natural language understanding (NLU) capabilities with acquisition of speech technology provider Ignite-Tek.
Baidu launches API for speech-to-speech translation based on technology outlined in late-2018 research paper, which was criticized at the time for overpromising.