Scott recalls how his experience working with dark matter detectors as a particle physicist in China led to him becoming a deep-learning entrepreneur. He discusses some challenges in solving ASR; from labeling data for machine learning to formulating and executing an effective go-to-market strategy.
The CEO gives his thoughts on Whisper, OpenAI’s open-source ASR model, and how it may actually grow the total addressable market for voice AI companies. He shares the difficulties when it comes to translating a transcript versus translating straight from audio into another language.
Scott gives his advice on how to build a successful AI company and appeal to investors. The pod rounds off with Deepgram’s roadmap for the next year, with text to speech, voice cloning, real-time translation, and sentiment analysis being potential step changes in their growth trajectory.
First up, Florian and Esther catch up on the language industry news from the past month, with Google announcing the launch of Translation Hub, its enterprise-scale document translation service.
Esther discusses some of the language highlights from Netflix’s third-quarter earnings call, including the titles of some of the best-performing non-English content. Meanwhile, Zoo Digital’s share price was at a near all-time high as they weighed in at an almost USD 170m market cap.
The duo also talk about funding, where multilingual AI writer Jasper announced it had raised USD 125m in its unicorn-making series A, which valued the startup at USD 1.5bn. And, after a dip in the Slator Language Industry Job Index in September, the LIJI defied expectations of a slowdown as it reached an all-time high in November.