logo image
  • News
    • People Moves
    • Deal Wins
    • Demand Drivers
    • M&A and Funding
    • Financial Results
    • Technology
    • Academia
    • Industry News
    • Features
    • Machine Translation
    • — Divider —
    • Slator Pro
    • — Divider —
    • Press Releases
    • Sponsored Content
  • Data & Research
    • Research Reports & Pro Guides
    • Language Industry Investor Map
    • Real-Time Charts of Listed LSPs
    • Language Service Provider Index
  • Podcasts & Videos
  • Events
    • Design Thinking – February 2021
    • — Divider —
    • SlatorCon Coverage
    • Other Events
  • Directory
  • RFP Center
  • Jobs
MENU
  • News
    • People Moves
    • Deal Wins
    • Demand Drivers
    • M&A and Funding
    • Financial Results
    • Technology
    • Academia
    • Industry News
    • Features
    • Machine Translation
    • — Divider —
    • Slator Pro
    • — Divider —
    • Press Releases
    • Sponsored Content
  • Data & Research
    • Research Reports & Pro Guides
    • Language Industry Investor Map
    • Real-Time Charts of Listed LSPs
    • Language Service Provider Index
  • Podcasts & Videos
  • Events
    • Design Thinking – February 2021
    • — Divider —
    • SlatorCon Coverage
    • Other Events
  • Directory
  • RFP Center
  • Jobs

Advertise on Slator! Download the 2021 Online Media Kit Now

  • Slator Market Intelligence
  • Slator Advertising Services
  • Slator Advisory
  • Login
Search
Generic filters
Exact matches only
Advertisement
“Nearly Indistinguishable From Human Translation”—Google Claims Breakthrough

4 years ago

September 27, 2016

“Nearly Indistinguishable From Human Translation”—Google Claims Breakthrough

Technology ·

by Florian Faes

On September 27, 2016

4 years ago
Technology ·

by Florian Faes

On September 27, 2016

“Nearly Indistinguishable From Human Translation”—Google Claims Breakthrough

The Google team working on neural machine translation (NMT) has finally revealed what they have been working on. In a paper published on September 26, 2016 entitled Google’s Neural Machine Translation System: Bridging the Gap Between Human and Machine Translation, the researchers present results of the search engine giant’s all-out effort to maintain its lead in machine translation.

The claims are bold. The researchers not only found that Google’s NMT (GNMT) beats Google’s existing production phrase-based statistical translation system (aka Google Translate) by miles based on the widely used BLEU score, they also claim that “in some cases human and GNMT translations are nearly indistinguishable” and that “our system’s translation quality approaches or surpasses all currently published results.”

The researchers set out to address what, up until now, have been considered NMT’s most apparent weaknesses. First, training is computationally expensive—especially in cases with large data sets. Second, NMT struggles with the translation of rare words. And third, NMT sometimes fails to translate all parts of the input sentence, which according to the researchers can result in “surprising translations.”

Advertisement

“The quality of the resulting translation system gets closer to that of average human translators”—Google Neural Translation Team

The obstacles have, so far, prevented NMT from being deployed in a production environment, which is key to making NMT available through a free service like today’s Google Translate.

The paper says human evaluations show that GNMT “has reduced translation errors by 60% compared to our previous phrase-based system on English↔French, English↔Spanish, and English↔Chinese.” Apparently, the system is not constrained too much by what language it translates and “performs well on a range of datasets across many pairs of languages without the need for language-specific adjustments.”

gnmt-quality

NMT has also been known to take considerable time to process even when run on graphical processing units; another impediment to deploying it in a live production environment such as Google Translate.

The researchers addressed this by running their model on a Google Tensor Processing Unit, which resulted in a processing time that was three times faster than on a CPU, and nearly eight times faster than on a GPU. Conventional wisdom to date was that NMT runs faster on GPUs than CPUs.

Google assembled a diverse group of researchers for the project. In the lead were Yonghui Wu, Senior Staff Software Developer, who has been with Google since 2008. Wu worked on Google’s machine learning search algorithm RankBrain. Research Scientist Mike Schuster, who’s been with Google since 2006, was previously with speech recognition software company Nuance, and Japanese telco NTT Japan.

Also in the group were Zhifeng Chen, a Software Engineer who worked on TensorFlow; Quoc V. Le, Software Engineer, who has been with the company since 2011 and also works on Google Brain; and Mohammad Norouzi, a Google PhD fellow, who has been a research intern for tech giants Microsoft and Google.

The rest of the team includes, among others, Wolfgang Macherey, who joined Google in 2006, worked in the machine translation group with Franz Och, and has been working on natural language processing since 1996.

“We observed that human raters, even though fluent in both languages, do not necessarily fully understand each randomly sampled sentence sufficiently”—Google Neural Translation Team

Back to the claim that GNMT now almost matches human translation. The GNMT team bases that claim on evaluation data consisting of “500 randomly sampled sentences from Wikipedia and news websites, and the corresponding human translations to the target language.”

Thankfully, the researchers do acknowledge they are dealing in human language and not mathematical certainty. It is the human raters of the translations, however, whose judgment is qualified: “We observed that human raters, even though fluent in both languages, do not necessarily fully understand each randomly sampled sentence sufficiently and hence cannot necessarily generate the best possible translation or rate a given translation accurately.”

What’s next? Google will now start testing the system on particularly difficult translation cases and inputs longer than just a single sentence.

Editor’s Note: A previous version of this article read that Wolfgang Macherey “works” in an MT group with Franz Och. The correct tense is “worked” as Och is no longer with Google.

TAGS

GoogleGoogle Translatemachine translationneural machine translation
SHARE
Florian Faes

By Florian Faes

Co-Founder of Slator. Linguist, business developer, and mountain runner. Based in the beautiful lakeside city of Zurich, Switzerland.

Advertisement

SUBSCRIBE TO THE SLATOR WEEKLY

Language Industry Intelligence
In Your Inbox. Every Friday

SUBSCRIBE

SlatorSweepSlatorPro
ResearchRFP CENTER

PUBLISH

PRESS RELEASEDIRECTORY LISTING
JOB ADEVENT LISTING

Bespoke advisory including speaking, briefings and M&A

SLATOR ADVISORY
Advertisement

Featured Reports

See all
Slator 2020 Language Industry M&A and Funding Report

Slator 2020 Language Industry M&A and Funding Report

by Slator

Slator 2021 Data-for-AI Market Report

Slator 2021 Data-for-AI Market Report

by Slator

Slator 2020 Medtech Translation and Localization Report

Slator 2020 Medtech Translation and Localization Report

by Slator

Pro Guide: Sales and Marketing for Language Service Providers

Pro Guide: Sales and Marketing for Language Service Providers

by Slator

Press Releases

See all
Across Systems will be part of the Volaris Group

Across Systems will be part of the Volaris Group

by Across Systems GmbH

How Localex Made It Through the Pandemic

How Localex Made It Through the Pandemic

by Localex

Join Us for the First Virtual Together 2021 Next Month!

Join Us for the First Virtual Together 2021 Next Month!

by Elia

Upcoming Events

See All
  1. Handling Sensitive Information Webinar

    Handling Sensitive Calls with Limited English Proficient Consumers

    by Lionbridge

    · February 10

    Learn more about how Lionbridge Over-the-Phone Interpretation Services can help bridge communication gaps with limited...

    More info FREE

Featured Companies

See all
Text United

Text United

Memsource

Memsource

Wordbank

Wordbank

Protranslating

Protranslating

Seprotec

Seprotec

Versacom

Versacom

SDL

SDL

Smartling

Smartling

Lingotek

Lingotek

XTM International

XTM International

Smartcat

Smartcat

Translators without Borders

Translators without Borders

STAR Group

STAR Group

memoQ Translation Technologies

memoQ Translation Technologies

Advertisement

Popular articles

Why Netflix Shut Down Its Translation Portal Hermes

Why Netflix Shut Down Its Translation Portal Hermes

by Esther Bond

Top Language Industry Quotes of 2020

Top Language Industry Quotes of 2020

by Monica Jamieson

The Slator 2020 Language Service Provider Index

The Slator 2020 Language Service Provider Index

by Slator

SlatorPod: The Weekly Language Industry Podcast

connect with us

footer logo

Slator makes business sense of the language services and technology market.

Our Company

  • Support
  • About us
  • Terms & Conditions
  • Privacy Policy

Subscribe to the Slator Weekly

Language Industry Intelligence
In Your Inbox. Every Friday

© 2021 Slator. All rights reserved.

Sign up to the Slator Weekly

Join over 13,000 subscribers and get the latest language industry intelligence every Friday

Your information will never be shared with third parties. No Spam.