logo image
  • News
    • People Moves
    • Deal Wins
    • Demand Drivers
    • M&A and Funding
    • Financial Results
    • Technology
    • Academia
    • Industry News
    • Features
    • Machine Translation
    • — Divider —
    • Slator Pro
    • — Divider —
    • Press Releases
    • Sponsored Content
  • Data & Research
    • Research Reports & Pro Guides
    • Language Industry Investor Map
    • Real-Time Charts of Listed LSPs
    • Language Service Provider Index
  • Podcasts & Videos
  • Events
    • SlatorCon Remote May 2021
    • Localizing at Scale for International Growth
    • Design Thinking May 2021
    • — Divider —
    • SlatorCon Coverage
    • Other Events
  • Directory
  • RFP Center
  • Jobs
MENU
  • News
    • People Moves
    • Deal Wins
    • Demand Drivers
    • M&A and Funding
    • Financial Results
    • Technology
    • Academia
    • Industry News
    • Features
    • Machine Translation
    • — Divider —
    • Slator Pro
    • — Divider —
    • Press Releases
    • Sponsored Content
  • Data & Research
    • Research Reports & Pro Guides
    • Language Industry Investor Map
    • Real-Time Charts of Listed LSPs
    • Language Service Provider Index
  • Podcasts & Videos
  • Events
    • SlatorCon Remote May 2021
    • Localizing at Scale for International Growth
    • Design Thinking May 2021
    • — Divider —
    • SlatorCon Coverage
    • Other Events
  • Directory
  • RFP Center
  • Jobs

Register Now for SlatorCon Remote on May 13th!

  • Slator Market Intelligence
  • Slator Advertising Services
  • Slator Advisory
  • Login
Search
Generic filters
Exact matches only
Advertisement
Harvard Launches Open-source Neural Machine Translation System

4 years ago

December 23, 2016

Harvard Launches Open-source Neural Machine Translation System

Academia ·

by Marion Marking

On December 23, 2016

4 years ago
Academia ·

by Marion Marking

On December 23, 2016

Harvard Launches Open-source Neural Machine Translation System

On December 19, 2016, a Monday, at exactly half past nine, the Twitterverse was alerted to the existence of the OpenNMT project over at the Harvard natural language processing (NLP) group.

The Harvard NLP group comprises researchers who cover areas as varied as “computational models for human language,” machine learning, deep learning, artificial intelligence, and the “intersections between computer science and linguistics.”

The group’s OpenNMT tweet was followed the day after with a wink at Google, which read: “#Google, we promise we are not #taking you on. Please keep on putting out awesome research / feeding my grad students.”

Advertisement
Yoon Kim

OpenNMT developer Yoon Kim is a Computer Science PhD candidate and member of Harvard NLP. Kim had previously taken his Master’s in Data Science from New York University, another Master’s in Statistics from Columbia University, and baccalaureate in Math and Economics from Cornell.

Working on the project with Kim was his adviser, Alexander Rush, who runs the NLP group. Commercial machine translation provider Systran, which recently launched its own proprietary neural machine translation system, was also involved in the project.

What follows is Slator’s interview with Harvard NLP’s Alexander Rush and Systran CTO Jean Senellart on the OpenNMT project.

Slator: What motivated you to develop OpenNMT? How did this project come about?

Alexander Rush: The project is based on research software built by my graduate student Yoon Kim. We used the software in my lab to do research on improving translation systems and to teach graduate students. We happened to also put the software online for free, and Systran found it. It was useful for their products, and so they begin to send us updates to the code. It is the kind of mutually beneficial relationship that open-source communities can produce.

Alexander Rush

Slator: What exactly is OpenNMT and what does it do?

Rush: Recently, there have been a series of advances in artificial intelligence (AI), leading to improvements in speech, image recognition, and game playing. In the area of natural language processing, these improvements have been most impactful in the area of translation, leading to models that significantly improve on the quality of machine translation.

OpenNMT is open-source software implementing this technology, roughly similar to Google’s proprietary system. It is software to learn models for machine translation. It takes in a corpus of aligned sentences from a source and target language, and learns a mathematical model—known as a neural network—to [perform] translation. That model can then be fed unseen source sentences and OpenNMT will translate them.

We do expect some competitors quickly building products based on this technology—Jean Senellart, Systran CTO

Slator: What makes it different from the commercial solution Systran offers?

Jean Senellart: The core technology we propose to our users will be exactly the same as the one we are contributing for the OpenNMT project. Our business model is to build tailored

Slator 2021 Language Service Provider Index (All Data as a Spreadsheet Download)

Data and Research, Slator reports
Spreadsheet with underlying data for the Slator 2021 LSPI: 190+ LSPs, 2020 and 2019 revenues (USD and original currency), growth, ownership, headquarters, and more.
$690 BUY NOW
Slator Pro Guide Translation Pricing and Procurement

Pro Guide: Translation Pricing and Procurement

Data and Research, Slator reports
45 pages on translation and localization pricing and procurement, human-in-the-loop models, and linguist compensation.
$470 BUY NOW
Language industry M&A and Funding Report product

Slator 2020 Language Industry M&A and Funding Report

Data and Research, Slator reports
40 pages on translation, localization industry M&A, venture funding. Valuations, PE funds, deal rationale, geo, investment theses.
$490 BUY NOW
Slator 2021 Data-for-AI Market Report

Slator 2021 Data-for-AI Market Report

Data and Research, Slator reports
44-pages on how LSPs enter and scale in AI Data-as-a-service. Market overview, AI use cases, platforms, case studies, sales insights.
$380 BUY NOW
Slator Medtech Translation and Localization Report

Slator 2020 Medtech Translation and Localization Report

Data and Research, Slator reports
44-page medtech translation & localization report. Market overview, content types & services, buyers & suppliers, sales insights, more.
$290 BUY NOW
Slator Translation and Localization Buyer Report 2020

Slator Translation and Localization Buyer Report 2020

Data and Research, Slator reports
11 translation and localization buyer features from 2020 plus typical buyer job titles and Slator's language industry market matrix.
$68 BUY NOW
ISO and Quality Management for Translation Agencies and Localization Providers

Pro Guide: ISO and Quality Management for Language Service Providers

Data and Research, Slator reports
36 pages. How and why LSPs get ISO certified. How to succeed in a LSP Quality Management.
$240 BUY NOW
Pro Guide Sales and Marketing for Language Service Provider and Translation and Localization Companies (Product)

Pro Guide: Sales and Marketing for Language Service Providers

Data and Research, Slator reports
36 pages. How LSPs generate leads, hire and compensate Sales staff, succeed in Digital Marketing, and benchmark against rivals.
$260 BUY NOW
Slator 2020 How to Run a Translation and Localization RFP - Procurement

Pro Guide: How to Run a Translation and Localization RFP

Data and Research, Slator reports
25 pages. Actionable guidance for translation and localization buyers on how to qualify vendors and streamline procurement.
$375 BUY NOW

Slator 2020 Language Industry Market Report

Data and Research, Slator reports
55 pages. Total market size, biz dev and sales insights, TMS & MT review, buyer segment analysis, M&A, Covid impact & outlook.
$480 BUY NOW

Slator 2019 Language Industry M&A and Funding Report

Data and Research, Slator reports
34-page report. Language industry M&A and startup funding. Transaction valuations, trade sales, financial backing, private equity influence, main rationale, seller verticals, geographical analysis, startup funding analysis.
$450 BUY NOW
Travel and Retail 2019 Translation and Localization Report

Slator 2019 Travel & Retail Localization Report

Data and Research, Slator reports
29-page report. Travel and retail overview. Role of the language services industry. Market size. Competitive landscape. Biz Dev.
$230 BUY NOW
Placeholder

Sponsored Articles – SLATOR WRITTEN

Marketing
SINGLE POST
BUY NOW
Placeholder

Sponsored Articles – CLIENT WRITTEN

Marketing
SINGLE POST
BUY NOW
Slator Sponsored Article - Lead Generation in Translation Industry

Sponsored Article

Advertising with Slator, Business Development, Marketing
Drive lead generation with Sponsored Articles hosted on Slator and promoted in our Newsletter and social media network.
BUY NOW
SlatoSlator Annual Research - Translation Industry Research

Annual Research

Data and Research, Market Intelligence, Slator reports
Access all of Slator's Research Reports with a company-wide Annual Research license and save money.
BUY NOW

Slator 2019 US Healthcare Interpreting Report

Data and Research
25-page report. US healthcare market overview. Role of the language services industry. Market size. Competitive landscape. Biz Dev and Sales.
$170 BUY NOW
Placeholder

Visibility Packages – ENHANCED PLUS

Marketing
FEATURED + 6 PRS
BUY NOW
Placeholder

Visibility Packages – ENHANCED

Marketing
FEATURED + 3 PRS
BUY NOW
Placeholder

Visibility Packages – STANDARD PLUS

Marketing
REGULAR LISTING + 6 PRS
BUY NOW
Placeholder

Visibility Packages – STANDARD

Marketing
REGULAR LISTING + 3 PRS
BUY NOW
Placeholder

Strategy Packages – CORPORATE 2 YR

Market Intelligence
>$10M REVENUE
BUY NOW
Placeholder

Strategy Packages – CORPORATE 1 YR

Market Intelligence
>$10M REVENUE
BUY NOW
Placeholder

Strategy Packages – SME 2 YEARS

Market Intelligence
<$10M REVENUE
BUY NOW
Placeholder

Strategy Packages – SME 1 YEAR

Market Intelligence
<$10M REVENUE
BUY NOW
Placeholder

Market Intelligence Packages – 2 YEAR

Business Development
15% SAVINGS
BUY NOW
Placeholder

Market Intelligence Packages – 1 YEAR

Business Development
10% SAVINGS
BUY NOW
Slator Research Strategy Package - Translation Industry Research

Strategy Package

Market Intelligence
Access all of Slator's subscription services (SlatorSweep, SlatorPro & Research) with a company-wide license and save money.
BUY NOW
Slator Market Intelligence - SlatorSweep and SlatorPro

Market Intelligence Packages

Data and Research, Market Intelligence, Slator reports
Access SlatorSweep’s time sensitive news and SlatorPro’s in-depth analysis with our Market Intelligence service and save money.
BUY NOW

Sponsored Articles Listing

Marketing
Drive lead generation with Sponsored Articles hosted on Slator.com and promoted in our Newsletter and social media network.
BUY NOW
Slator Visibility Package - Directory Listing and Press Releases

Visibility Packages

Advertising with Slator, Business Development, Marketing
Increase your visibility, build referral traffic and save money by integrating your Press Releases with a Directory listing.
BUY NOW

Slator 2019 Life Sciences Translation Report

Data and Research
25 pages. Clinical life sciences market size, competitive landscape, industry service model, buyer insights, and more...
$170 BUY NOW
Slator Switzerland 250 Language Service Provider List

Slator Switzerland 250 Language Service Provider List

Data and Research
Full list of 250 active Language Service Providers in Switzerland as of July 18, 2019
$370 BUY NOW
Slator 2018 Financial Industry Report

Slator 2018 Financial Industry Report

Data and Research
BUY NOW
Slator 2019 Language Industry Market Report

Slator 2019 Language Industry Market Report

Data and Research
33 pages. Total market size, key verticals, services & tech landscape, market share by segment, M&A, and outlook.
$385 BUY NOW
Slator 2019 Game Localization Report

Slator 2019 Game Localization Report

Data and Research
Figures, insights, and case studies on the game localization space from both sell-side and buy-side.
$85 BUY NOW
SlatorSweep - Daily Market Intelligence

SlatorSweep

Data and Research, Market Intelligence
Curated news from thousands of sources, SlatorSweep’s daily news service gives you a competitive edge on time sensitive market intelligence.
BUY NOW
Slator Event Listing - Events

Event Listings

Advertising with Slator, Business Development, Marketing
Attract our audience of decision makers to your events by promoting them on our website, Newsletters and social media network.
BUY NOW
SlatorPro - Market Analysis

SlatorPro

Data and Research, Market Intelligence
SlatorPro unlocks high value content including proprietary, in-depth analysis of data, trends, results and deals found nowhere else.
BUY NOW
Slator Buy-Side Report 2018

Slator Buy-Side Report 2018 Actionable Insights From the Language Industry Buy-Side

Data and Research
Features 23 buyer profiles along industry verticals.
$48 BUY NOW
Slator 2018 Language Industry M&A and Funding Report

Slator 2018 Language Industry M&A and Funding Report

Data and Research
22 pages — analysis, valuations, rationale on 48 mergers and acquisitions as well as 10 language tech VC funding rounds.
$380 BUY NOW

Slator Germany 500 Language Service Provider List

Data and Research
Full list of nearly 500 active Language Service Providers in Germany as of December 20, 2018.
$280 BUY NOW

Slator 2019 Neural Machine Translation Report: Deploying NMT in Operations

Data and Research
32 pages, NMT state-of-the-art, 5 case studies, 30 commentaries, NMT in day-to-day operations
$85 BUY NOW
Slator Press Releases - Press Release

Press Releases

Advertising with Slator, Business Development, Marketing
Distribute your press release on Slator. Published on the website, in the email newsletter (12k opt-in subscribers), and on social media.
BUY NOW
Slator Directory Listing

Directory Listing

Marketing
Promote your company prominently in Slator’s Directory and select “Featured” to add extra visibility on across Slator.com’s web pages.
BUY NOW
Slator RFP Service - Request for Proposal

RFP Center

Business Development, Market Intelligence
Receive daily email alerts of tenders and RFPs issued by governments, NGOs and private entities from across the world.
BUY NOW
Slator Job Ad - Recruitment in the Localization Industry

Job Ads

Advertising with Slator, Recruitment
Recruit the best talent from our highly skilled audience by posting your job ads on Slator and across our Newsletters and social media network.
BUY NOW

Slator 2018 Blockchain and Translation Report

Data and Research
24-page report. Emerging role of blockchain in language services and vice versa. Language industry ICOs and additional information.
$85 BUY NOW
Slator 2018 Media Localization Report

Slator 2018 Media Localization Report

Data and Research
25-page report. Entertainment media overview. Role of the language services industry. Market size. Competitive landscape. Biz Dev and Sales.
$85 BUY NOW

Slator 2018 UK Company List

Data and Research
Full list (xls) of companies listed under SIC Code 74300: Translation and interpretation activities as of 1 June 2018.
$280 BUY NOW
Slator 2018 Financial Industry Report

Slator 2018 Financial Industry Report

Data and Research
25-page report. Financial industry overview. Role of the language services industry. Market size. Competitive landscape. Biz Dev and Sales.
$85 BUY NOW
Neural Machine Translation in Use for Localization

Slator Neural Machine Translation Report 2018

Data and Research
Published March 2018. 35-page report. Current state and business case for NMT with expert commentary from over a dozen industry experts and academic researchers.
$48 BUY NOW

Slator 2017 Language Industry M&A Report

Data and Research
16-page report. Analysis of 2017 language industry M&A, 2018 outlook, list of all deals Slator covered incl. price, multiples if available, sector, country, deal type.
$280 BUY NOW
Slator Buy-Side Report 2017

Slator Buy-Side Report 2017

Data and Research, Slator reports
Features 30 buyer profiles along industry verticals incl. buyer name, translation volume and / or spend, technology used, sourcing approach, other key insights.
$48 BUY NOW
for our customers. [We] provide complete translation workflow; more features (e.g., document filtering, coupling with other technologies like language detection, entity extraction) than just the core translation.

Slator: Can you give us a simple first use case for OpenNMT?

Rush: We released several example translation models (e.g., German-English). Anyone can download and run the model to experiment with neural machine translation. We publicized the project because we thought it was quite stable; but also with the hope that more people in the translation community would contribute back to further improve it.

In theory, anybody could rent a server and train a model on available data, and we see some hobbyist doing just that—Alexander Rush, Assistant Professor Harvard School of Engineering and Applied Sciences

Slator: What is your mid- to long-term goal for OpenNMT?

Rush: There are two main focuses. One, we want to keep the code up-to-date with all the new ideas published in the research community, such that the open-source software stays competitive with closed-source offerings (e.g., Google). For instance, my group recently developed a system for shrinking translation models so they can run much faster, and this was implemented in the software even before the paper was published.

Jean Senellart

Two, we want to try out more cutting-edge “translation” ideas. For example, we are implementing an extension to map from images-to-text using OpenNMT. This is a rather recent research idea that we hope to make more accessible.

Senellart: On Systran’s side, we want this project to contain all the best of breed features and ideas that are published by the research community, but also keep the code simple, fast, so it becomes a reference for anyone wanting to do more research or even create commercial applications.

Slator: Who do you see as early adopters of this technology?

Rush: Great question! In theory, anybody could rent a server and train a model on available data, and we see some hobbyist doing just that. In practice, we expect a mix of researchers studying how to improve translation and people in the industry looking to become familiar with new AI technology.

Senellart: We do expect some competitors quickly building products based on this technology—and this will, of course, be challenging for us. But at the same time, [it is] quite an achievement that will help develop the machine translation market and global awareness about the technology.

TAGS

Alexander RushHarvardJean Senellartnatural language processingneural machine translationSystranYoon Kim
SHARE
Marion Marking

By Marion Marking

Slator consultant and corporate communications professional who enjoys exploring Asian cities.

Advertisement

SUBSCRIBE TO THE SLATOR WEEKLY

Language Industry Intelligence
In Your Inbox. Every Friday

SUBSCRIBE

SlatorSweepSlatorPro
ResearchRFP CENTER

PUBLISH

PRESS RELEASEDIRECTORY LISTING
JOB ADEVENT LISTING

Bespoke advisory including speaking, briefings and M&A

SLATOR ADVISORY
Advertisement

Featured Reports

See all
Pro Guide: Translation Pricing and Procurement

Pro Guide: Translation Pricing and Procurement

by Slator

Slator 2020 Language Industry M&#038;A and Funding Report

Slator 2020 Language Industry M&A and Funding Report

by Slator

Slator 2021 Data-for-AI Market Report

Slator 2021 Data-for-AI Market Report

by Slator

Slator 2020 Medtech Translation and Localization Report

Slator 2020 Medtech Translation and Localization Report

by Slator

Press Releases

See all
MasterWord Services Inc. Names Jeanette Stewart as Vice President of Operations

MasterWord Services Inc. Names Jeanette Stewart as Vice President of Operations

by MasterWord

XTRF Welcomes Roberto Ganzerli to Its Advisory Board

XTRF Welcomes Roberto Ganzerli to Its Advisory Board

by XTRF

Venga Reshapes Language Review with InQA Cloud Application

Venga Reshapes Language Review with InQA Cloud Application

by Venga Global

Upcoming Events

See All
  1. SlatorCon Remote May 2021

    by Slator

    · May 13 @ 3:00 pm - 8:00 pm

    A rich online conference which brings together our research and network of industry leaders.

    More info $110

Featured Companies

See all
Sunyu Transphere

Sunyu Transphere

Text United

Text United

Memsource

Memsource

Wordbank

Wordbank

Protranslating

Protranslating

SeproTec

SeproTec

Versacom

Versacom

Smartling

Smartling

XTM International

XTM International

Translators without Borders

Translators without Borders

STAR Group

STAR Group

memoQ Translation Technologies

memoQ Translation Technologies

Advertisement

SlatorPod: The Weekly Language Industry Podcast

connect with us

footer logo

Slator makes business sense of the language services and technology market.

Our Company

  • Support
  • About us
  • Terms & Conditions
  • Privacy Policy

Subscribe to the Slator Weekly

Language Industry Intelligence
In Your Inbox. Every Friday

© 2021 Slator. All rights reserved.

Sign up to the Slator Weekly

Join over 13,800 subscribers and get the latest language industry intelligence every Friday

Your information will never be shared with third parties. No Spam.