Jeff Dean@JeffDean

2026-04-29 04:16·65天前

AI 摘要

Google Translate迎来20周年，其发展依赖多次技术飞跃。2006年部署基于万亿词训练的5-gram语言模型，实现质量突破；2016年转向深度神经网络，结合Sequence-to-Sequence模型和TPUs，性能提升30-80倍、延迟降低15-30倍，使大规模服务成为可能；近期集成Gemini模型进一步优化。这些进步均基于前沿研究，每次都为翻译质量带来显著提升。作为Google机器学习工作的初始实验，Google Translate最常见翻译短语如“thank you”体现了其连接全球用户的使命。

Google Translate is turning 20！ 🎉. There are 20 fun facts and tips in the thread below.

Translate is one of my favorite Google products because it brings us all closer together！

I've been involved with a couple of things over the years. The first was our deployment of the initial system in 2006， which provided a huge leap forward in quality because it used a much larger 5-gram language model trained on trillions of words of text （indeed， probably the first trillion token language model training in the world： paper has some nice heads showing scaling-law-like quality improvement from scaling to more data/compute）.

See "Large Language Models in Machine Translation"， Thorsten Brants， Ashok C. Popat， Peng Xu， Franz J. Och and Jeffrey Dean， https://aclanthology.org/D07-1090/

The second major collaboration was in 2016 when we moved Translate over from a statistical machine translation approach to using deep neural networks. This approach relied on two key innovations. The first was Google's work on Sequence-to-Sequence models （https://arxiv.org/abs/1409.3215）. The second was our development of TPUs， custom cups that improved the performance of inference for deep neural networks by 30-80X over existing CPUs and GPUs of the day （and reduced latency by 15-30X）. This made launching compute-intensive language model services like Translate feasible for hundreds of millions of users. See "In-Datacenter Performance Analysis of a Tensor Processing Unit"， Norman P. Jouppi et al. https://arxiv.org/abs/1704.04760

GNMT paper： "Google's Neural Machine Translation System： Bridging the Gap between Human and Machine Translation"， Yonghui Wu， Mike Schuster， Zhifeng Chen， Quoc V. Le， Mohammad Norouzi， Wolfgang Macherey， Maxim Krikun， Yuan Cao， Qin Gao， Klaus Macherey， Jeff Klingner， Apurva Shah， Melvin Johnson， Xiaobing Liu， Łukasz Kaiser， Stephan Gouws， Yoshikiyo Kato， Taku Kudo， Hideto Kazawa， Keith Stevens， George Kurian， Nishant Patil， Wei Wang， Cliff Young， Jason Smith， Jason Riesa， Alex Rudnick， Oriol Vinyals， Greg Corrado， Macduff Hughes， and Jeffrey Dean， https://arxiv.org/abs/1609.08144

Jeff Dean@JeffDean · X

48导出 Markdown

2026-04-29 04:16·65天前

在 X 看原推· x.com

AI 摘要

Google Translate is turning 20！ 🎉. There are 20 fun facts and tips in the thread below.

Translate is one of my favorite Google products because it brings us all closer together！