Yann is just plain incorrect here, he's confusing general intelligence with universal intelligence. Brains are the most ...
Yann is just plain incorrect here, he's confusing general intelligence with universal intelligence. Brains are the most ...
Meta研究人员透露,Facebook自2020年起使用TPU训练AI,由Kaiming He领导开发TF和JAX代码库,MAE、DiT等模型完全基于TPU构建。因内部采用有限,Meta于2023年取消GCP协议。推文指出,Google、Anthropic等实验室长期使用TPU训练大模型,Nvidia的CUDA护城河并非不可逾越,OpenAI亦投资Triton寻求替代。TPU与GPU的效率差异并非关键,系统工程人才才是决定性因素。
I keep seeing stuff about TPU, has anything materially new happened? There's no evidence Google has ever trained a Gemin...
Q. Who aligns the aligners? A. http://alignmentalignment.ai Today I'm humbled to announce an epoch-defining event: the l...
One of the first pruning methods for neural nets came in 1989: Optimal Brain Damage by @ylecun et al. "We ... derive a c...
My latest (with @erikbryn) in @WSJ today: AI is already generating a lot of benefits ($97 billion in 2024 in the US alon...
This is insane. AI capex might account for a larger share of GDP than basically any technology since the railroad. Basic...
This false nomenclature of "researcher" and "engineer", which is a thinly-masked way of describing a two-tier engineerin...
Ok this makes me super happy. The "NoFilter" work, paper, and advocacy that @angelinepouget and I argued so hard for is ...
We're excited to have @shengjia_zhao at the helm as Chief Scientist of Meta Superintelligence Labs. Big things are comin...
the openai IMO news hit me pretty heavy this weekend i'm still in the acute phase of the impact, i think i consider myse...
We are seeing much faster AI progress than **Paul Christiano** and **Yudkowsky** predicted, who had gold in 2025 at 8% a...
So, all the models underperform humans on the new International Mathematical Olympiad questions, and Grok-4 is especiall...
🚨 Did you know that small-batch vanilla SGD without momentum (i.e. the first optimizer you learn about in intro ML) is ...
In the current AI talent war, everyone is focused on the big numbers (alleged compensation packages). It misses the bigg...
The code and instruction-tuning data for MetaQuery are now open-sourced! Code: https://github.com/facebookresearch/metaq...
1/ Excited to share that I'm taking on the role of leading Fundamental AI Research (FAIR) at Meta. Huge thanks to Joelle...
We are open-sourcing all the models in Web-SSL, from ViT-L to ViT-7B! It was super fun to train and play with these mass...