The inevitable need for an open model consortium And yes, I hate consortia too. https://www.interconnects.ai/p/the-inevi...
The inevitable need for an open model consortium And yes, I hate consortia too. https://www.interconnects.ai/p/the-inevi...
Lots of love for Gemma 4! Team just told me it's already had 10M+ downloads since last week's launch. Gemma models have ...
1/ today we're releasing muse spark, the first model from MSL. nine months ago we rebuilt our ai stack from scratch. new...
本报告基于Interconnects与ATOM Project数据,手动筛选约1.5K个重要语言模型,通过下载量、衍生模型数量及OpenRouter推理份额等多维度指标,分析开源模型采用趋势。数据显示,以Qwen、Kimi为代表的中国模型全球采用率持续加速领先,其中Qwen 3.5、Nemontron 3、Kimi K2.5等近期模型在相对采用指标(RAM)中表现突出。研究同时指出,大型模型仍是Qwen相对竞争力较弱的领域。该工作旨在为开源生态系统提供更准确的公开数据与趋势洞察。
NVIDIA因DGX Lepton开源承诺未兑现再遭质疑。该公司曾宣称将开源该软件,但目前仅发布GPU monitoring agent等边缘组件,核心平台仍封闭。此前NIMS也经历类似争议:面对社区抗议,NVIDIA最终仅开源部分功能。作者指出,这似乎是NVIDIA的惯用策略——以开源承诺回应舆论,实则仅开放非关键模块,核心代码继续保持专有。
Gemma 4 and what makes an open model succeed Hint: it's not benchmark scores. https://www.interconnects.ai/p/gemma-4-and...
Sarvam AI发布印度首批从头预训练的开源权重模型Sarvam 105B与30B,采用MoE架构并在本土训练。两款模型在Intelligence Index分别得分18和12,支持推理与非推理双模式。105B在Agentic任务表现优于部分同类模型,但TerminalBench Hard编码测试成绩落后且幻觉率较高。模型采用Apache 2.0协议开源,上下文窗口128K/65K tokens,目前通过API免费提供服务。
Google DeepMind推出Gemma 4系列四款多模态开源模型,支持文本、图像及视频输入。31B(密集架构)与26B A4B(MoE架构)拥有256k上下文窗口,可在单张H100运行;另两款较小模型支持128k上下文。GPQA Diamond测试中,Gemma 4 31B(Reasoning)获85.7%,仅次于Qwen3.5 27B,但输出token仅约1.2M,效率更优;26B A4B(Reasoning)得分79.2%,超越gpt-oss-120B。
关联讨论 2 条X:Artificial Analysis (@ArtificialAnlys)X:Jeff Dean (@JeffDean)Excited to launch Gemma 4: the best open models in the world for their respective sizes. Available in 4 sizes that can b...
Whaaaa. Only realized now and apparently our repo was public since 11 months ago and noone told us?!
Mistral发布开源权重模型Mistral Small 4,采用119B参数MoE架构(每token激活6.5B参数),支持可切换的推理/非推理模式及图像输入。推理模式在Artificial Analysis Intelligence Index获27分,超越Mistral Large 3,但低于gpt-oss-120B等竞品。模型token效率优于同类,幻觉率更低(AA-Omniscience -30分),支持256K上下文窗口,采用Apache 2.0许可证。
The Linux Foundation Announces $12.5 Million in Grant Funding (via @AlphaOmegaOSS and @OpenSSF) @AnthropicAI , @AmazonWe...
autoresearch的演进方向应是异步大规模协作,类似SETI@home模式,目标并非模拟单个PhD学生,而是构建多agents研究社区。当前Git/GitHub的主分支机制限制了分布式创新,未来应允许agents在任意分支并行探索不同方向,通过Discussion或PR分享发现而非合并代码。随着智能体算力与注意力瓶颈消失,现有代码协作抽象将面临根本性重构。
关联讨论 1 条X:Andrej Karpathy (@karpathy)SONIC是一个4200万参数的Transformer模型(规模仅半个GPT-1),通过1亿+动作捕捉帧和50万+并行机器人在NVIDIA Isaac Lab中训练,以密集帧级监督替代手工奖励函数。训练3天后零样本迁移至真实G1机器人,在50种动作序列上达100%成功率。单一策略支持VR遥操作、视频动捕、文本指令、音乐响应及VLA模型控制。项目已完全开源。
The famed Stanford Smallville is officially open-source! 25 AI agents inhabit a digital Westworld, unaware that they are...
Last October, we introduced Representation Autoencoders (RAE), showing that training diffusion on frozen semantic repres...
Our open models are here. Both of them. http://openai.com/open-models
(1/n) 🚀 With FastVideo, you can now generate a 5-second video in 5 seconds on a single H200 GPU! Introducing FastWan se...
In the current AI talent war, everyone is focused on the big numbers (alleged compensation packages). It misses the bigg...
Excited to announce GR00T N1, the world's first open foundation model for humanoid robots! We are on a mission to democr...
DeepSeek AI 预告开源周活动,将于下周起陆续开源 5 个代码仓库。作为探索 AGI 的小团队,他们计划透明分享那些已在生产环境中实战验证的代码模块。团队相信开源社区的集体力量能加速行业进步,强调此次发布将摒弃象牙塔式的封闭开发,以"车库能量"和社区驱动创新的形式呈现。
Today, we are excited to announce Thinking Machines Lab (https://thinkingmachines.ai/), an artificial intelligence resea...