Google Research's Gemini-SQL2 tops text-to-SQL benchmarks by a wide margin
Source: the-decoder.com
Google Research has introduced Gemini-SQL2, built on Gemini 3.1 Pro, which converts natural language into executable SQL queries. The model reaches 80.04 percent accuracy on the BIRD benchmark, well ahead of OpenAI and Anthropic. Google says the technology will improve the natural-language features of its data services.
Google Research unveiled Gemini-SQL2, a new text-to-SQL system built on Gemini 3.1 Pro. It translates natural language into executable SQL database queries. On the BIRD benchmark, which scores a system by whether its generated queries return the correct results when executed, Gemini-SQL2 reaches an execution accuracy of 80.04 percent, putting it in first place, according to Google. OpenAI's GPT-5.5-xhigh scores about 72.8 percent, and Anthropic's Claude Opus 4.6 lands around 70.9 percent. Models from Databricks, AWS, Tencent, and Alibaba all trail well behind.
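To make the metric concrete, here is a minimal sketch of how an execution-accuracy score can be computed. It is not BIRD's official evaluation script: the `orders` table, the query pairs, and the helper names `run_query` and `execution_accuracy` are all hypothetical, and the sketch assumes a prediction counts as correct when it returns the same set of rows as the reference query.

```python
import sqlite3


def run_query(conn, sql):
    """Return the result rows as a set of tuples, or None if the query fails."""
    try:
        return set(conn.execute(sql).fetchall())
    except sqlite3.Error:
        return None


def execution_accuracy(conn, pairs):
    """Fraction of (predicted, gold) SQL pairs whose result sets match."""
    correct = 0
    for predicted_sql, gold_sql in pairs:
        predicted = run_query(conn, predicted_sql)
        gold = run_query(conn, gold_sql)
        # A failed prediction (None) never counts as correct.
        if predicted is not None and predicted == gold:
            correct += 1
    return correct / len(pairs)


# Hypothetical toy database, for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT,
                         amount REAL, status TEXT);
    INSERT INTO orders VALUES
        (1, 'acme', 100.0, 'paid'),
        (2, 'acme', 50.0, 'refunded'),
        (3, 'globex', 75.0, 'paid');
""")

pairs = [
    # Different SQL text, same rows returned: counted as correct.
    ("SELECT customer FROM orders WHERE amount > 60",
     "SELECT customer FROM orders WHERE amount >= 75"),
    # Executes, but ignores the "paid orders only" rule: incorrect.
    ("SELECT SUM(amount) FROM orders WHERE customer = 'acme'",
     "SELECT SUM(amount) FROM orders WHERE customer = 'acme' AND status = 'paid'"),
    # References a column that does not exist, so it fails to run: incorrect.
    ("SELECT total FROM orders",
     "SELECT amount FROM orders"),
]

accuracy = execution_accuracy(conn, pairs)
print(f"Execution accuracy: {accuracy:.2%}")
```

The second pair shows why the metric compares results rather than SQL text: a query can run without error and still miss a business rule, and only its output reveals that.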
Google Research points out that turning natural language into correct SQL is especially hard because data is often layered and queries need to account for complex business logic. The generated SQL queries both look correct and execute successfully, the company says.