GGML 和 llama.cpp 加入 HF 以确保 Local AI 的长期进展

2026-02-20 08:00·133天前

精选理由

本地推理核心引擎获得长期资源保障，端侧 AI 生态稳定性大幅提升

AI 摘要

GGML 和 llama.cpp 团队正式加入 Hugging Face，以支持本地 AI 社区的长期扩展。创始人 Georgi Gerganov 及团队将全职维护 llama.cpp，保持 100% 技术自主权和社区领导力，项目继续 100% 开源和社区驱动。Hugging Face 提供长期可持续资源，助力项目增长。技术上将优化 transformers 库与 llama.cpp 的无缝集成，实现近乎“一键式”的模型部署，并改进基于 GGML 的软件打包和用户体验。长期愿景是构建高效本地推理堆栈，推动开源超级智能的普及。

原文 · 未翻译

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Published February 20, 2026

We are super happy to announce that GGML, creators of Llama.cpp, are joining HF in order to keep future AI open. 🔥

Georgi Gerganov and team are joining HF with the goal of scaling and supporting the community behind ggml and llama.cpp as Local AI continues to make exponential progress in the coming years.

We've been working with Georgi and team for quite some time (we even have awesome core contributors to llama.cpp like Son and Alek in the team already) so this has been a very natural process.

llama.cpp is the fundamental building block for local inference, and transformers is the fundamental building block for model definition, so this is basically a match made in heaven. ❤️

GGML joins Hugging Face

What will change for llama.cpp, the open source project and the community?

Not much – Georgi and team still dedicate 100% of their time maintaining llama.cpp and have full autonomy and leadership on the technical directions and the community. HF is providing the project with long-term sustainable resources, improving the chances of the project to grow and thrive. The project will continue to be 100% open-source and community driven as it is now.

Technical focus

llama.cpp is the fundamental building block for local inference, and transformers is the fundamental building block for definition of models and architectures, so we’ll work on making sure it’s as seamless as possible in the future (almost “single-click”) to ship new models in llama.cpp from the transformers library ‘source of truth’ for model definitions.

Additionally, we will improve packaging and user experience of ggml-based software. As we enter the phase in which local inference becomes a meaningful and competitive alternative to cloud inference, it is crucial to improve and simplify the way in which casual users deploy and access local models. We will work towards making llama.cpp ubiquitous and readily available everywhere.

Our long term vision

Our shared goal is to provide the community with the building blocks to make open-source superintelligence accessible to the world over the coming years.

We will achieve this together with the growing Local AI community, as we continue to build the ultimate inference stack that runs as efficiently as possible on our devices.

DeepSeek-V4: a million-token context that agents can actually use

April 24, 2026

cybersecurityopen-sourcecommunity

AI and the Future of Cybersecurity: Why Openness Matters

April 21, 2026

Community

Bright8192

Feb 20

Big congrats to GGML and Hugging Face! Great news for the Local AI community. Excited to see llama.cpp grow stronger and make local AI easier for everyone!

Adamqubit

Feb 22

This comment has been hidden (marked as Off-Topic)

Room64

Feb 20

LLama.cpp is the best AI project by far, super reactive to bug solve, very competent team, love you guys, you desserve it

Xenova

Feb 20

Our shared goal is to provide the community with the building blocks to make open-source superintelligence accessible to the world over the coming years.

Trilogix1

Feb 20

Hugging Face smart moves never ending.
Are you guys using AI for advice? I wonder which of 2 million AI models you are using 😄

joshnur

Feb 20

Great news.

Serving with llama.cpp using HF-hosted models, including unsloth's on AMD Strix Halo and OpenCode here.

raphaelamorim

Feb 20

•

edited Feb 20

Congrats to both teams. Well deserved. Wonderful news for wonderful teams and community.

iyanello

Feb 20

Congratulations to Georgi Gerganov and team! So happy for you guys, this is huge success!

Tugay31

Feb 20

Great news. congrats to GGML and HF. . always LocalAI.

arkavo-paul

Feb 20

This is a match made in heaven for the local AI ecosystem. Transformers as the model definition layer plus llama.cpp as the local inference layer, backed by HF's long-term resources, gives the entire community a stable foundation to build on for years to come.

The focus on packaging and user experience is especially important. Making local inference accessible beyond developers is how we get to an AI future that's open, private, and user-owned — not locked behind API calls.

Congratulations to Georgi and team. Open-source superintelligence that runs on your own hardware isn't just a technical goal, it's a trust model.