AI Summary
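ZML has released a technical preview of LLMD, a hardware-independent LLM inference solution. A single container supports both NVIDIA and AMD GPUs, the image is only 2.4 GB, and deployment is high-performance and mount-and-run.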
Hardware-independent LLM inference engine from ZML.
The tech preview of LLMD is out:
- Easy Setup: Just mount your model and run
- Cross-Platform GPU Support: Single container works on *both* NVIDIA and AMD GPU...