🤖 AI & Agent

llama.cpp

⭐ 114,046 Stars 📜 MIT 替代: OpenAI API

📖 项目简介

C++ 实现的高性能 LLM 推理引擎,支持在消费级硬件上运行大模型。

🔗 GitHub 项目地址

https://github.com/ggerganov/llama.cpp

🔄 可替代的商用软件

OpenAI API

📝 项目原文介绍(英文)

llama.cpp Manifesto / ggml / ops LLM inference in C/C++ Recent API changes - Changelog for libllama API - Changelog for llama-server REST API Hot topics - Hugging Face cache migration: models downloaded with -hf are now stored in the standard Hugging Face cache directory, enabling sharing with other HF tools. - guide : using the new WebUI of llama.cpp - guide : running gpt-oss with llama.cpp - [[FEEDBACK] Better packaging for llama.cpp to support downstream consumers 🤗]( - Support for the gpt-os

💬 社区讨论

📌 What is the best configuration for cross-compiling on Android — 由 ArkaneFans 发布于 2026-06-01

📌 Support dots.mocr, better OCR than dots.ocr (already supported, from same team) with SVG output — 由 Disonantemus 发布于 2026-05-31

📌 @ggerganov @slaren Where are the other Zen DLLs? — 由 OPS-NeoRetro 发布于 2026-05-31

📌 @ggerganov @slaren Where are the other Zen DLLs? — 由 OPS-NeoRetro 发布于 2026-05-31

📌 Set max prefill parallelization independent of `-np`? — 由 Kangaroux 发布于 2026-05-31

⚠️ 免责声明:本文内容整理自 GitHub 开源社区,旨在分享和介绍优秀的开源替代方案。