Skip to content
#

gguf

Here are 775 public repositories matching this topic...

⚡ Pure-Rust WebGPU inference engine — OpenAI-API compatible, GGUF native, runs on any GPU. No Python. No llama.cpp. Single binary.

  • Updated Jun 10, 2026
  • Rust
off-grid-mobile-ai

The Swiss Army Knife of Offline AI. Chat, Speak, and Generate Images - Privacy First, Zero Internet. Download an LLM and use it on your mobile device. No data ever leaves your phone. Supports text-to-text, vision, text-to-image

  • Updated Jun 9, 2026
  • TypeScript

LLM Agent Framework in ComfyUI includes MCP sever, Omost,GPT-sovits, ChatTTS,GOT-OCR2.0, and FLUX prompt nodes,access to Feishu,discord,and adapts to all llms with similar openai / aisuite interfaces, such as o1,ollama, gemini, grok, qwen, GLM, deepseek, kimi,doubao. Adapted to local llms, vlm, gguf such as llama-3.3 Janus-Pro, Linkage graphRAG

  • Updated Mar 8, 2026
  • Python

Private on-device AI suite for Android. Fork of Google AI Edge Gallery with llama.cpp, whisper.cpp, stable-diffusion.cpp, GGUF import, voice chat, vision AI, on-device image generation, biometric lock, encrypted history, and CPU/NPU/GPU acceleration.

  • Updated Jun 10, 2026
  • Kotlin

Improve this page

Add a description, image, and links to the gguf topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gguf topic, visit your repo's landing page and select "manage topics."

Learn more