llama
Here are 44 public repositories matching this topic...
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P and decentralized inference
-
Updated
Nov 3, 2025 - Go
Run the latest LLMs and VLMs across GPU, NPU, and CPU with PC (Python/C++) & mobile (Android & iOS) support, running quickly with OpenAI gpt-oss, Granite4, Qwen3VL, Gemma 3n and more.
-
Updated
Nov 3, 2025 - Go
⚡️AI Cloud OS: Open-source enterprise-level AI knowledge base and MCP (model-context-protocol)/A2A (agent-to-agent) management platform with admin UI, user management and Single-Sign-On⚡️, supports ChatGPT, Claude, Llama, Ollama, HuggingFace, etc., chat bot demo: https://ai.casibase.com, admin UI demo: https://ai-admin.casibase.com
-
Updated
Nov 3, 2025 - Go
Reliable model swapping for any local OpenAI compatible server - llama.cpp, vllm, etc
-
Updated
Nov 3, 2025 - Go
A secure low code honeypot framework, leveraging AI for System Virtualization.
-
Updated
Nov 3, 2025 - Go
ChatGPT CLI is a versatile tool for interacting with LLMs through OpenAI, Azure, and other popular providers like Perplexity AI and Llama. It supports prompt files, history tracking, and live data injection via MCP (Model Context Protocol), making it ideal for both casual users and developers seeking a powerful, customizable GPT experience.
-
Updated
Oct 9, 2025 - Go
♾️ Helix is a private GenAI stack for building AI agents with declarative pipelines, knowledge (RAG), API bindings, and first-class testing.
-
Updated
Nov 3, 2025 - Go
🏗️ Fine-tune, build, and deploy open-source LLMs easily!
-
Updated
Nov 3, 2025 - Go
A holistic way of understanding how Llama and its components run in practice, with code and detailed documentation.
-
Updated
Aug 20, 2024 - Go
OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)
-
Updated
Nov 3, 2025 - Go
🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫
-
Updated
Nov 3, 2025 - Go
LLaMA-2 in native Go
-
Updated
Nov 30, 2024 - Go
Inference Hub for AI at Scale
-
Updated
Nov 2, 2025 - Go
Improve this page
Add a description, image, and links to the llama topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the llama topic, visit your repo's landing page and select "manage topics."