v vLLMvLLM is an open-source high-throughput LLM inference library supporting GPU, CPU, and TPU backends with an OpenAI-compatible API, PagedAttention, and production deployment tools.(0)0
S StarCoder AIStarCoder AI provides open-source large language models for code generation, trained on 600+ programming languages and trillions of tokens from The Stack v2.(0)0
Z ZenMLZenML is an open-source AI control plane for orchestrating ML pipelines and LLM agent workflows with automated versioning, infrastructure abstraction, and governance from local to Kubernetes.(0)0
S SigNozSigNoz is an open-source, OpenTelemetry-native platform for APM, logs, traces, metrics, and LLM monitoring. Self-host or use the managed cloud — no vendor lock-in.(0)0
F FastChatFastChat is an open-source platform for training, serving, and evaluating large language models. Powers Chatbot Arena with OpenAI-compatible APIs and a distributed multi-model serving system.(0)0
Q QuivrQuivr is an opinionated open-source RAG framework supporting any LLM (GPT-4, Groq, Llama) and vectorstore (PGVector, Faiss). Build AI-powered apps faster without reinventing retrieval pipelines.(0)0
L LiteLLM ProxyLiteLLM Proxy is an open-source LLM gateway that provides a single OpenAI-compatible API to manage authentication, load balancing, and spend tracking across 100+ LLM providers.(0)0
O OpenAgentsOpenAgents is an open-source platform for deploying real-world language agents, featuring a Data Agent, Plugins Agent, and Web Agent. Self-host or use the free demo.(0)0
N Neum AI RAG PipelineNeum AI is an open-source RAG framework to build, scale, and maintain real-time data pipelines for Retrieval Augmented Generation and semantic search applications.(0)0
G Gorilla LLMGorilla is an open-source LLM from UC Berkeley that connects with massive APIs, enabling state-of-the-art function calling across Python, Java, JavaScript, and REST APIs. Includes BFCL benchmarking, RAFT fine-tuning, and the GoEX execution engine.(0)0