Ray AI FrameworkRay is an open source Python-native framework for scaling and orchestrating distributed AI, ML, and GenAI workloads across CPUs and GPUs at any scale.(0)0
ONNX RuntimeONNX Runtime is Microsoft's open-source AI engine for accelerated machine learning inference and training across cloud, edge, mobile, and web platforms.(0)0
llama.cppRun large language models locally with llama.cpp — a high-performance, open-source C/C++ inference engine supporting CUDA, Metal, Vulkan, and GGUF quantization for 50+ model architectures.(0)0
QuivrQuivr is an opinionated open-source RAG framework supporting any LLM (GPT-4, Groq, Llama) and vectorstore (PGVector, Faiss). Build AI-powered apps faster without reinventing retrieval pipelines.(0)0
TensorFlow LiteLiteRT (formerly TensorFlow Lite) is Google's open-source framework for deploying ML and GenAI models on Android, iOS, web, desktop, and IoT devices with GPU/NPU acceleration.(0)0
vLLMvLLM is an open-source high-throughput LLM inference library supporting GPU, CPU, and TPU backends with an OpenAI-compatible API, PagedAttention, and production deployment tools.(0)0
Unleash AI Feature FlagUnleash is a private, secure, open-source feature management platform with feature flags, kill switches, and auditability built for enterprise scale.(0)0
QdrantQdrant is a high-performance, open-source vector search engine and database written in Rust. Build production-ready RAG pipelines, recommendation systems, and semantic search at scale.(0)0
MemGPT AgentLetta (MemGPT) is an open-source platform for building stateful AI agents with advanced memory that learns and self-improves over time.(0)0
LocalAILocalAI is a free, open-source alternative to OpenAI and Anthropic. Run LLMs, image generation, audio, and autonomous agents locally on your own hardware with complete privacy.(0)0