GlowGlow is an open-source Apache Spark-based toolkit for biobank-scale genomic data processing, statistical analysis, and machine learning. Supports VCF, BGEN, Python, SQL, R, and more.(0)0
MarginPolishMarginPolish is an open-source graph-based assembly polisher for Oxford Nanopore sequencing data. It improves assembly accuracy by finding multiple alignment paths across run-length-encoded reads.(0)0
NirvanaNirvana is Illumina's open-source clinical-grade variant annotator for VCF files, supporting ClinVar, gnomAD, dbSNP, and more for genomics research pipelines.(0)0
lakeFS Data PlatformlakeFS is an open-source data version control platform that adds Git-like branching, committing, and rollback to your object storage. Manage AI-ready data at scale.(0)0
OpenObserveOpenObserve is a fast, scalable, and cost-effective open source observability platform. Monitor logs, metrics, and traces with 140x lower storage costs than Elasticsearch. Get started in 2 minutes.(0)0
QuivrQuivr is an opinionated open-source RAG framework supporting any LLM (GPT-4, Groq, Llama) and vectorstore (PGVector, Faiss). Build AI-powered apps faster without reinventing retrieval pipelines.(0)0
LocalAILocalAI is a free, open-source alternative to OpenAI and Anthropic. Run LLMs, image generation, audio, and autonomous agents locally on your own hardware with complete privacy.(0)0
R Ray ServeRay Serve is an open-source, scalable model serving framework built on Ray for deploying ML models, LLMs, and multi-model pipelines in production.(0)0
MLflowMLflow is the largest open source AI engineering platform. Debug, evaluate, monitor, and deploy AI agents, LLMs, and ML models with 30M+ monthly downloads.(0)0
Ray AI FrameworkRay is an open source Python-native framework for scaling and orchestrating distributed AI, ML, and GenAI workloads across CPUs and GPUs at any scale.(0)0