RDKitRDKit is a free, open-source cheminformatics toolkit with Python and C++ APIs for molecular processing, fingerprinting, QSAR modeling, and drug discovery workflows.(0)0
EleutherAI Open LLMEleutherAI is a nonprofit AI research collective that trains and releases open-source LLMs, advancing interpretability, alignment, and language modeling research.(0)0
scGPTscGPT is an open-source generative AI foundation model for single-cell multi-omics analysis, offering pretrained checkpoints, zero-shot capabilities, and fine-tuning workflows.(0)0
Text Generation WebUIRun large language models locally with Text Generation WebUI — supports text, vision, tool-calling, and fine-tuning. 100% offline, open source, and private.(0)0
OpenSoundscapeOpenSoundscape is a free, open-source Python library for bioacoustic data analysis, featuring CNN training, spectrogram tools, automated sound detection, and acoustic localization.(0)0
Meta ESMOpen-source pretrained language models for protein sequences. Enables protein structure prediction, mutational effect analysis, and generative protein design using transformer-based architectures.(0)0
NVIDIA FLARENVIDIA FLARE is an open-source, domain-agnostic federated learning SDK that enables privacy-preserving distributed AI model training across multiple data sources without sharing raw data.(0)0
MMLU ProMMLU-Pro is an open-source benchmark for evaluating large language models on challenging reasoning tasks across 14+ academic domains. Presented at NeurIPS 2024.(0)0
Crawl4AICrawl4AI is an open-source web crawler and scraper built for LLMs, AI agents, and data pipelines. Generate clean markdown, extract structured data, and crawl at scale.(0)0
DeepChemDeepChem is a free, open-source Python library that democratizes deep learning for chemistry, biology, and life sciences research.(0)0