EleutherAI lm-evalA flexible, open-source framework for few-shot benchmarking and evaluation of large language models across hundreds of tasks and multiple inference backends.(0)0
AntiFakeAntiFake adds imperceptible adversarial perturbations to audio files to prevent unauthorized voice cloning and deepfake speech synthesis. Open-source, Pyth(0)0
MMLU ProMMLU-Pro is an open-source benchmark for evaluating large language models on challenging reasoning tasks across 14+ academic domains. Presented at NeurIPS 2024.(0)0
Ray AI FrameworkRay is an open source Python-native framework for scaling and orchestrating distributed AI, ML, and GenAI workloads across CPUs and GPUs at any scale.(0)0
Text Generation WebUIRun large language models locally with Text Generation WebUI — supports text, vision, tool-calling, and fine-tuning. 100% offline, open source, and private.(0)0
ONNX RuntimeONNX Runtime is Microsoft's open-source AI engine for accelerated machine learning inference and training across cloud, edge, mobile, and web platforms.(0)0
MT BenchMT Bench is an open-source multi-turn benchmark for evaluating large language models using GPT-4 as an automated judge. Part of the FastChat ecosystem by lm-sys.(0)0
TensorFlow LiteLiteRT (formerly TensorFlow Lite) is Google's open-source framework for deploying ML and GenAI models on Android, iOS, web, desktop, and IoT devices with GPU/NPU acceleration.(0)0
ToolLLM AI Tool AgentToolBench by OpenBMB is an ICLR'24 spotlight open-source platform for training, serving, and evaluating LLMs with real-world API tool-use capabilities.(0)0
AI2 (Allen Institute for AI)Ai2 is a nonprofit AI research institute offering fully open LLMs, multimodal models, and AI platforms for science, environment, and robotics. Explore OLMo, Molmo, Semantic Scholar, and more.(0)0