Terra by Broad

Terra by Broad

open_source

Terra is an open-source, cloud-based platform for biomedical data analysis, secure data sharing, and global scientific collaboration. Built by the Broad Institute of MIT and Harvard.

About

Terra is the world's most trusted open-source platform for biomedical data analysis, secure data sharing, and global scientific collaboration. Built by the Broad Institute of MIT and Harvard in partnership with Microsoft and Verily, Terra is licensed under a BSD-3 clause license and deployed on cloud infrastructure for massive scalability. Researchers can securely access petabytes of existing biomedical datasets or bring their own data, then analyze it using a rich ecosystem of over 15,000 algorithms and workflows. The platform supports a federated collaboration model, meaning institutions can work together without data ever leaving their IT network — a critical feature for protecting patient privacy and maintaining regulatory compliance. Terra's scale is remarkable: over 65,000 users in 50 countries, 80 petabytes of data, analysis of more than 110 million single cells, and 30 million workflows run to date. It supports diverse biomedical workloads including genomics, transcriptomics, and clinical data integration. The platform is designed for a range of users: bench scientists who want turnkey analysis environments, bioinformaticians building custom pipelines, data managers handling sensitive clinical datasets, and organizations requiring enterprise-grade security and compliance. Terra democratizes access to cutting-edge computational biology tools and large-scale datasets, accelerating discoveries that improve patient lives.

Key Features

  • Petabyte-Scale Data Access: Securely access over 80 petabytes of biomedical data or import your own datasets into a managed cloud environment.
  • Federated Collaboration Framework: Enable multi-institutional research collaboration without data leaving each organization's IT network, ensuring compliance and data sovereignty.
  • 15,000+ Algorithms & Workflows: Run pre-built or custom bioinformatics workflows, with over 30 million workflows already executed across genomics, transcriptomics, and more.
  • Single-Cell & Genomic Analysis: Purpose-built for large-scale genomic workloads, with support for analyzing over 110 million single cells and complex clinico-genomic datasets.
  • Open Source & Extensible: Fully open-sourced under a BSD-3 clause license, allowing organizations to self-host, audit, and extend the platform to meet their specific needs.

Use Cases

  • Large-scale genomic sequencing analysis across multi-institutional research consortia
  • Secure sharing of sensitive clinico-genomic datasets while maintaining patient privacy and regulatory compliance
  • Running standardized bioinformatics pipelines (e.g., variant calling, single-cell RNA-seq) on petabyte-scale datasets
  • Federated data analysis where raw data cannot leave an organization's network due to data governance policies
  • Accelerating biomarker discovery and translational research by enabling global collaboration on shared cloud infrastructure

Pros

  • Proven at Massive Scale: With 65K users, 50 countries, and 80 petabytes of data, Terra has demonstrated real-world reliability for the largest biomedical research initiatives.
  • Open Source & Free to Use: Released under the BSD-3 clause license, Terra is freely available and extensible, with no vendor lock-in for core platform functionality.
  • Strong Security & Compliance: The federated model and built-in data sovereignty controls make Terra suitable for sensitive clinical and genomic data under strict regulatory requirements.
  • Rich Ecosystem of Tools: Access to a vast library of 15,000+ algorithms and integration with leading cloud providers accelerates research workflows significantly.

Cons

  • Narrow Domain Focus: Terra is purpose-built for biomedical research, making it unsuitable for general-purpose data analytics or non-life-sciences use cases.
  • Technical Expertise Required: Getting the most out of Terra typically requires bioinformatics or cloud computing knowledge, presenting a steep learning curve for non-technical researchers.
  • Infrastructure Dependency: Full deployment and self-hosting relies on cloud infrastructure (Azure), which may introduce cost and configuration complexity for smaller institutions.

Frequently Asked Questions

What is Terra?

Terra is an open-source, cloud-based platform for biomedical data analysis, secure sharing, and global research collaboration, developed by the Broad Institute of MIT and Harvard with Microsoft and Verily.

Is Terra free to use?

Yes, Terra is open-sourced under the BSD-3 clause license and freely available. However, running large-scale analyses may incur cloud compute and storage costs depending on the deployment.

What types of data can I analyze on Terra?

Terra is optimized for biomedical datasets including genomic sequences, single-cell RNA data, clinico-genomic research datasets, and other large-scale life sciences data.

How does collaboration work without sharing raw data?

Terra uses a federated framework that allows researchers from different institutions to collaborate on analyses while keeping data within each organization's own IT network, preserving data sovereignty and compliance.

Who uses Terra?

Terra is used by over 65,000 researchers, data scientists, and data managers across 50 countries, including academic institutions, healthcare organizations, and biotech companies focused on genomics and clinical research.

Reviews

No reviews yet. Be the first to review this tool.

Alternatives

See all