About
FinnGen is one of the world's largest population-based genomics studies, combining genome sequences with longitudinal health records from Finnish national registries to build an unparalleled biomedical research resource. The project is structured as a large public-private partnership involving Finnish biobanks, universities, government institutions, and major pharmaceutical companies, enabling rapid translation of discoveries into drug targets and clinical applications. With over 500,000 participants enrolled across Finland's nationwide biobank network, FinnGen offers exceptional statistical power for identifying genetic associations with hundreds of clinical endpoints, including rare diseases made detectable by the population's unique genetic history. Researchers gain access to a suite of web-based bioinformatics tools including PheWeb for phenome-wide association study results, Risteys for clinical endpoint statistics, LAVAA volcano plots for variant analysis, Multiple Manhattan Plots, a Coding Variant Browser, and an Endpoint Browser. Regular public data freeze releases — now at Freeze 12 — make downloadable summary statistics freely available to the global research community. FinnGen is now expanding beyond association studies to examine disease progression and underlying biological mechanisms. Its open science model ensures genome data produced by the project remains available for research purposes, with medical breakthroughs ultimately benefiting healthcare systems and patients globally. The project is especially powerful due to Finland's comprehensive national health registries, strong biobank legislation, and genetically distinctive population.
Key Features
- Massive Genomic & Health Dataset: Aggregates genome sequences and longitudinal health records from over 500,000 Finnish donors, covering nearly 10% of Finland's population across a nationwide biobank network.
- Public Data Freeze Releases: Periodic data freeze releases — now at Freeze 12 — provide freely downloadable GWAS summary statistics and browsable results to the global research community.
- Bioinformatics Research Tools: Offers PheWeb, Risteys, LAVAA volcano plots, Multiple Manhattan Plots, Coding Variant Browser, and an Endpoint Browser for in-depth phenome-wide and variant-level analysis.
- National Registry Integration: Links genome data with Finnish national health registries enabling longitudinal studies on disease onset, clinical outcomes, and progression over time.
- Industry-Academia Open Collaboration: Unites biobanks, universities, government bodies, and pharmaceutical companies in a shared data environment to accelerate drug discovery and medical innovation.
Use Cases
- Genome-wide association studies (GWAS) to identify genetic variants linked to specific diseases using one of the world's largest biobank cohorts
- Drug target discovery and validation by pharmaceutical companies using linked genomic and clinical outcome data from Finnish national registries
- Disease progression and biological mechanism research to understand how genetic factors influence disease course over time
- Population genetics studies examining the unique evolutionary and demographic history of the Finnish population
- Rare disease genetics research enabled by the large cohort size and Finland's extensive, longitudinal health record infrastructure
Pros
- Exceptional Population Coverage: With over 500,000 participants representing ~10% of Finland's population, FinnGen delivers one of the most statistically powerful genomic cohorts available for disease research.
- Open Access Tools and Summary Statistics: Publicly available data releases and web-based research tools make FinnGen a highly accessible resource for academic researchers worldwide without requiring formal data access.
- Unique Population Genetics Advantage: Finland's genetically distinct and relatively homogeneous population, combined with rich national registry data, provides exceptional power for identifying rare disease associations.
- Accelerated Drug Discovery: Deep pharmaceutical industry involvement helps translate genomic findings directly into validated drug targets, speeding up the path from research to clinical benefit.
Cons
- Population-Specific Dataset: Data is predominantly Finnish, which may limit the direct generalizability of genetic findings to more ancestrally diverse global populations.
- Restricted Individual-Level Data Access: Accessing individual-level genotype and phenotype data requires a formal application and approval process, limiting immediate availability for all researchers.
- Requires Specialized Expertise: Effectively using FinnGen's tools and interpreting its data demands strong background knowledge in genomics, bioinformatics, and statistical genetics.
Frequently Asked Questions
FinnGen is a large public-private research project that collects and analyzes genome and health data from over 500,000 Finnish biobank donors to understand the genetic basis of diseases and drive advances in personalized medicine.
Public summary statistics and web-based tools such as PheWeb and Risteys are freely accessible to all researchers. Individual-level genotype and phenotype data requires a formal application through the FinnGen collaboration framework.
FinnGen offers PheWeb for phenome-wide association study browsing, Risteys for endpoint statistics, LAVAA volcano plots, Multiple Manhattan Plots, a Coding Variant Browser, and an Endpoint Browser, all accessible online.
Finland has a genetically distinctive and relatively homogeneous population, comprehensive national health registries, a well-established biobank network, and strong legislative frameworks for health data research — all of which make it ideal for large-scale population genomics.
A data freeze is a versioned snapshot of the aggregated, quality-controlled genomic and health data released at a specific point in time. The latest release, Data Freeze 12, covers over 500,000 individuals and includes freely downloadable genome-wide summary statistics.