CodeFormer

CodeFormer is an open-source AI model for robust blind face restoration. It uses a Transformer-based codebook lookup approach to recover high-quality faces from degraded images with controllable fidelity.

About

CodeFormer is a state-of-the-art blind face restoration system introduced in the NeurIPS 2022 paper "Towards Robust Blind Face Restoration with Codebook Lookup Transformer," developed by researchers at S-Lab, Nanyang Technological University. It tackles the highly ill-posed problem of restoring degraded face images without explicit knowledge of the type or degree of degradation.

At its core, CodeFormer learns a discrete codebook of high-quality facial visual atoms through self-reconstruction. At inference time, a Transformer module predicts the sequence of codebook entries that best represents the global composition of a low-quality input face, enabling recovery of natural, high-fidelity faces even under severe degradation. A key differentiator is its controllable feature transformation module, which lets users adjust a scalar weight (w) to trade off output quality (sharper, more aesthetically pleasing results) against fidelity (closeness to the original input). This adaptability makes it useful across a wide range of real-world restoration scenarios.

CodeFormer has demonstrated superior performance over prior state-of-the-art methods on both synthetic and real-world datasets. It is available as open-source code on GitHub, through a HuggingFace demo, and is accompanied by the Wider-Test dataset. It is well suited to researchers, developers, and creative professionals working on image enhancement, old-photo restoration, video upscaling, and forensic imaging.

Key Features

  • Blind Face Restoration: Restores severely degraded face images without requiring prior knowledge of the degradation type or degree.
  • Discrete Codebook Prior: Uses a learned codebook of high-quality visual facial atoms to dramatically reduce the uncertainty and ambiguity of the restoration mapping.
  • Transformer-Based Global Modeling: A Transformer module captures global composition and context of low-quality faces to predict the best-matching codebook entries.
  • Controllable Quality-Fidelity Trade-off: A scalar weight (w) lets users flexibly balance between higher perceptual quality and closer fidelity to the original degraded input.
  • HuggingFace & GitHub Availability: Accessible as an interactive HuggingFace demo and fully open-source GitHub repository with accompanying Wider-Test dataset.

Use Cases

  • Restoring old or damaged family photos where faces are blurry or degraded due to age or poor scanning quality.
  • Enhancing low-quality facial images in surveillance or forensic contexts to improve identifiability.
  • Upscaling and restoring faces in historical video footage or archival media for documentary or broadcast production.
  • Pre-processing degraded facial images in datasets to improve the training quality of downstream computer vision models.
  • Academic and research benchmarking of blind face restoration algorithms against CodeFormer's published results.

Pros

  • State-of-the-Art Performance: Outperforms prior methods in both quality and fidelity on synthetic and real-world face restoration benchmarks.
  • Fully Open Source: Code, model weights, and dataset are freely available, making it accessible to researchers and developers worldwide.
  • Flexible User Control: The controllable trade-off parameter allows users to tune output for specific use cases, from aesthetic enhancement to forensic fidelity.
  • Robust to Severe Degradation: Designed to handle extremely low-quality inputs where conventional methods fail, including heavy blur, noise, and compression artifacts.

Cons

  • Faces Only: CodeFormer is specialized exclusively for face restoration and is not designed for general-purpose image enhancement or non-face subjects.
  • Technical Setup Required: Running the model locally requires familiarity with Python, deep learning frameworks, and GPU hardware; outside the HuggingFace demo, it is not readily accessible to non-technical users.
  • Research-Grade Tooling: As an academic project, it lacks a polished commercial UI, ongoing product support, or out-of-the-box integrations with creative workflows.

Frequently Asked Questions

What is blind face restoration?

Blind face restoration refers to the task of recovering a high-quality face image from a degraded input when the type and degree of degradation (e.g., blur, noise, compression) are unknown. It is called 'blind' because no explicit information about the degradation process is provided.
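Training data for blind restoration is typically synthesized by applying random degradations to clean faces. As an illustrative sketch only (the exact pipeline and parameters here are assumptions, not CodeFormer's published recipe), a blur → downsample → noise chain might look like:

```python
import numpy as np

def degrade(img: np.ndarray, blur_sigma: float = 2.0,
            noise_std: float = 10.0, scale: int = 4,
            seed: int = 0) -> np.ndarray:
    """Apply a simple blur -> downsample -> noise pipeline to a
    grayscale image (H, W) with values in [0, 255]."""
    rng = np.random.default_rng(seed)

    # Separable Gaussian blur applied along rows, then columns.
    radius = int(3 * blur_sigma)
    x = np.arange(-radius, radius + 1)
    kernel = np.exp(-x**2 / (2 * blur_sigma**2))
    kernel /= kernel.sum()
    blurred = np.apply_along_axis(
        lambda r: np.convolve(r, kernel, mode="same"), 1, img.astype(float))
    blurred = np.apply_along_axis(
        lambda c: np.convolve(c, kernel, mode="same"), 0, blurred)

    # Nearest-neighbour downsample, then upsample back to input size.
    low = blurred[::scale, ::scale]
    up = np.repeat(np.repeat(low, scale, axis=0), scale, axis=1)
    up = up[:img.shape[0], :img.shape[1]]

    # Additive Gaussian noise, clipped back to valid pixel range.
    noisy = up + rng.normal(0, noise_std, up.shape)
    return np.clip(noisy, 0, 255)

img = np.tile(np.linspace(0, 255, 64), (64, 1))  # toy gradient "image"
lq = degrade(img)
print(lq.shape)  # (64, 64)
```

The restorer never sees which of these operations (or how much of each) produced a given input, which is what makes the task "blind."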

How does CodeFormer's codebook work?

CodeFormer first trains a discrete codebook — a dictionary of high-quality facial visual atoms — through self-reconstruction learning. During restoration, a Transformer network predicts which codebook entries best represent the input face's global composition, and a decoder uses those entries to generate a high-quality output.
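The codebook idea can be reduced to a small vector-quantization sketch. Note this is an illustration with made-up names and sizes, not CodeFormer's implementation: in CodeFormer the code indices are predicted by a Transformer from global context rather than by per-vector nearest-neighbour matching, but nearest-neighbour lookup shows how continuous features get mapped onto a finite set of high-quality atoms.

```python
import numpy as np

rng = np.random.default_rng(42)

# A toy "codebook": N atoms of dimension D (both sizes illustrative).
N, D = 64, 32
codebook = rng.normal(size=(N, D))

def quantize(features: np.ndarray, codebook: np.ndarray):
    """Map each feature vector to its nearest codebook atom.

    features: (T, D) continuous encoder features, e.g. one per
    spatial location of the face feature map.
    Returns (indices, quantized) where quantized[t] == codebook[indices[t]].
    """
    # Squared Euclidean distance between every feature and every atom.
    d2 = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=-1)
    indices = d2.argmin(axis=1)          # discrete code sequence
    return indices, codebook[indices]    # lookup replaces the features

feats = rng.normal(size=(8 * 8, D))      # 8x8 feature map, flattened
idx, quantized = quantize(feats, codebook)
print(idx.shape, quantized.shape)        # (64,) (64, 32)
```

Because every output is assembled from high-quality atoms, the decoder's search space shrinks from a continuous space to a finite, well-behaved one, which is what reduces the ambiguity of the restoration mapping.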

What does the controllable weight 'w' do?

The scalar weight w controls the balance between fidelity and quality. A lower w prioritizes perceptual quality (sharper, more natural-looking output), while a higher w preserves more of the original input's identity and details, improving fidelity at the possible cost of some visual sharpness.
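CodeFormer's actual controllable feature transformation modulates decoder features with affine parameters derived from encoder features; the sketch below compresses that to a plain linear blend purely to show the role of w (function and variable names are illustrative, not from the codebase):

```python
import numpy as np

def blend(decoder_feat, encoder_feat, w):
    """Illustrative quality/fidelity blend controlled by w in [0, 1].

    w = 0: pure codebook/decoder output (highest perceptual quality).
    w = 1: maximum contribution from the degraded input's own
           features (highest fidelity to the input).
    """
    assert 0.0 <= w <= 1.0
    return (1.0 - w) * decoder_feat + w * encoder_feat

quality = np.array([1.0, 1.0, 1.0])   # stand-in for codebook features
identity = np.array([0.0, 0.0, 0.0])  # stand-in for input features

print(blend(quality, identity, 0.0))  # [1. 1. 1.] -> quality only
print(blend(quality, identity, 1.0))  # [0. 0. 0.] -> fidelity only
print(blend(quality, identity, 0.5))  # [0.5 0.5 0.5]
```

In practice this means a single trained model can serve both aesthetic enhancement (low w) and identity-preserving restoration (high w) without retraining.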

Is CodeFormer free to use?

Yes, CodeFormer is free to use. The code, pretrained models, and the Wider-Test dataset are freely available on GitHub and Google Drive, and an interactive demo is available on HuggingFace at no cost. Note that the code is released under the S-Lab License 1.0, which permits non-commercial use.

What kinds of degradation can CodeFormer handle?

CodeFormer is designed to handle a wide range of real-world degradations including blur, noise, JPEG compression artifacts, low resolution, and combinations thereof — including severely degraded inputs where other methods typically fail.
