Spaces:
Running on Zero
Apply for a GPU community grant: Academic project
This project focuses on blind face restoration (BFR) by leveraging latent consistency models (LCMs) as efficient and semantically consistent diffusion priors. Existing diffusion-model-based BFR methods typically fine-tune diffusion models on restoration datasets to recover low-quality facial images. However, directly applying diffusion models to BFR suffers from limited semantic consistency in identity, structure, and color, as well as inefficient inference caused by hundreds of denoising iterations. These limitations make it difficult to optimize restoration models and hinder the effective use of perceptual losses, which are essential for faithful face restoration.
To address these challenges, this project introduces InterLCM, a restoration framework that exploits the consistency noise-to-data mapping learned by LCMs along the ODE trajectory. By treating low-quality images as intermediate states within the LCM process, InterLCM starts restoration from earlier LCM steps to achieve a better balance between fidelity and perceptual quality. The efficiency of LCM also enables the integration of perceptual loss during training, improving restoration performance, especially in real-world degraded scenarios.
In addition, the project incorporates a Visual Module to extract high-level visual features and a Spatial Encoder to capture fine-grained spatial details. These components help reduce structural and semantic uncertainties, thereby improving the fidelity and realism of restored facial images. Experimental results on both synthetic and real-world datasets show that InterLCM achieves superior restoration quality compared with existing methods while also providing faster inference speed.
Hi @senmaonk , we've assigned ZeroGPU to this Space. Please check the compatibility and usage sections of the ZeroGPU documentation to make sure your Space runs correctly on ZeroGPU. If you're using a coding agent like Claude Code, you can also try Hugging Face's official huggingface-zerogpu skill, which guides the agent through ZeroGPU's constraints and migration steps.
If you're able to, please consider upgrading to Pro ($9/month) for a higher ZeroGPU quota plus features like Dev Mode and Private Storage.
This Space hosts an academic demo for InterLCM, a blind face restoration (BFR) method based on latent consistency models (LCMs). The demo loads the pretrained LCM-based restoration pipeline together with our proposed Visual Module and Spatial Encoder, and performs interactive inference for restoring low-quality facial images.
CPU-based inference is too slow to provide a responsive public demo experience due to iterative feature extraction and high-resolution restoration requirements. We therefore request a GPU with around 40GB VRAM. An NVIDIA L40S (48GB) or A100 instance would be sufficient. The GPU will be used exclusively for hosting this open academic demo and enabling reproducible inference for visitors.
Related resources:
Paper: https://arxiv.org/abs/2502.02215
Code: https://github.com/sen-mao/InterLCM