Update README.md
README.md
CHANGED
|
@@ -12,9 +12,8 @@ datasets:
|
|
| 12 |
<a href="https://research.adobe.com/person/romain-rouffet/" target="_blank">Romain Rouffet</a></p>
|
| 13 |
|
| 14 |
<p align="center"><a href="https://sites.google.com/view/morse2025" target="_blank">CVPR 2025 Workshop MORSE</a> </p>
|
| 15 |
-
<p align="center"><img src=https://huggingface.co/NewtNewt/MESA/blob/main/mesa-header-nz.png></p>
|
| 16 |
|
| 17 |
-
MESA is a novel generative model based on latent denoising diffusion capable of generating 2.5D representations of terrain based on the text prompt conditioning supplied via natural language. The model produces two co-registered modalities of optical and depth maps.
|
| 18 |
|
| 19 |
## Model Description
|
| 20 |
- **Paper:** [MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data](https://arxiv.org/abs/2504.07210)
|
|
@@ -35,6 +34,19 @@ cd MESA
|
|
| 35 |
huggingface-cli download NewtNewt/MESA --local-dir ./weights
|
| 36 |
```
|
| 37 |
|
| 38 |
```latex
|
| 39 |
@inproceedings{mesa2025,
|
| 40 |
title={MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data},
|
|
@@ -47,6 +59,4 @@ url={https://arxiv.org/abs/2504.07210},}
|
|
| 47 |
|
| 48 |
## Acknowledgements
|
| 49 |
|
| 50 |
-
This implementation builds upon Hugging Face’s [Diffusers](https://github.com/huggingface/diffusers) library. We also acknowledge [Gradio](https://www.gradio.app/) for providing an easy-to-use interface that allowed us to create the inference demos for our models.
|
| 51 |
-
|
| 52 |
This model is the product of a collaboration between [Φ-lab, European Space Agency (ESA)](https://philab.esa.int/) and [Adobe Research (Paris, France)](https://research.adobe.com/careers/paris/).
|
|
|
|
| 12 |
<a href="https://research.adobe.com/person/romain-rouffet/" target="_blank">Romain Rouffet</a></p>
|
| 13 |
|
| 14 |
<p align="center"><a href="https://sites.google.com/view/morse2025" target="_blank">CVPR 2025 Workshop MORSE</a> </p>
|
|
|
|
| 15 |
|
| 16 |
+
MESA is a novel generative model based on latent denoising diffusion capable of generating 2.5D representations of terrain based on the text prompt conditioning supplied via natural language. The model produces two co-registered modalities of optical and depth maps. This model is a fine-tune of [stable-diffusion-2-1](https://huggingface.co/stabilityai/stable-diffusion-2-1) and builds upon Hugging Face’s [Diffusers](https://github.com/huggingface/diffusers) library.
|
| 17 |
|
| 18 |
## Model Description
|
| 19 |
- **Paper:** [MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data](https://arxiv.org/abs/2504.07210)
|
|
|
|
| 34 |
huggingface-cli download NewtNewt/MESA --local-dir ./weights
|
| 35 |
```
|
| 36 |
|
| 37 |
+
|
| 38 |
+
## Usage
|
| 39 |
+
```python
|
| 40 |
+
import torch  # needed for torch.float16 below
from MESA.pipeline_terrain import TerrainDiffusionPipeline
|
| 41 |
+
import MESA.models as models
|
| 42 |
+
|
| 43 |
+
pipe = TerrainDiffusionPipeline.from_pretrained("./weights", torch_dtype=torch.float16)
|
| 44 |
+
pipe.to("cuda")
|
| 45 |
+
|
| 46 |
+
prompt = "a snow-covered mountain range"  # placeholder prompt; replace with your own description
image, dem = pipe(prompt, num_inference_steps=50, guidance_scale=7.5)
|
| 47 |
+
```
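The call above returns an optical image and a co-registered DEM, but this README does not state how either output is typed. The following is only a sketch for a quick visual check, assuming the DEM can be read as a 2D grid of float elevations. The helper name `dem_to_grayscale` is hypothetical and not part of MESA; it linearly rescales the grid to 8-bit values.

```python
from typing import List, Sequence


def dem_to_grayscale(dem: Sequence[Sequence[float]]) -> List[List[int]]:
    """Linearly rescale a 2D elevation grid to integers in [0, 255].

    Hypothetical helper for illustration; assumes `dem` is a 2D grid of floats.
    """
    flat = [value for row in dem for value in row]
    if not flat:
        return []
    lowest, highest = min(flat), max(flat)
    span = highest - lowest
    if span == 0:
        # A perfectly flat grid has no contrast; map every cell to 0.
        return [[0 for _ in row] for row in dem]
    return [[round(255 * (value - lowest) / span) for value in row] for row in dem]


# Toy 2x3 grid standing in for a real DEM (values are made up for illustration).
toy_dem = [[100.0, 150.0, 200.0], [120.0, 180.0, 300.0]]
gray = dem_to_grayscale(toy_dem)
```

The lowest cell maps to 0 and the highest to 255, so relative relief is preserved while absolute elevation is discarded. If the pipeline returns a different type, convert it to a nested list or adapt the helper accordingly.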
|
| 48 |
+
|
| 49 |
+
## Citation
|
| 50 |
```latex
|
| 51 |
@inproceedings{mesa2025,
|
| 52 |
title={MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data},
|
|
|
|
| 59 |
|
| 60 |
## Acknowledgements
|
| 61 |
|
|
|
|
|
|
|
| 62 |
This model is the product of a collaboration between [Φ-lab, European Space Agency (ESA)](https://philab.esa.int/) and [Adobe Research (Paris, France)](https://research.adobe.com/careers/paris/).
|