Text-to-Image
Diffusers
Safetensors
StableDiffusionPipeline
NewtNewt commited on
Commit
72ffbb1
·
verified ·
1 Parent(s): b73dcad

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -4
README.md CHANGED
@@ -12,9 +12,8 @@ datasets:
12
  <a href="https://research.adobe.com/person/romain-rouffet/" target="_blank">Romain Rouffet</a></p>
13
 
14
  <p align="center"><a href="https://sites.google.com/view/morse2025" target="_blank">CVPR 2025 Workshop MORSE</a> </p>
15
- <p align="center"><img src=https://huggingface.co/NewtNewt/MESA/blob/main/mesa-header-nz.png></p>
16
 
17
- MESA is a novel generative model based on latent denoising diffusion capable of generating 2.5D representations of terrain based on the text prompt conditioning supplied via natural language. The model produces two co-registered modalities of optical and depth maps.
18
 
19
  ## Model Description
20
  - **Paper:** [MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data](https://arxiv.org/abs/2504.07210)
@@ -35,6 +34,19 @@ cd MESA
35
  huggingface-cli download NewtNewt/MESA --local-dir ./weights
36
  ```
37
 
 
 
 
 
 
 
 
 
 
 
 
 
 
38
  ```latex
39
  @inproceedings{mesa2025,
40
  title={MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data},
@@ -47,6 +59,4 @@ url={https://arxiv.org/abs/2504.07210},}
47
 
48
  ## Acknowledgements
49
 
50
- This implementation builds upon Hugging Face’s [Diffusers](https://github.com/huggingface/diffusers) library. We also acknowledge [Gradio](https://www.gradio.app/) for providing an easy-to-use interface that allowed us to create the inference demos for our models.
51
-
52
  This model is the product of a collaboration between [Φ-lab, European Space Agency (ESA)](https://philab.esa.int/) and the [Adobe Research (Paris, France)](https://research.adobe.com/careers/paris/).
 
12
  <a href="https://research.adobe.com/person/romain-rouffet/" target="_blank">Romain Rouffet</a></p>
13
 
14
  <p align="center"><a href="https://sites.google.com/view/morse2025" target="_blank">CVPR 2025 Workshop MORSE</a> </p>
 
15
 
16
+ MESA is a novel generative model based on latent denoising diffusion capable of generating 2.5D representations of terrain based on the text prompt conditioning supplied via natural language. The model produces two co-registered modalities of optical and depth maps. This model is a finetune of [stable-diffusion-2-1](https://huggingface.co/stabilityai/stable-diffusion-2-1) and is builds upon Hugging Face’s [Diffusers](https://github.com/huggingface/diffusers) library.
17
 
18
  ## Model Description
19
  - **Paper:** [MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data](https://arxiv.org/abs/2504.07210)
 
34
  huggingface-cli download NewtNewt/MESA --local-dir ./weights
35
  ```
36
 
37
+
38
+ ## Usage
39
+ ```python
40
+ from MESA.pipeline_terrain import TerrainDiffusionPipeline
41
+ import MESA.models as models
42
+
43
+ pipe = TerrainDiffusionPipeline.from_pretrained("./weights", torch_dtype=torch.float16)
44
+ pipe.to("cuda");
45
+
46
+ image,dem = pipe(prompt, num_inference_steps=50, guidance_scale=7.5)
47
+ ```
48
+
49
+ ## Citation
50
  ```latex
51
  @inproceedings{mesa2025,
52
  title={MESA: Text-Driven Terrain Generation Using Latent Diffusion and Global Copernicus Data},
 
59
 
60
  ## Acknowledgements
61
 
 
 
62
  This model is the product of a collaboration between [Φ-lab, European Space Agency (ESA)](https://philab.esa.int/) and the [Adobe Research (Paris, France)](https://research.adobe.com/careers/paris/).