Instructions to use microsoft/Lens-Base with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use microsoft/Lens-Base with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("microsoft/Lens-Base", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
- Local Apps
- Draw Things
- DiffusionBee
Upload folder using huggingface_hub
Browse filesThis view is limited to 50 files because it contains too many changes. See raw diff
- .gitattributes +34 -0
- README.md +404 -2
- assets/gallery/000-1440x1440.png +3 -0
- assets/gallery/000-1440x1440.txt +1 -0
- assets/gallery/001-1440x1440.png +3 -0
- assets/gallery/001-1440x1440.txt +1 -0
- assets/gallery/002-1440x1440.png +3 -0
- assets/gallery/002-1440x1440.txt +1 -0
- assets/gallery/003-1248x1664.png +3 -0
- assets/gallery/003-1248x1664.txt +1 -0
- assets/gallery/004-1664x1248.png +3 -0
- assets/gallery/004-1664x1248.txt +1 -0
- assets/gallery/005-1664x1248.png +3 -0
- assets/gallery/005-1664x1248.txt +1 -0
- assets/gallery/006-1248x1664.png +3 -0
- assets/gallery/006-1248x1664.txt +1 -0
- assets/gallery/007-1440x1440.png +3 -0
- assets/gallery/007-1440x1440.txt +1 -0
- assets/gallery/008-1440x1440.png +3 -0
- assets/gallery/008-1440x1440.txt +1 -0
- assets/gallery/009-1664x1248.png +3 -0
- assets/gallery/009-1664x1248.txt +1 -0
- assets/gallery/010-1664x1248.png +3 -0
- assets/gallery/010-1664x1248.txt +1 -0
- assets/gallery/011-1664x1248.png +3 -0
- assets/gallery/011-1664x1248.txt +1 -0
- assets/gallery/012-1664x1248.png +3 -0
- assets/gallery/012-1664x1248.txt +1 -0
- assets/gallery/013-1664x1248.png +3 -0
- assets/gallery/013-1664x1248.txt +1 -0
- assets/gallery/014-1248x1664.png +3 -0
- assets/gallery/014-1248x1664.txt +1 -0
- assets/gallery/015-1248x1664.png +3 -0
- assets/gallery/015-1248x1664.txt +1 -0
- assets/gallery/016-1664x1248.png +3 -0
- assets/gallery/016-1664x1248.txt +1 -0
- assets/gallery/017-1248x1664.png +3 -0
- assets/gallery/017-1248x1664.txt +1 -0
- assets/gallery/018-1248x1664.png +3 -0
- assets/gallery/018-1248x1664.txt +1 -0
- assets/gallery/019-1664x1248.png +3 -0
- assets/gallery/019-1664x1248.txt +1 -0
- assets/gallery/020-1664x1248.png +3 -0
- assets/gallery/020-1664x1248.txt +1 -0
- assets/gallery/021-1664x1248.png +3 -0
- assets/gallery/021-1664x1248.txt +1 -0
- assets/gallery/022-1440x1440.png +3 -0
- assets/gallery/022-1440x1440.txt +1 -0
- assets/gallery/023-1248x1664.png +3 -0
- assets/gallery/023-1248x1664.txt +1 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,37 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
assets/gallery/000-1440x1440.png filter=lfs diff=lfs merge=lfs -text
|
| 37 |
+
assets/gallery/001-1440x1440.png filter=lfs diff=lfs merge=lfs -text
|
| 38 |
+
assets/gallery/002-1440x1440.png filter=lfs diff=lfs merge=lfs -text
|
| 39 |
+
assets/gallery/003-1248x1664.png filter=lfs diff=lfs merge=lfs -text
|
| 40 |
+
assets/gallery/004-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 41 |
+
assets/gallery/005-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 42 |
+
assets/gallery/006-1248x1664.png filter=lfs diff=lfs merge=lfs -text
|
| 43 |
+
assets/gallery/007-1440x1440.png filter=lfs diff=lfs merge=lfs -text
|
| 44 |
+
assets/gallery/008-1440x1440.png filter=lfs diff=lfs merge=lfs -text
|
| 45 |
+
assets/gallery/009-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 46 |
+
assets/gallery/010-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 47 |
+
assets/gallery/011-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 48 |
+
assets/gallery/012-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 49 |
+
assets/gallery/013-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 50 |
+
assets/gallery/014-1248x1664.png filter=lfs diff=lfs merge=lfs -text
|
| 51 |
+
assets/gallery/015-1248x1664.png filter=lfs diff=lfs merge=lfs -text
|
| 52 |
+
assets/gallery/016-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 53 |
+
assets/gallery/017-1248x1664.png filter=lfs diff=lfs merge=lfs -text
|
| 54 |
+
assets/gallery/018-1248x1664.png filter=lfs diff=lfs merge=lfs -text
|
| 55 |
+
assets/gallery/019-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 56 |
+
assets/gallery/020-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 57 |
+
assets/gallery/021-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 58 |
+
assets/gallery/022-1440x1440.png filter=lfs diff=lfs merge=lfs -text
|
| 59 |
+
assets/gallery/023-1248x1664.png filter=lfs diff=lfs merge=lfs -text
|
| 60 |
+
assets/gallery/024-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 61 |
+
assets/gallery/025-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 62 |
+
assets/gallery/026-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 63 |
+
assets/gallery/027-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 64 |
+
assets/gallery/028-1248x1664.png filter=lfs diff=lfs merge=lfs -text
|
| 65 |
+
assets/gallery/029-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 66 |
+
assets/gallery/030-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 67 |
+
assets/gallery/031-1664x1248.png filter=lfs diff=lfs merge=lfs -text
|
| 68 |
+
assets/teaser.webp filter=lfs diff=lfs merge=lfs -text
|
| 69 |
+
tokenizer/tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
|
@@ -1,3 +1,405 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
-
|
| 3 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
<div align="center">
|
| 2 |
+
|
| 3 |
+
# Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models
|
| 4 |
+
|
| 5 |
+
<img src="assets/teaser.webp" alt="Lens Teaser" width="100%" />
|
| 6 |
+
|
| 7 |
+
<p>
|
| 8 |
+
<sub>
|
| 9 |
+
<strong>Jinjing Zhao</strong>*,
|
| 10 |
+
<strong>Sirui Zhang</strong>*,
|
| 11 |
+
<strong>Zhiyang Liang</strong>*,
|
| 12 |
+
<strong>Yang Yue</strong>*,
|
| 13 |
+
<strong>Jiawei Zhang</strong>*,
|
| 14 |
+
<strong>Qinhong Yang</strong>,
|
| 15 |
+
<strong>Yanchen Dong</strong>,
|
| 16 |
+
<strong>Yitong Wang</strong>,
|
| 17 |
+
<strong>Yunuo Chen</strong>,
|
| 18 |
+
<strong>Xiuyu Wu</strong>,
|
| 19 |
+
<strong>Fangyun Wei</strong>†,
|
| 20 |
+
<strong>Dong Chen</strong>†,
|
| 21 |
+
<strong>Dongdong Chen</strong>,
|
| 22 |
+
<strong>Ziyu Wan</strong>,
|
| 23 |
+
<strong>Lei Shi</strong>,
|
| 24 |
+
<strong>Ji Li</strong>,
|
| 25 |
+
<strong>Chong Luo</strong>,
|
| 26 |
+
<strong>Yan Lu</strong>,
|
| 27 |
+
<strong>Baining Guo</strong>
|
| 28 |
+
</sub>
|
| 29 |
+
<br />
|
| 30 |
+
<sub>Microsoft | *Equal Contribution | †Project Lead</sub>
|
| 31 |
+
</p>
|
| 32 |
+
|
| 33 |
+
<p>
|
| 34 |
+
<a href="https://arxiv.org/abs/PLACEHOLDER"><img alt="arXiv" src="https://img.shields.io/badge/arXiv-Paper-b31b1b?logo=arxiv&logoColor=white" height="22" /></a>
|
| 35 |
+
|
| 36 |
+
<a href="https://huggingface.co/microsoft/Lens"><img alt="Hugging Face" src="https://img.shields.io/badge/%F0%9F%A4%97-Models-yellow" height="22" /></a>
|
| 37 |
+
|
| 38 |
+
<a href="LICENSE"><img alt="License: MIT" src="https://img.shields.io/badge/License-MIT-green.svg" height="22" /></a>
|
| 39 |
+
</p>
|
| 40 |
+
|
| 41 |
+
</div>
|
| 42 |
+
|
| 43 |
---
|
| 44 |
+
|
| 45 |
+
**Lens** is a **3.8B-parameter** foundational text-to-image model designed for **efficient training** and **fast high-resolution generation**. It combines dense-caption pre-training, mixed-resolution learning, GPT-OSS multi-layer text features, and the FLUX.2 semantic VAE to reach competitive quality with substantially less training compute than larger T2I models.
|
| 46 |
+
|
| 47 |
+
This repository provides the minimal inference code for generating images from Lens DiT checkpoints.
|
| 48 |
+
|
| 49 |
+
## Highlights
|
| 50 |
+
|
| 51 |
+
- **Efficient Foundation** — Trained on **Lens-800M**, an 800M image-text corpus with long GPT-4.1 captions, maximizing information density per training batch.
|
| 52 |
+
- **Compact & Expressive** — A 48-block MMDiT denoiser leverages FLUX.2 latents and concatenated multi-layer GPT-OSS features for stronger prompt following and multilingual generalization.
|
| 53 |
+
- **Flexible Resolution** — Mixed-resolution training enables inference across aspect ratios from `1:2` to `2:1` and resolutions up to **1440×1440**.
|
| 54 |
+
- **Post-trained Variants** — RL tuning improves visual quality and artifact suppression; the distilled **Lens-Turbo** supports fast **4-step** generation.
|
| 55 |
+
|
| 56 |
+
## Gallery
|
| 57 |
+
|
| 58 |
+
<!-- LENS_GALLERY_START -->
|
| 59 |
+
|
| 60 |
+
<details name="lens-gallery" open>
|
| 61 |
+
<summary><b>Page 1 / 6</b> samples 000-005</summary>
|
| 62 |
+
|
| 63 |
+
<table>
|
| 64 |
+
<tr>
|
| 65 |
+
<td width="33%" valign="top">
|
| 66 |
+
<img src="assets/gallery/000-1440x1440.png" alt="Lens gallery sample 000" width="100%" />
|
| 67 |
+
<br />
|
| 68 |
+
<sub><b>Sample 000</b> · 1440x1440<br />A generous portion of classic British fish and chips served on a sheet of white paper, golden crispy beer-battered cod fillet alongside thick-cut chips, a wedge of lemon, mushy peas in a small dish, malt vinegar bottle nearby, wooden pub table, overhead shot</sub>
|
| 69 |
+
</td>
|
| 70 |
+
<td width="33%" valign="top">
|
| 71 |
+
<img src="assets/gallery/001-1440x1440.png" alt="Lens gallery sample 001" width="100%" />
|
| 72 |
+
<br />
|
| 73 |
+
<sub><b>Sample 001</b> · 1440x1440<br />The iconic Big Ben clock tower and the Houses of Parliament in London at golden hour, the River Thames reflecting warm amber light, Westminster Bridge in the foreground, a classic red double-decker bus crossing, dramatic clouds lit by sunset</sub>
|
| 74 |
+
</td>
|
| 75 |
+
<td width="33%" valign="top">
|
| 76 |
+
<img src="assets/gallery/002-1440x1440.png" alt="Lens gallery sample 002" width="100%" />
|
| 77 |
+
<br />
|
| 78 |
+
<sub><b>Sample 002</b> · 1440x1440<br />La Tour Eiffel au crépuscule vue depuis le Trocadéro, la structure en fer illuminée de milliers de lumières dorées scintillantes, le ciel passant du bleu profond au violet, les fontaines du Trocadéro au premier plan avec des reflets dorés, silhouettes de promeneurs</sub>
|
| 79 |
+
</td>
|
| 80 |
+
</tr>
|
| 81 |
+
<tr>
|
| 82 |
+
<td width="33%" valign="top">
|
| 83 |
+
<img src="assets/gallery/003-1248x1664.png" alt="Lens gallery sample 003" width="100%" />
|
| 84 |
+
<br />
|
| 85 |
+
<sub><b>Sample 003</b> · 1248x1664<br />A crystal dragon soaring through an aurora borealis sky, its entire body made of transparent faceted crystal refracting the green and purple aurora light into rainbow spectra, ice particles trailing from its wings, high fantasy digital art</sub>
|
| 86 |
+
</td>
|
| 87 |
+
<td width="33%" valign="top">
|
| 88 |
+
<img src="assets/gallery/004-1664x1248.png" alt="Lens gallery sample 004" width="100%" />
|
| 89 |
+
<br />
|
| 90 |
+
<sub><b>Sample 004</b> · 1664x1248<br />Aerial view of Yuanyang rice terraces in Yunnan province at sunrise, thousands of cascading water-filled paddies reflecting golden and pink sky colors, morning mist weaving between terrace layers, lush green hillside with scattered palm trees, drone photography</sub>
|
| 91 |
+
</td>
|
| 92 |
+
<td width="33%" valign="top">
|
| 93 |
+
<img src="assets/gallery/005-1664x1248.png" alt="Lens gallery sample 005" width="100%" />
|
| 94 |
+
<br />
|
| 95 |
+
<sub><b>Sample 005</b> · 1664x1248<br />A green iguana basking on a moss-covered fallen log in a tropical rainforest, every scale and spine rendered in sharp detail, dewdrops clinging to its skin, a blurred waterfall and lush tropical foliage in the background, National Geographic wildlife photography style</sub>
|
| 96 |
+
</td>
|
| 97 |
+
</tr>
|
| 98 |
+
</table>
|
| 99 |
+
</details>
|
| 100 |
+
|
| 101 |
+
<details name="lens-gallery">
|
| 102 |
+
<summary><b>Page 2 / 6</b> samples 006-011</summary>
|
| 103 |
+
|
| 104 |
+
<table>
|
| 105 |
+
<tr>
|
| 106 |
+
<td width="33%" valign="top">
|
| 107 |
+
<img src="assets/gallery/006-1248x1664.png" alt="Lens gallery sample 006" width="100%" />
|
| 108 |
+
<br />
|
| 109 |
+
<sub><b>Sample 006</b> · 1248x1664<br />Oil painting portrait of a Renaissance noblewoman in a deep blue velvet dress with pearl drop earrings, soft chiaroscuro lighting revealing delicate skin, craquelure texture on the painted surface, in the style of Vermeer</sub>
|
| 110 |
+
</td>
|
| 111 |
+
<td width="33%" valign="top">
|
| 112 |
+
<img src="assets/gallery/007-1440x1440.png" alt="Lens gallery sample 007" width="100%" />
|
| 113 |
+
<br />
|
| 114 |
+
<sub><b>Sample 007</b> · 1440x1440<br />An artisan honey jar with a hand-illustrated vintage botanical label reading "Mountain Wildflower Honey" in brown serif letterpress-style typography with decorative flourishes, detailed ink drawings of wildflowers, clover and honeybees surrounding the text, kraft paper label on clear glass jar</sub>
|
| 115 |
+
</td>
|
| 116 |
+
<td width="33%" valign="top">
|
| 117 |
+
<img src="assets/gallery/008-1440x1440.png" alt="Lens gallery sample 008" width="100%" />
|
| 118 |
+
<br />
|
| 119 |
+
<sub><b>Sample 008</b> · 1440x1440<br />Watercolor portrait of a thoughtful young man reading a worn leather book in a Parisian cafe, loose wet-on-wet brushstrokes bleeding into warm amber and burnt sienna washes, visible paper grain texture</sub>
|
| 120 |
+
</td>
|
| 121 |
+
</tr>
|
| 122 |
+
<tr>
|
| 123 |
+
<td width="33%" valign="top">
|
| 124 |
+
<img src="assets/gallery/009-1664x1248.png" alt="Lens gallery sample 009" width="100%" />
|
| 125 |
+
<br />
|
| 126 |
+
<sub><b>Sample 009</b> · 1664x1248<br />An explorer's oak desk with an aged world map spread open, a brass sextant, leather-bound navigation journal with handwritten entries, melting candle in a copper holder, scattered compass and quill pen, warm window light, still life photography</sub>
|
| 127 |
+
</td>
|
| 128 |
+
<td width="33%" valign="top">
|
| 129 |
+
<img src="assets/gallery/010-1664x1248.png" alt="Lens gallery sample 010" width="100%" />
|
| 130 |
+
<br />
|
| 131 |
+
<sub><b>Sample 010</b> · 1664x1248<br />New York Grand Central Terminal subway station with the classic station name "GRAND CENTRAL" spelled out in elegant white ceramic mosaic tile letters embedded in a dark green tile wall, each letter approximately eight inches tall, ornate tile border frames, the S-curve of train tracks visible</sub>
|
| 132 |
+
</td>
|
| 133 |
+
<td width="33%" valign="top">
|
| 134 |
+
<img src="assets/gallery/011-1664x1248.png" alt="Lens gallery sample 011" width="100%" />
|
| 135 |
+
<br />
|
| 136 |
+
<sub><b>Sample 011</b> · 1664x1248<br />A ruby-throated hummingbird hovering in front of a bright red heliconia flower, wings frozen in a figure-eight pattern showing iridescent feather detail, individual water droplets suspended around the bird, high-speed macro photography with dark background</sub>
|
| 137 |
+
</td>
|
| 138 |
+
</tr>
|
| 139 |
+
</table>
|
| 140 |
+
</details>
|
| 141 |
+
|
| 142 |
+
<details name="lens-gallery">
|
| 143 |
+
<summary><b>Page 3 / 6</b> samples 012-017</summary>
|
| 144 |
+
|
| 145 |
+
<table>
|
| 146 |
+
<tr>
|
| 147 |
+
<td width="33%" valign="top">
|
| 148 |
+
<img src="assets/gallery/012-1664x1248.png" alt="Lens gallery sample 012" width="100%" />
|
| 149 |
+
<br />
|
| 150 |
+
<sub><b>Sample 012</b> · 1664x1248<br />An old Remington typewriter with a sheet of cream-colored paper rolled into the carriage, the typed words "Chapter One: The Beginning" visible in slightly uneven Courier typeface with characteristic ink density variations, some letters slightly misaligned, warm desk lamp lighting</sub>
|
| 151 |
+
</td>
|
| 152 |
+
<td width="33%" valign="top">
|
| 153 |
+
<img src="assets/gallery/013-1664x1248.png" alt="Lens gallery sample 013" width="100%" />
|
| 154 |
+
<br />
|
| 155 |
+
<sub><b>Sample 013</b> · 1664x1248<br />The Great Wildebeest Migration crossing the Mara River at golden hour, hundreds of animals plunging into churning water sending spray everywhere, dust clouds rising from the riverbank, dramatic backlit scene, National Geographic documentary style</sub>
|
| 156 |
+
</td>
|
| 157 |
+
<td width="33%" valign="top">
|
| 158 |
+
<img src="assets/gallery/014-1248x1664.png" alt="Lens gallery sample 014" width="100%" />
|
| 159 |
+
<br />
|
| 160 |
+
<sub><b>Sample 014</b> · 1248x1664<br />A charming flower shop storefront window with hand-painted white script lettering on the glass reading "Fresh Flowers Daily" in flowing connected cursive with decorative swashes, roses and peonies arranged in buckets visible through the lettering, morning sunlight catching the painted letters</sub>
|
| 161 |
+
</td>
|
| 162 |
+
</tr>
|
| 163 |
+
<tr>
|
| 164 |
+
<td width="33%" valign="top">
|
| 165 |
+
<img src="assets/gallery/015-1248x1664.png" alt="Lens gallery sample 015" width="100%" />
|
| 166 |
+
<br />
|
| 167 |
+
<sub><b>Sample 015</b> · 1248x1664<br />A steampunk floating sky-city built on massive gear-driven platforms, brass and copper towers connected by chain bridges, steam-powered airships and hot air balloons docking at various levels, sunset clouds below the city, detailed concept art</sub>
|
| 168 |
+
</td>
|
| 169 |
+
<td width="33%" valign="top">
|
| 170 |
+
<img src="assets/gallery/016-1664x1248.png" alt="Lens gallery sample 016" width="100%" />
|
| 171 |
+
<br />
|
| 172 |
+
<sub><b>Sample 016</b> · 1664x1248<br />Milford Sound in New Zealand at dawn, a perfect mirror reflection of steep fjord walls on glass-still water, waterfalls streaming down thousand-foot cliffs, morning mist hovering above the water surface, panoramic landscape photography</sub>
|
| 173 |
+
</td>
|
| 174 |
+
<td width="33%" valign="top">
|
| 175 |
+
<img src="assets/gallery/017-1248x1664.png" alt="Lens gallery sample 017" width="100%" />
|
| 176 |
+
<br />
|
| 177 |
+
<sub><b>Sample 017</b> · 1248x1664<br />An Indian Bharatanatyam classical dancer in the aramandi pose, bronze ankle bells and elaborate hand mudra gestures, rich silk costume with gold temple jewelry, captured mid-performance with dramatic stage lighting</sub>
|
| 178 |
+
</td>
|
| 179 |
+
</tr>
|
| 180 |
+
</table>
|
| 181 |
+
</details>
|
| 182 |
+
|
| 183 |
+
<details name="lens-gallery">
|
| 184 |
+
<summary><b>Page 4 / 6</b> samples 018-023</summary>
|
| 185 |
+
|
| 186 |
+
<table>
|
| 187 |
+
<tr>
|
| 188 |
+
<td width="33%" valign="top">
|
| 189 |
+
<img src="assets/gallery/018-1248x1664.png" alt="Lens gallery sample 018" width="100%" />
|
| 190 |
+
<br />
|
| 191 |
+
<sub><b>Sample 018</b> · 1248x1664<br />A narrow alleyway in Marrakech's old medina with walls painted in vivid cobalt blue, colorful handwoven rugs and ceramic plates displayed along the walls, ornate wooden doors, warm sunlight from above creating dramatic shadows, Moroccan architecture</sub>
|
| 192 |
+
</td>
|
| 193 |
+
<td width="33%" valign="top">
|
| 194 |
+
<img src="assets/gallery/019-1664x1248.png" alt="Lens gallery sample 019" width="100%" />
|
| 195 |
+
<br />
|
| 196 |
+
<sub><b>Sample 019</b> · 1664x1248<br />A rustic wooden sign at a fishing village dock reading "Fresh Catch of the Day" in hand-carved letters painted nautical blue, thick hemp rope threaded through the sign as a border, fishing nets and lobster traps stacked in the background, seaside atmosphere</sub>
|
| 197 |
+
</td>
|
| 198 |
+
<td width="33%" valign="top">
|
| 199 |
+
<img src="assets/gallery/020-1664x1248.png" alt="Lens gallery sample 020" width="100%" />
|
| 200 |
+
<br />
|
| 201 |
+
<sub><b>Sample 020</b> · 1664x1248<br />A sunken shipwreck on the ocean floor completely overgrown with colorful coral formations, schools of tropical fish swimming through the broken hull and portholes, shafts of sunlight streaming down from the surface above, underwater archaeology photography</sub>
|
| 202 |
+
</td>
|
| 203 |
+
</tr>
|
| 204 |
+
<tr>
|
| 205 |
+
<td width="33%" valign="top">
|
| 206 |
+
<img src="assets/gallery/021-1664x1248.png" alt="Lens gallery sample 021" width="100%" />
|
| 207 |
+
<br />
|
| 208 |
+
<sub><b>Sample 021</b> · 1664x1248<br />Zhangjiajie pillar mountains rising above a sea of clouds at sunrise, golden light painting the sandstone peaks, the surreal Avatar-like floating mountain landscape stretching to the horizon, aerial drone photography capturing immense vertical scale</sub>
|
| 209 |
+
</td>
|
| 210 |
+
<td width="33%" valign="top">
|
| 211 |
+
<img src="assets/gallery/022-1440x1440.png" alt="Lens gallery sample 022" width="100%" />
|
| 212 |
+
<br />
|
| 213 |
+
<sub><b>Sample 022</b> · 1440x1440<br />A red-eyed tree frog perched on a bright red bromeliad flower in the Costa Rican cloud forest, its neon green body contrasting with blue-striped flanks and orange feet, water droplets on its smooth skin, extreme macro with ring flash lighting</sub>
|
| 214 |
+
</td>
|
| 215 |
+
<td width="33%" valign="top">
|
| 216 |
+
<img src="assets/gallery/023-1248x1664.png" alt="Lens gallery sample 023" width="100%" />
|
| 217 |
+
<br />
|
| 218 |
+
<sub><b>Sample 023</b> · 1248x1664<br />Inside a massive limestone cave, ancient stalactites and stalagmites meeting to form columns, an underground river reflecting the formations like a mirror, subtle warm lighting revealing millions of years of mineral deposits, spelunking exploration photography</sub>
|
| 219 |
+
</td>
|
| 220 |
+
</tr>
|
| 221 |
+
</table>
|
| 222 |
+
</details>
|
| 223 |
+
|
| 224 |
+
<details name="lens-gallery">
|
| 225 |
+
<summary><b>Page 5 / 6</b> samples 024-029</summary>
|
| 226 |
+
|
| 227 |
+
<table>
|
| 228 |
+
<tr>
|
| 229 |
+
<td width="33%" valign="top">
|
| 230 |
+
<img src="assets/gallery/024-1664x1248.png" alt="Lens gallery sample 024" width="100%" />
|
| 231 |
+
<br />
|
| 232 |
+
<sub><b>Sample 024</b> · 1664x1248<br />A weathered 1960s gas station with a large roadside sign reading "ROUTE 66 GAS & GO" in retro rounded sans-serif letters with a red and white color scheme, vintage gas pumps with analog dials in the foreground, a classic Chevrolet parked to the side, Americana nostalgia</sub>
|
| 233 |
+
</td>
|
| 234 |
+
<td width="33%" valign="top">
|
| 235 |
+
<img src="assets/gallery/025-1664x1248.png" alt="Lens gallery sample 025" width="100%" />
|
| 236 |
+
<br />
|
| 237 |
+
<sub><b>Sample 025</b> · 1664x1248<br />Construction site hoarding covered in unauthorized street art with "ART IS EVERYWHERE" spray-painted in large freehand capital letters using multiple overlapping colors of red, yellow and blue, paint drips running down from each letter, chaotic beautiful urban canvas</sub>
|
| 238 |
+
</td>
|
| 239 |
+
<td width="33%" valign="top">
|
| 240 |
+
<img src="assets/gallery/026-1664x1248.png" alt="Lens gallery sample 026" width="100%" />
|
| 241 |
+
<br />
|
| 242 |
+
<sub><b>Sample 026</b> · 1664x1248<br />Top-down view of a koi pond, dozens of ornamental koi fish in vivid red white orange and gold patterns swimming through crystal-clear emerald water, fallen cherry blossom petals floating on the surface, Japanese garden aerial photography</sub>
|
| 243 |
+
</td>
|
| 244 |
+
</tr>
|
| 245 |
+
<tr>
|
| 246 |
+
<td width="33%" valign="top">
|
| 247 |
+
<img src="assets/gallery/027-1664x1248.png" alt="Lens gallery sample 027" width="100%" />
|
| 248 |
+
<br />
|
| 249 |
+
<sub><b>Sample 027</b> · 1664x1248<br />The Potala Palace in Lhasa under a canopy of stars with the Milky Way arching overhead, Tibetan prayer wheels and butter lamps in the foreground casting warm golden light, the massive white and red palace walls glowing in moonlight, night photography</sub>
|
| 250 |
+
</td>
|
| 251 |
+
<td width="33%" valign="top">
|
| 252 |
+
<img src="assets/gallery/028-1248x1664.png" alt="Lens gallery sample 028" width="100%" />
|
| 253 |
+
<br />
|
| 254 |
+
<sub><b>Sample 028</b> · 1248x1664<br />Yellowstone's Grand Prismatic Spring shot from directly above by drone, concentric rings of vivid blue turquoise green yellow and orange created by thermophilic bacteria, steam rising from the surface, abstract natural color palette</sub>
|
| 255 |
+
</td>
|
| 256 |
+
<td width="33%" valign="top">
|
| 257 |
+
<img src="assets/gallery/029-1664x1248.png" alt="Lens gallery sample 029" width="100%" />
|
| 258 |
+
<br />
|
| 259 |
+
<sub><b>Sample 029</b> · 1664x1248<br />A herd of African elephants walking in a line across the savanna with Mount Kilimanjaro's snow-capped peak behind them, golden sunset dust kicked up by their feet creating a hazy atmosphere, telephoto wildlife photography showing massive scale</sub>
|
| 260 |
+
</td>
|
| 261 |
+
</tr>
|
| 262 |
+
</table>
|
| 263 |
+
</details>
|
| 264 |
+
|
| 265 |
+
<details name="lens-gallery">
|
| 266 |
+
<summary><b>Page 6 / 6</b> samples 030-031</summary>
|
| 267 |
+
|
| 268 |
+
<table>
|
| 269 |
+
<tr>
|
| 270 |
+
<td width="33%" valign="top">
|
| 271 |
+
<img src="assets/gallery/030-1664x1248.png" alt="Lens gallery sample 030" width="100%" />
|
| 272 |
+
<br />
|
| 273 |
+
<sub><b>Sample 030</b> · 1664x1248<br />The Hall of Mirrors at the Palace of Versailles, hundreds of candles reflected infinitely in the massive gilded mirrors, crystal chandeliers casting prismatic light across painted ceilings and gold leaf ornamentation, Baroque opulence</sub>
|
| 274 |
+
</td>
|
| 275 |
+
<td width="33%" valign="top">
|
| 276 |
+
<img src="assets/gallery/031-1664x1248.png" alt="Lens gallery sample 031" width="100%" />
|
| 277 |
+
<br />
|
| 278 |
+
<sub><b>Sample 031</b> · 1664x1248<br />A pirate captain's cabin, navigation charts pinned to the wall, a brass telescope and astrolabe on the desk, stacks of gold coins and a jewel-encrusted goblet, rum bottle, warm swinging lantern light casting shadows with the ship's motion</sub>
|
| 279 |
+
</td>
|
| 280 |
+
<td width="33%"></td>
|
| 281 |
+
</tr>
|
| 282 |
+
<tr>
|
| 283 |
+
<td width="33%"></td>
|
| 284 |
+
<td width="33%"></td>
|
| 285 |
+
<td width="33%"></td>
|
| 286 |
+
</tr>
|
| 287 |
+
</table>
|
| 288 |
+
</details>
|
| 289 |
+
<!-- LENS_GALLERY_END -->
|
| 290 |
+
|
| 291 |
+
## Installation
|
| 292 |
+
|
| 293 |
+
> **Tested environment:** Python 3.12 · CUDA 12.6 · PyTorch 2.11.0+cu126 · TorchVision 0.26.0+cu126
|
| 294 |
+
|
| 295 |
+
```bash
|
| 296 |
+
conda create -n lens python=3.12 -y
|
| 297 |
+
conda activate lens
|
| 298 |
+
|
| 299 |
+
uv pip install torch==2.11.0+cu126 torchvision==0.26.0+cu126 \
|
| 300 |
+
--index-url https://download.pytorch.org/whl/cu126
|
| 301 |
+
uv pip install -r requirements.txt
|
| 302 |
+
```
|
| 303 |
+
|
| 304 |
+
The default GPT-OSS encoder and FLUX.2 VAE are loaded from Hugging Face. Make sure your environment has access to any gated model repositories you use.
|
| 305 |
+
|
| 306 |
+
## Checkpoints
|
| 307 |
+
|
| 308 |
+
| Repo | Description | Steps | CFG |
|
| 309 |
+
| :--- | :--- | :---: | :---: |
|
| 310 |
+
| [`microsoft/Lens`](https://huggingface.co/microsoft/Lens) | **Default.** RL-tuned for visual quality | 20 | 5.0 |
|
| 311 |
+
| [`microsoft/Lens-Turbo`](https://huggingface.co/microsoft/Lens-Turbo) | Distilled from the RL model for fast 4-step sampling | 4 | 1.0 |
|
| 312 |
+
| [`microsoft/Lens-Base`](https://huggingface.co/microsoft/Lens-Base) | Supervised base model (no RL, no distillation) | 50 | 5.0 |
|
| 313 |
+
|
| 314 |
+
Pick a variant by passing its repo id to `--repo_id` (CLI) or `LensPipeline.from_pretrained(...)` (Python).
|
| 315 |
+
|
| 316 |
+
## Inference
|
| 317 |
+
|
| 318 |
+
> **Important:** run from the cloned repo root so `from lens import LensPipeline` resolves to this package — importing `lens` is what registers `LensGptOssEncoder` / `LensTransformer2DModel` with the `transformers` and `diffusers` namespaces that `model_index.json` references.
|
| 319 |
+
|
| 320 |
+
**Python API:**
|
| 321 |
+
|
| 322 |
+
```python
|
| 323 |
+
import torch
|
| 324 |
+
from lens import LensPipeline
|
| 325 |
+
|
| 326 |
+
pipe = LensPipeline.from_pretrained(
|
| 327 |
+
"microsoft/Lens-Base", torch_dtype=torch.bfloat16
|
| 328 |
+
).to("cuda")
|
| 329 |
+
|
| 330 |
+
image = pipe(
|
| 331 |
+
prompt="A cat holding a sign that says hello world",
|
| 332 |
+
base_resolution=1440, aspect_ratio="1:1",
|
| 333 |
+
num_inference_steps=20, guidance_scale=5.0,
|
| 334 |
+
generator=torch.Generator("cuda").manual_seed(0),
|
| 335 |
+
).images[0]
|
| 336 |
+
image.save("lens.png")
|
| 337 |
+
```
|
| 338 |
+
|
| 339 |
+
To trade speed for VRAM, replace `.to("cuda")` with `pipe.enable_model_cpu_offload()`.
|
| 340 |
+
|
| 341 |
+
**CLI — basic usage:**
|
| 342 |
+
|
| 343 |
+
```bash
|
| 344 |
+
python inference.py \
|
| 345 |
+
--repo_id "microsoft/Lens-Base" \
|
| 346 |
+
--prompt "A cinematic mountain lake at sunrise, soft mist, detailed reflections" \
|
| 347 |
+
--base_resolution 1440 --aspect_ratio 1:1 \
|
| 348 |
+
--steps 20 --cfg 5.0 --n 1 --seed 42 \
|
| 349 |
+
--out ./outputs
|
| 350 |
+
```
|
| 351 |
+
|
| 352 |
+
**Batch generation** — join multiple prompts with `|`:
|
| 353 |
+
|
| 354 |
+
```bash
|
| 355 |
+
python inference.py \
|
| 356 |
+
--repo_id "microsoft/Lens-Base" \
|
| 357 |
+
--prompt "a red fox in snow|a glass greenhouse at night"
|
| 358 |
+
```
|
| 359 |
+
|
| 360 |
+
**A100 / V100 (no MXFP4 kernels)** — dequantize the GPT-OSS encoder to bf16:
|
| 361 |
+
|
| 362 |
+
```bash
|
| 363 |
+
python inference.py --repo_id "microsoft/Lens-Base" --prompt "a cat" --disable_mxfp4 --offload
|
| 364 |
+
```
|
| 365 |
+
|
| 366 |
+
### Options
|
| 367 |
+
|
| 368 |
+
| Flag | Description | Default |
|
| 369 |
+
| :--- | :--- | :--- |
|
| 370 |
+
| `--repo_id` | HF repo id (or local path) of the assembled Lens pipeline | `microsoft/Lens` |
|
| 371 |
+
| `--base_resolution` | `1024` or `1440` | `1440` |
|
| 372 |
+
| `--aspect_ratio` | `1:2`, `9:16`, `2:3`, `3:4`, `1:1`, `4:3`, `3:2`, `16:9`, `2:1` | `1:1` |
|
| 373 |
+
| `--steps` | Number of denoising steps | `20` |
|
| 374 |
+
| `--cfg` | Classifier-free guidance scale | `5.0` |
|
| 375 |
+
| `--n` | Number of images per prompt | `1` |
|
| 376 |
+
| `--seed` | Random seed (omit for non-deterministic) | — |
|
| 377 |
+
| `--out` | Output directory | `./outputs` |
|
| 378 |
+
| `--dtype` | Compute dtype: `bfloat16`, `float16`, `float32` | `bfloat16` |
|
| 379 |
+
| `--disable_mxfp4` | Dequantize the GPT-OSS text encoder to `--dtype` (required on A100 / V100; Hopper+ keeps MXFP4 by default for less VRAM) | — |
|
| 380 |
+
| `--offload` | Enable diffusers CPU offload (`text_encoder->transformer->vae`) to reduce peak VRAM | — |
|
| 381 |
+
| `--reasoner` | Refine prompts with the loaded GPT-OSS encoder before generation | — |
|
| 382 |
+
| `--api_url` / `--api_key` / `--api_model` | Use an OpenAI-compatible API for prompt refinement (takes precedence over `--reasoner`) | — |
|
| 383 |
+
|
| 384 |
+
## Citation
|
| 385 |
+
|
| 386 |
+
```bibtex
|
| 387 |
+
@article{zhao2026lens,
|
| 388 |
+
title = {Lens: Rethinking Training Efficiency for Foundational Text-to-Image Models},
|
| 389 |
+
author = {Zhao, Jinjing and Zhang, Sirui and Liang, Zhiyang and Yue, Yang and Zhang, Jiawei and Yang, Qinhong and Dong, Yanchen and Wang, Yitong and Chen, Yunuo and Wu, Xiuyu and Wei, Fangyun and Chen, Dong and Chen, Dongdong and Wan, Ziyu and Shi, Lei and Li, Ji and Luo, Chong and Lu, Yan and Guo, Baining},
|
| 390 |
+
journal = {arXiv preprint arXiv:PLACEHOLDER},
|
| 391 |
+
year = {2026}
|
| 392 |
+
}
|
| 393 |
+
```
|
| 394 |
+
|
| 395 |
+
## Responsible AI
|
| 396 |
+
|
| 397 |
+
The release is intended for research purposes only and does not involve any product or service deployment. Responsible AI considerations were factored into all stages. The datasets used in this paper are public and have been reviewed to ensure there is no personally identifiable information or harmful content. However, as these datasets are sourced from the Internet, potential bias may still be present.
|
| 398 |
+
|
| 399 |
+
## Privacy
|
| 400 |
+
|
| 401 |
+
This project does not collect any usage data. For more information, see the [Microsoft Privacy Statement](https://go.microsoft.com/fwlink/?LinkId=521839).
|
| 402 |
+
|
| 403 |
+
## License
|
| 404 |
+
|
| 405 |
+
This project is released under the [MIT License](LICENSE).
|
assets/gallery/000-1440x1440.png
ADDED
|
Git LFS Details
|
assets/gallery/000-1440x1440.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
A generous portion of classic British fish and chips served on a sheet of white paper, golden crispy beer-battered cod fillet alongside thick-cut chips, a wedge of lemon, mushy peas in a small dish, malt vinegar bottle nearby, wooden pub table, overhead shot
|
assets/gallery/001-1440x1440.png
ADDED
|
Git LFS Details
|
assets/gallery/001-1440x1440.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
The iconic Big Ben clock tower and the Houses of Parliament in London at golden hour, the River Thames reflecting warm amber light, Westminster Bridge in the foreground, a classic red double-decker bus crossing, dramatic clouds lit by sunset
|
assets/gallery/002-1440x1440.png
ADDED
|
Git LFS Details
|
assets/gallery/002-1440x1440.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
La Tour Eiffel au crépuscule vue depuis le Trocadéro, la structure en fer illuminée de milliers de lumières dorées scintillantes, le ciel passant du bleu profond au violet, les fontaines du Trocadéro au premier plan avec des reflets dorés, silhouettes de promeneurs
|
assets/gallery/003-1248x1664.png
ADDED
|
Git LFS Details
|
assets/gallery/003-1248x1664.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
A crystal dragon soaring through an aurora borealis sky, its entire body made of transparent faceted crystal refracting the green and purple aurora light into rainbow spectra, ice particles trailing from its wings, high fantasy digital art
|
assets/gallery/004-1664x1248.png
ADDED
|
Git LFS Details
|
assets/gallery/004-1664x1248.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
Aerial view of Yuanyang rice terraces in Yunnan province at sunrise, thousands of cascading water-filled paddies reflecting golden and pink sky colors, morning mist weaving between terrace layers, lush green hillside with scattered palm trees, drone photography
|
assets/gallery/005-1664x1248.png
ADDED
|
Git LFS Details
|
assets/gallery/005-1664x1248.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
A green iguana basking on a moss-covered fallen log in a tropical rainforest, every scale and spine rendered in sharp detail, dewdrops clinging to its skin, a blurred waterfall and lush tropical foliage in the background, National Geographic wildlife photography style
|
assets/gallery/006-1248x1664.png
ADDED
|
Git LFS Details
|
assets/gallery/006-1248x1664.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
Oil painting portrait of a Renaissance noblewoman in a deep blue velvet dress with pearl drop earrings, soft chiaroscuro lighting revealing delicate skin, craquelure texture on the painted surface, in the style of Vermeer
|
assets/gallery/007-1440x1440.png
ADDED
|
Git LFS Details
|
assets/gallery/007-1440x1440.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
An artisan honey jar with a hand-illustrated vintage botanical label reading "Mountain Wildflower Honey" in brown serif letterpress-style typography with decorative flourishes, detailed ink drawings of wildflowers, clover and honeybees surrounding the text, kraft paper label on clear glass jar
|
assets/gallery/008-1440x1440.png
ADDED
|
Git LFS Details
|
assets/gallery/008-1440x1440.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
Watercolor portrait of a thoughtful young man reading a worn leather book in a Parisian cafe, loose wet-on-wet brushstrokes bleeding into warm amber and burnt sienna washes, visible paper grain texture
|
assets/gallery/009-1664x1248.png
ADDED
|
Git LFS Details
|
assets/gallery/009-1664x1248.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
An explorer's oak desk with an aged world map spread open, a brass sextant, leather-bound navigation journal with handwritten entries, melting candle in a copper holder, scattered compass and quill pen, warm window light, still life photography
|
assets/gallery/010-1664x1248.png
ADDED
|
Git LFS Details
|
assets/gallery/010-1664x1248.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
New York Grand Central Terminal subway station with the classic station name "GRAND CENTRAL" spelled out in elegant white ceramic mosaic tile letters embedded in a dark green tile wall, each letter approximately eight inches tall, ornate tile border frames, the S-curve of train tracks visible
|
assets/gallery/011-1664x1248.png
ADDED
|
Git LFS Details
|
assets/gallery/011-1664x1248.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
A ruby-throated hummingbird hovering in front of a bright red heliconia flower, wings frozen in a figure-eight pattern showing iridescent feather detail, individual water droplets suspended around the bird, high-speed macro photography with dark background
|
assets/gallery/012-1664x1248.png
ADDED
|
Git LFS Details
|
assets/gallery/012-1664x1248.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
An old Remington typewriter with a sheet of cream-colored paper rolled into the carriage, the typed words "Chapter One: The Beginning" visible in slightly uneven Courier typeface with characteristic ink density variations, some letters slightly misaligned, warm desk lamp lighting
|
assets/gallery/013-1664x1248.png
ADDED
|
Git LFS Details
|
assets/gallery/013-1664x1248.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
The Great Wildebeest Migration crossing the Mara River at golden hour, hundreds of animals plunging into churning water sending spray everywhere, dust clouds rising from the riverbank, dramatic backlit scene, National Geographic documentary style
|
assets/gallery/014-1248x1664.png
ADDED
|
Git LFS Details
|
assets/gallery/014-1248x1664.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
A charming flower shop storefront window with hand-painted white script lettering on the glass reading "Fresh Flowers Daily" in flowing connected cursive with decorative swashes, roses and peonies arranged in buckets visible through the lettering, morning sunlight catching the painted letters
|
assets/gallery/015-1248x1664.png
ADDED
|
Git LFS Details
|
assets/gallery/015-1248x1664.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
A steampunk floating sky-city built on massive gear-driven platforms, brass and copper towers connected by chain bridges, steam-powered airships and hot air balloons docking at various levels, sunset clouds below the city, detailed concept art
|
assets/gallery/016-1664x1248.png
ADDED
|
Git LFS Details
|
assets/gallery/016-1664x1248.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
Milford Sound in New Zealand at dawn, a perfect mirror reflection of steep fjord walls on glass-still water, waterfalls streaming down thousand-foot cliffs, morning mist hovering above the water surface, panoramic landscape photography
|
assets/gallery/017-1248x1664.png
ADDED
|
Git LFS Details
|
assets/gallery/017-1248x1664.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
An Indian Bharatanatyam classical dancer in the aramandi pose, bronze ankle bells and elaborate hand mudra gestures, rich silk costume with gold temple jewelry, captured mid-performance with dramatic stage lighting
|
assets/gallery/018-1248x1664.png
ADDED
|
Git LFS Details
|
assets/gallery/018-1248x1664.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
A narrow alleyway in Marrakech's old medina with walls painted in vivid cobalt blue, colorful handwoven rugs and ceramic plates displayed along the walls, ornate wooden doors, warm sunlight from above creating dramatic shadows, Moroccan architecture
|
assets/gallery/019-1664x1248.png
ADDED
|
Git LFS Details
|
assets/gallery/019-1664x1248.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
A rustic wooden sign at a fishing village dock reading "Fresh Catch of the Day" in hand-carved letters painted nautical blue, thick hemp rope threaded through the sign as a border, fishing nets and lobster traps stacked in the background, seaside atmosphere
|
assets/gallery/020-1664x1248.png
ADDED
|
Git LFS Details
|
assets/gallery/020-1664x1248.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
A sunken shipwreck on the ocean floor completely overgrown with colorful coral formations, schools of tropical fish swimming through the broken hull and portholes, shafts of sunlight streaming down from the surface above, underwater archaeology photography
|
assets/gallery/021-1664x1248.png
ADDED
|
Git LFS Details
|
assets/gallery/021-1664x1248.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
Zhangjiajie pillar mountains rising above a sea of clouds at sunrise, golden light painting the sandstone peaks, the surreal Avatar-like floating mountain landscape stretching to the horizon, aerial drone photography capturing immense vertical scale
|
assets/gallery/022-1440x1440.png
ADDED
|
Git LFS Details
|
assets/gallery/022-1440x1440.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
A red-eyed tree frog perched on a bright red bromeliad flower in the Costa Rican cloud forest, its neon green body contrasting with blue-striped flanks and orange feet, water droplets on its smooth skin, extreme macro with ring flash lighting
|
assets/gallery/023-1248x1664.png
ADDED
|
Git LFS Details
|
assets/gallery/023-1248x1664.txt
ADDED
|
@@ -0,0 +1 @@
|
|
|
|
|
|
|
| 1 |
+
Inside a massive limestone cave, ancient stalactites and stalagmites meeting to form columns, an underground river reflecting the formations like a mirror, subtle warm lighting revealing millions of years of mineral deposits, spelunking exploration photography
|