robertoesa commited on
Commit
075fb2f
Β·
verified Β·
1 Parent(s): 6ab5ce1

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +80 -35
README.md CHANGED
@@ -10,69 +10,114 @@ thumbnail: >-
10
  https://cdn-uploads.huggingface.co/production/uploads/68a43a86eea45496edb28ba6/hJyITwdmrofMtEtX8qFmW.png
11
  ---
12
 
13
- # 🏒 Maya4
 
 
14
 
15
- ![Maya4 Logo](Maya4.png)
16
 
 
 
 
17
 
18
- # 🌌 Maya4
 
 
19
 
20
- **Maya4** is a project dedicated to curating and providing **multi-level intermediate SAR representations** from **Sentinel-1** acquisitions, spanning the entire chain from **Level 0 to Level 1**.
21
 
22
- The name *Maya4* draws inspiration from the **Māyā veil** in philosophy, where reality is hidden behind successive layersβ€”just as radar echoes undergo transformations before forming a final SAR image.
23
 
24
- ---
 
 
25
 
26
- ## πŸš€ Mission
 
 
 
 
27
 
28
- - πŸ“‘ Curate and structure **Sentinel-1 Stripmap SAR data**
29
- - 🧩 Provide **intermediate signal representations** for research & ML pre-training
30
- - ☁️ Enable scalable, **cloud-native access** in **Zarr format**
31
 
32
  ---
33
 
34
- ## πŸ“¦ Pre-Training Dataset Spaces
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
35
 
36
- Maya4 provides **Pre-Training (PT) dataset** exposing intermediate SAR signal states in a modern (Zarr Format):
37
 
38
- | πŸ“‚ Dataset Split | πŸ“Š Contents | πŸ›°οΈ Acquisition Mode | πŸ’Ύ Size |
39
- |------------------|-------------|----------------------|---------|
40
- | **pt1** | [LINK](https://huggingface.co/datasets/Maya4/pt1) | Stripmap | 17 TB |
41
- | **pt2** | [LINK](https://huggingface.co/datasets/Maya4/pt2) | Stripmap | 17 TB |
42
- | **pt3** | [LINK](https://huggingface.co/datasets/Maya4/pt3) | Stripmap | 17 TB |
43
- | **pt4** | [LINK](https://huggingface.co/datasets/Maya4/pt4) | Stripmap | 17 TB |
44
- | **Total** | β€” | β€” | **68 TB** |
45
 
 
 
 
46
 
47
- ### πŸ“Š Processing Levels
48
 
49
- A major characteristic of this dataset lies in its sharding multi-level organisation:
50
 
51
- <img src="https://i.ibb.co/Wv7SXd4N/intermediates.jpg" alt="Intermediates" width="100%">
52
 
 
 
 
53
 
54
- | 🌐 Level | πŸ”€ Abbrev. | πŸ“ Description | 🎯 Purpose / Value |
55
- |----------|-----------|----------------|--------------------|
56
- | πŸ“‘ **Raw** | raw | Unprocessed radar echoes as recorded by Sentinel-1 | Baseline data; allows full custom SAR processing |
57
- | 🎚️ **Range Compressed** | rc | Echoes compressed in the range dimension via matched filtering | Improves SNR; isolates scatterers along range |
58
- | 🎯 **Range Cell Migration Corrected** | rcmc | Motion-compensated echoes with corrected range migration | Preserves geometric fidelity; enables proper azimuth focusing |
59
- | πŸ–ΌοΈ **Azimuth Compressed** | ac | Fully focused SAR image in slant-range geometry | Standard **Level-1 product**; interpretable imagery |
60
 
 
61
 
 
62
 
63
- ---
64
 
 
 
 
 
 
 
65
 
66
- ### 🌍 PT Use Cases
67
 
68
- The following use cases can be addressed with Maya4:
69
 
70
- ![🌌 Maya4 Dataset Cards](Maya4_cards.jpg)
71
 
 
 
 
 
 
 
 
 
72
 
73
  ---
74
 
75
- ## πŸ™ Acknowledgements
 
 
76
 
77
- - πŸ›°οΈ Data provided by the **Copernicus Sentinel-1 mission (ESA)**
78
- - 🏒 Curated and maintained by the **Maya4 organization**
 
10
  https://cdn-uploads.huggingface.co/production/uploads/68a43a86eea45496edb28ba6/hJyITwdmrofMtEtX8qFmW.png
11
  ---
12
 
13
+ <p align="center">
14
+ <img src="Maya4.png" alt="Maya4 Logo" width="780">
15
+ </p>
16
 
17
+ <h1 align="center">Maya4</h1>
18
 
19
+ <p align="center">
20
+ Multi-level intermediate SAR representations from Sentinel-1 Stripmap acquisitions
21
+ </p>
22
 
23
+ <p align="center">
24
+ <strong>Level-0 to Level-1</strong> Β· <strong>Zarr-native</strong> Β· <strong>Cloud-accessible</strong> Β· <strong>2 TB</strong>
25
+ </p>
26
 
27
+ ---
28
 
29
+ ## Overview
30
 
31
+ **Maya4** is a curated SAR data resource designed to expose the full progression of **Sentinel-1 Stripmap acquisitions** from **raw radar echoes** to **fully focused Level-1 imagery**.
32
+
33
+ Unlike conventional datasets that provide only final products, Maya4 preserves and organizes the **intermediate signal representations** generated across the SAR processing chain. This makes the dataset particularly suitable for:
34
 
35
+ - SAR signal processing research
36
+ - physics-aware machine learning
37
+ - self-supervised pre-training
38
+ - representation learning across processing levels
39
+ - algorithm benchmarking and reconstruction studies
40
 
41
+ The name *Maya4* draws from the concept of **Māyā**: the idea that reality is revealed through successive layers. In the same way, SAR imagery emerges through a sequence of transformations from raw measurements to interpretable image products.
 
 
42
 
43
  ---
44
 
45
+ ## Why Maya4
46
+
47
+ <table>
48
+ <tr>
49
+ <td valign="top" width="33%">
50
+ <h3>Multi-level access</h3>
51
+ <p>Provides consistent access to multiple SAR processing stages rather than only the final image product.</p>
52
+ </td>
53
+ <td valign="top" width="33%">
54
+ <h3>Research-oriented structure</h3>
55
+ <p>Designed for analysis of information flow, model pre-training, and development of custom SAR pipelines.</p>
56
+ </td>
57
+ <td valign="top" width="33%">
58
+ <h3>Cloud-native delivery</h3>
59
+ <p>Distributed in <strong>Zarr</strong> format for scalable storage, streaming, and computation.</p>
60
+ </td>
61
+ </tr>
62
+ </table>
63
 
64
+ ---
65
 
66
+ ## Dataset Access
 
 
 
 
 
 
67
 
68
+ | Dataset | Access | Mission / Mode | Format | Size |
69
+ |---------|--------|----------------|--------|------|
70
+ | **Maya4** | [Open bucket](https://huggingface.co/buckets/ESA-philab/Maya4) | Sentinel-1 Stripmap | Zarr | 2 TB |
71
 
72
+ ---
73
 
74
+ ## Processing Chain
75
 
76
+ A defining feature of Maya4 is its **sharded multi-level organization**, which preserves the major intermediate states of the SAR focusing pipeline.
77
 
78
+ <p align="center">
79
+ <img src="https://i.ibb.co/Wv7SXd4N/intermediates.jpg" alt="Maya4 intermediate SAR representations" width="100%">
80
+ </p>
81
 
82
+ | Processing Level | Abbrev. | Description | Technical Value |
83
+ |------------------|---------|-------------|-----------------|
84
+ | **Raw** | `raw` | Unprocessed radar echoes as acquired by Sentinel-1 | Enables custom end-to-end SAR processing and low-level signal analysis |
85
+ | **Range Compressed** | `rc` | Echoes compressed in the range dimension using matched filtering | Improves signal-to-noise ratio and resolves scatterers along range |
86
+ | **Range Cell Migration Corrected** | `rcmc` | Echoes after compensation of range migration effects | Preserves geometric consistency and prepares the signal for azimuth focusing |
87
+ | **Azimuth Compressed** | `ac` | Fully focused SAR image in slant-range geometry | Corresponds to the interpretable focused SAR image product |
88
 
89
+ ---
90
 
91
+ ## Technical Positioning
92
 
93
+ Maya4 is intended to support work at the intersection of:
94
 
95
+ - SAR signal processing
96
+ - remote sensing foundation models
97
+ - self-supervised and masked modeling approaches
98
+ - physics-guided representation learning
99
+ - inverse problems and reconstruction
100
+ - benchmarking of processing-aware architectures
101
 
102
+ Because the dataset exposes multiple internal stages of SAR formation, it enables experiments that are not possible with image-only repositories.
103
 
104
+ ---
105
 
106
+ ## Key Characteristics
107
 
108
+ | Attribute | Value |
109
+ |-----------|-------|
110
+ | **Mission** | Copernicus Sentinel-1 |
111
+ | **Acquisition Mode** | Stripmap |
112
+ | **Processing Coverage** | Level-0 to Level-1 intermediates |
113
+ | **Primary Distribution Format** | Zarr |
114
+ | **Access Paradigm** | Cloud-native bucket access |
115
+ | **Primary Target Users** | SAR researchers, ML practitioners, remote sensing scientists |
116
 
117
  ---
118
 
119
+ ## Acknowledgements
120
+
121
+ Maya4 is based on data from the **Copernicus Sentinel-1 mission** of the **European Space Agency (ESA)**.
122
 
123
+ Dataset curation and organization are maintained by the **Maya4 organization**.