cmolinac commited on
Commit
80a7545
Β·
verified Β·
1 Parent(s): bc8eed2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +218 -8
README.md CHANGED
@@ -1,10 +1,220 @@
1
- ---
2
- title: README
3
- emoji: 🌍
4
- colorFrom: purple
5
- colorTo: pink
6
- sdk: static
7
- pinned: false
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
  ---
9
 
10
- Edit this `README.md` markdown file to author your organization card.
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license:
5
+ - cc-by-4.0
6
+ tags:
7
+ - semantic-segmentation
8
+ - scene-understanding
9
+ - 3d-point-clouds
10
+ - multimodal
11
+ - drone-imagery
12
+ pretty_name: NEST3D - Sociable Weaver Nest 3D Dataset
13
+ title: 'NEST3D: A High-Resolution Multimodal Dataset of Sociable Weaver Tree Nests '
14
+ ---
15
+
16
+ # NEST3D: A High-Resolution Multimodal Dataset of Sociable Weaver Tree Nests
17
+
18
+ ![NEST3D Workflow](./workflow.png)
19
+
20
+ ## Dataset Description
21
+
22
+ NEST3D is a multimodal dataset of 104 sociable weaver nests, combining drone-based RGB and multispectral imagery with a semantically annotated 3D RGB point cloud. It captures trees hosting these nests through drone-based remote sensing, providing rich spatial and spectral information to benchmark and advance scene-level semantic segmentation methods for computer vision and ecological monitoring applications.
23
+
24
+ ### Key Characteristics
25
+
26
+ - **Modality**: Multimodal (RGB imagery, multispectral bands, 3D point clouds)
27
+ - **Task**: Scene-level semantic segmentation
28
+ - **Scale**: Multiple tree-nest scenes with consistent spatial and spectral coverage
29
+ - **Annotation**: Point-level semantic labels for 3D point clouds
30
+ - **Data Source**: Drone-based RGB and multispectral imagery
31
+ - **Application Domain**: Ecological monitoring, wildlife management, 3d semantic segmenation, 3d reconstruction.
32
+
33
+ ## Dataset Organization
34
+
35
+ The dataset is organized into modality-specific directories to support flexible access and reuse:
36
+
37
+ ### Directory Structure
38
+
39
+ ```
40
+ NEST3D/
41
+ β”œβ”€β”€ train/
42
+ β”‚ β”œβ”€β”€ sample_001/
43
+ β”‚ β”‚ β”œβ”€β”€ RGB/ # RGB drone images
44
+ β”‚ β”‚ β”‚ β”œβ”€β”€ sample001_RGB_001.JPG
45
+ β”‚ β”‚ β”‚ └── ...
46
+ β”‚ β”‚ β”œβ”€β”€ MS/ # Multispectral imagery
47
+ β”‚ β”‚ β”‚ β”œβ”€β”€ Green/
48
+ β”‚ β”‚ β”‚ β”‚ β”œβ”€β”€ sample001_G_001.TIF
49
+ β”‚ β”‚ β”‚ β”‚ └── ...
50
+ β”‚ β”‚ β”‚ β”œβ”€β”€ Red/
51
+ β”‚ β”‚ β”‚ β”‚ β”œβ”€β”€ sample001_R_001.TIF
52
+ β”‚ β”‚ β”‚ β”‚ └── ...
53
+ β”‚ β”‚ β”‚ β”œβ”€β”€ Red_Edge/
54
+ β”‚ β”‚ β”‚ β”‚ β”œβ”€β”€ sample001_RE_001.TIF
55
+ β”‚ β”‚ β”‚ β”‚ └── ...
56
+ β”‚ β”‚ β”‚ └── NIR/
57
+ β”‚ β”‚ β”‚ β”œβ”€β”€ sample001_NIR_001.TIF
58
+ β”‚ β”‚ β”‚ └── ...
59
+ β”‚ β”‚ └── sample001.npy # 3D point cloud with labels
60
+ β”‚ └── sample_002/
61
+ β”‚ └── ...
62
+ β”‚
63
+ └── test/
64
+ β”œβ”€β”€ sample_084/
65
+ β”‚ β”œβ”€β”€ RGB/
66
+ β”‚ β”œβ”€β”€ MS/
67
+ β”‚ └── sample084.npy
68
+ └── ...
69
+ ```
70
+
71
+ ### Data Modalities
72
+
73
+ #### 1. **RGB Imagery**
74
+ - Raw drone images from aerial acquisition
75
+ - Format: JPEG
76
+ - Organized by data split and scene identifier
77
+ - Example path: `train/sample_001/RGB/sample001_RGB_119.JPG`
78
+
79
+ #### 2. **Multispectral Imagery**
80
+ - Four spectral bands from the same acquisitions as RGB
81
+ - Organized into four band-specific folders:
82
+ - **Green (G)**: Green channel imagery
83
+ - **Red (R)**: Red channel imagery
84
+ - **Red Edge (RE)**: Red Edge channel for vegetation analysis
85
+ - **NIR**: Near-Infrared channel for vegetation health assessment
86
+ - Format: GeoTIFF (.TIF)
87
+ - Example paths:
88
+ - `train/sample_001/MS/Green/sample001_G_119.TIF`
89
+ - `train/sample_001/MS/Red/sample001_R_119.TIF`
90
+ - `train/sample_001/MS/Red_Edge/sample001_RE_119.TIF`
91
+ - `train/sample_001/MS/NIR/sample001_NIR_119.TIF`
92
+
93
+ #### 3. **3D Point Clouds**
94
+ - One NumPy file per scene containing the complete 3D reconstruction
95
+ - Format: `.npy` (NumPy binary format)
96
+ - Per-point attributes: `[x, y, z, r, g, b, label]`
97
+ - **x, y, z**: 3D spatial coordinates (meters)
98
+ - **r, g, b**: RGB color values (0-255)
99
+ - **label**: Semantic class label (integer)
100
+ - Example path: `train/sample_001/sample001.npy`
101
+
102
+ ## Data Splits
103
+
104
+ The dataset is divided into fixed training and test sets:
105
+
106
+ - **Training Set**: Used for model training and development
107
+ - **Test Set**: Reserved for model evaluation and benchmarking
108
+
109
+ Each split contains a consistent collection of scenes to ensure reliable evaluation.
110
+
111
+ ## Usage
112
+
113
+ ### Loading 3D Point Clouds
114
+
115
+ ```python
116
+ import numpy as np
117
+
118
+ # Load point cloud with semantic labels
119
+ point_cloud = np.load('train/sample_001/sample001.npy')
120
+
121
+ # Extract coordinates
122
+ xyz = point_cloud[:, :3]
123
+
124
+ # Extract colors
125
+ rgb = point_cloud[:, 3:6]
126
+
127
+ # Extract semantic labels
128
+ labels = point_cloud[:, 6]
129
+ ```
130
+
131
+ ### Loading Multispectral Imagery
132
+
133
+ ```python
134
+ from PIL import Image
135
+ import numpy as np
136
+
137
+ # Load a single band
138
+ green_band = np.array(Image.open('train/sample_001/MS/Green/sample001_G_001.TIF'))
139
+
140
+ # Load all four bands for a given image
141
+ green = np.array(Image.open('train/sample_001/MS/Green/sample001_G_001.TIF'))
142
+ red = np.array(Image.open('train/sample_001/MS/Red/sample001_R_001.TIF'))
143
+ red_edge = np.array(Image.open('train/sample_001/MS/Red_Edge/sample001_RE_001.TIF'))
144
+ nir = np.array(Image.open('train/sample_001/MS/NIR/sample001_NIR_001.TIF'))
145
+
146
+ # Stack into multiband image
147
+ multispectral = np.stack([green, red, red_edge, nir], axis=-1)
148
+ ```
149
+
150
+ ### Using with Hugging Face Datasets Library
151
+
152
+ ```python
153
+ from datasets import load_dataset
154
+
155
+ # Load the dataset
156
+ dataset = load_dataset('NEST3D/dataset')
157
+ ```
158
+
159
+ ## Downloading the Dataset
160
+
161
+ ### Option 1: Using Hugging Face Hub
162
+ ```bash
163
+ pip install huggingface_hub
164
+
165
+ huggingface-cli download NEST3D/dataset --repo-type dataset --local-dir ./NEST3D
166
+ ```
167
+
168
+ ### Option 2: Direct Download
169
+ Visit [https://huggingface.co/datasets/NEST3D/dataset](https://huggingface.co/datasets/NEST3D/dataset) and download files directly.
170
+
171
+ ## Citation
172
+
173
+ If you use this dataset in your research, please cite:
174
+
175
+ ```bibtex
176
+ @dataset{NEST3D,
177
+ title={NEST3D: A High-Resolution Multimodal Dataset of Sociable Weaver Tree Nests},
178
+ organization={NEST3D},
179
+ url={https://huggingface.co/datasets/NEST3D},
180
+ year={2026}
181
+ }
182
+ ```
183
+
184
+ ## License
185
+
186
+ This dataset is released under the Creative Commons Attribution 4.0 International (CC-BY-4.0) License. You are free to use, distribute, and adapt the dataset as long as you provide appropriate attribution.
187
+
188
+ See [LICENSE](LICENSE) for details.
189
+
190
+ ## Dataset Information
191
+
192
+ - **Total Size**:
193
+ - **Number of Scenes**: 104 samples
194
+ - **Modalities**: RGB, Multispectral (4 bands), 3D Point Clouds
195
+ - **Image Format**: JPEG (RGB), GeoTIFF (Multispectral)
196
+ - **Point Cloud Format**: NumPy arrays
197
+ - **Annotation Type**: Per-point semantic labels
198
+
199
+ ## Acknowledgments
200
+
201
+ This work was funded by:
202
+
203
+ - **European Union's Horizon Europe** research and innovation programme through the Marie SkΕ‚odowska-Curie project **"WildDrone – Autonomous Drones for Nature Conservation"** (grant agreement no. 101071224)
204
+ - **EPSRC-funded** "Autonomous Drones for Nature Conservation Missions" grant (EP/X029077/1)
205
+ - **Swiss State Secretariat for Education, Research and Innovation (SERI)** under contract number 22.00280
206
+
207
+ We extend our gratitude to our collaborators and field partners in Namibia for their invaluable support during data collection.
208
+
209
+ ## Contact & Support
210
+
211
+ For questions, issues, or contributions, please visit the [dataset discussion forum](https://huggingface.co/datasets/NEST3D/dataset/discussions).
212
+
213
+ ## Disclaimer
214
+
215
+ The NEST3D dataset is provided as-is for research and development purposes. Users are responsible for ensuring their use complies with all applicable laws and regulations, particularly regarding ecological monitoring and wildlife protection.
216
+
217
  ---
218
 
219
+ **Last Updated**: February 2026
220
+ **Dataset Version**: 1.0