cmolinac commited on
Commit
09f2bb8
Β·
verified Β·
1 Parent(s): da924e8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -206
README.md CHANGED
@@ -1,206 +0,0 @@
1
-
2
- # NEST3D: A High-Resolution Multimodal Dataset of Sociable Weaver Tree Nests
3
-
4
- ![NEST3D Workflow](./workflow.png)
5
-
6
- ## Dataset Description
7
-
8
- NEST3D is a multimodal dataset of 104 sociable weaver nests, combining drone-based RGB and multispectral imagery with a semantically annotated 3D RGB point cloud. It captures trees hosting these nests through drone-based remote sensing, providing rich spatial and spectral information to benchmark and advance scene-level semantic segmentation methods for computer vision and ecological monitoring applications.
9
-
10
- ### Key Characteristics
11
-
12
- - **Modality**: Multimodal (RGB imagery, multispectral bands, 3D point clouds)
13
- - **Task**: Scene-level semantic segmentation
14
- - **Scale**: Multiple tree-nest scenes with consistent spatial and spectral coverage
15
- - **Annotation**: Point-level semantic labels for 3D point clouds
16
- - **Data Source**: Drone-based RGB and multispectral imagery
17
- - **Application Domain**: Ecological monitoring, wildlife management, 3d semantic segmenation, 3d reconstruction.
18
-
19
- ## Dataset Organization
20
-
21
- The dataset is organized into modality-specific directories to support flexible access and reuse:
22
-
23
- ### Directory Structure
24
-
25
- ```
26
- NEST3D/
27
- β”œβ”€β”€ train/
28
- β”‚ β”œβ”€β”€ sample_001/
29
- β”‚ β”‚ β”œβ”€β”€ RGB/ # RGB drone images
30
- β”‚ β”‚ β”‚ β”œβ”€β”€ sample001_RGB_001.JPG
31
- β”‚ β”‚ β”‚ └── ...
32
- β”‚ β”‚ β”œβ”€β”€ MS/ # Multispectral imagery
33
- β”‚ β”‚ β”‚ β”œβ”€β”€ Green/
34
- β”‚ β”‚ β”‚ β”‚ β”œβ”€β”€ sample001_G_001.TIF
35
- β”‚ β”‚ β”‚ β”‚ └── ...
36
- β”‚ β”‚ β”‚ β”œβ”€β”€ Red/
37
- β”‚ β”‚ β”‚ β”‚ β”œβ”€β”€ sample001_R_001.TIF
38
- β”‚ β”‚ β”‚ β”‚ └── ...
39
- β”‚ β”‚ β”‚ β”œβ”€β”€ Red_Edge/
40
- β”‚ β”‚ β”‚ β”‚ β”œβ”€β”€ sample001_RE_001.TIF
41
- β”‚ β”‚ β”‚ β”‚ └── ...
42
- β”‚ β”‚ β”‚ └── NIR/
43
- β”‚ β”‚ β”‚ β”œβ”€β”€ sample001_NIR_001.TIF
44
- β”‚ β”‚ β”‚ └── ...
45
- β”‚ β”‚ └── sample001.npy # 3D point cloud with labels
46
- β”‚ └── sample_002/
47
- β”‚ └── ...
48
- β”‚
49
- └── test/
50
- β”œβ”€β”€ sample_084/
51
- β”‚ β”œβ”€β”€ RGB/
52
- β”‚ β”œβ”€β”€ MS/
53
- β”‚ └── sample084.npy
54
- └── ...
55
- ```
56
-
57
- ### Data Modalities
58
-
59
- #### 1. **RGB Imagery**
60
- - Raw drone images from aerial acquisition
61
- - Format: JPEG
62
- - Organized by data split and scene identifier
63
- - Example path: `train/sample_001/RGB/sample001_RGB_119.JPG`
64
-
65
- #### 2. **Multispectral Imagery**
66
- - Four spectral bands from the same acquisitions as RGB
67
- - Organized into four band-specific folders:
68
- - **Green (G)**: Green channel imagery
69
- - **Red (R)**: Red channel imagery
70
- - **Red Edge (RE)**: Red Edge channel for vegetation analysis
71
- - **NIR**: Near-Infrared channel for vegetation health assessment
72
- - Format: GeoTIFF (.TIF)
73
- - Example paths:
74
- - `train/sample_001/MS/Green/sample001_G_119.TIF`
75
- - `train/sample_001/MS/Red/sample001_R_119.TIF`
76
- - `train/sample_001/MS/Red_Edge/sample001_RE_119.TIF`
77
- - `train/sample_001/MS/NIR/sample001_NIR_119.TIF`
78
-
79
- #### 3. **3D Point Clouds**
80
- - One NumPy file per scene containing the complete 3D reconstruction
81
- - Format: `.npy` (NumPy binary format)
82
- - Per-point attributes: `[x, y, z, r, g, b, label]`
83
- - **x, y, z**: 3D spatial coordinates (meters)
84
- - **r, g, b**: RGB color values (0-255)
85
- - **label**: Semantic class label (integer)
86
- - Example path: `train/sample_001/sample001.npy`
87
-
88
- ## Data Splits
89
-
90
- The dataset is divided into fixed training and test sets:
91
-
92
- - **Training Set**: Used for model training and development
93
- - **Test Set**: Reserved for model evaluation and benchmarking
94
-
95
- Each split contains a consistent collection of scenes to ensure reliable evaluation.
96
-
97
- ## Usage
98
-
99
- ### Loading 3D Point Clouds
100
-
101
- ```python
102
- import numpy as np
103
-
104
- # Load point cloud with semantic labels
105
- point_cloud = np.load('train/sample_001/sample001.npy')
106
-
107
- # Extract coordinates
108
- xyz = point_cloud[:, :3]
109
-
110
- # Extract colors
111
- rgb = point_cloud[:, 3:6]
112
-
113
- # Extract semantic labels
114
- labels = point_cloud[:, 6]
115
- ```
116
-
117
- ### Loading Multispectral Imagery
118
-
119
- ```python
120
- from PIL import Image
121
- import numpy as np
122
-
123
- # Load a single band
124
- green_band = np.array(Image.open('train/sample_001/MS/Green/sample001_G_001.TIF'))
125
-
126
- # Load all four bands for a given image
127
- green = np.array(Image.open('train/sample_001/MS/Green/sample001_G_001.TIF'))
128
- red = np.array(Image.open('train/sample_001/MS/Red/sample001_R_001.TIF'))
129
- red_edge = np.array(Image.open('train/sample_001/MS/Red_Edge/sample001_RE_001.TIF'))
130
- nir = np.array(Image.open('train/sample_001/MS/NIR/sample001_NIR_001.TIF'))
131
-
132
- # Stack into multiband image
133
- multispectral = np.stack([green, red, red_edge, nir], axis=-1)
134
- ```
135
-
136
- ### Using with Hugging Face Datasets Library
137
-
138
- ```python
139
- from datasets import load_dataset
140
-
141
- # Load the dataset
142
- dataset = load_dataset('NEST3D/dataset')
143
- ```
144
-
145
- ## Downloading the Dataset
146
-
147
- ### Option 1: Using Hugging Face Hub
148
- ```bash
149
- pip install huggingface_hub
150
-
151
- huggingface-cli download NEST3D/dataset --repo-type dataset --local-dir ./NEST3D
152
- ```
153
-
154
- ### Option 2: Direct Download
155
- Visit [https://huggingface.co/datasets/NEST3D/dataset](https://huggingface.co/datasets/NEST3D/dataset) and download files directly.
156
-
157
- ## Citation
158
-
159
- If you use this dataset in your research, please cite:
160
-
161
- ```bibtex
162
- @dataset{NEST3D,
163
- title={NEST3D: A High-Resolution Multimodal Dataset of Sociable Weaver Tree Nests},
164
- organization={NEST3D},
165
- url={https://huggingface.co/datasets/NEST3D},
166
- year={2026}
167
- }
168
- ```
169
-
170
- ## License
171
-
172
- This dataset is released under the Creative Commons Attribution 4.0 International (CC-BY-4.0) License. You are free to use, distribute, and adapt the dataset as long as you provide appropriate attribution.
173
-
174
- See [LICENSE](LICENSE) for details.
175
-
176
- ## Dataset Information
177
-
178
- - **Total Size**:
179
- - **Number of Scenes**: 104 samples
180
- - **Modalities**: RGB, Multispectral (4 bands), 3D Point Clouds
181
- - **Image Format**: JPEG (RGB), GeoTIFF (Multispectral)
182
- - **Point Cloud Format**: NumPy arrays
183
- - **Annotation Type**: Per-point semantic labels
184
-
185
- ## Acknowledgments
186
-
187
- This work was funded by:
188
-
189
- - **European Union's Horizon Europe** research and innovation programme through the Marie SkΕ‚odowska-Curie project **"WildDrone – Autonomous Drones for Nature Conservation"** (grant agreement no. 101071224)
190
- - **EPSRC-funded** "Autonomous Drones for Nature Conservation Missions" grant (EP/X029077/1)
191
- - **Swiss State Secretariat for Education, Research and Innovation (SERI)** under contract number 22.00280
192
-
193
- We extend our gratitude to our collaborators and field partners in Namibia for their invaluable support during data collection.
194
-
195
- ## Contact & Support
196
-
197
- For questions, issues, or contributions, please visit the [dataset discussion forum](https://huggingface.co/datasets/NEST3D/dataset/discussions).
198
-
199
- ## Disclaimer
200
-
201
- The NEST3D dataset is provided as-is for research and development purposes. Users are responsible for ensuring their use complies with all applicable laws and regulations, particularly regarding ecological monitoring and wildlife protection.
202
-
203
- ---
204
-
205
- **Last Updated**: February 2026
206
- **Dataset Version**: 1.0