Update README.md
Browse files
README.md
CHANGED
|
@@ -147,61 +147,43 @@ This AI model can be embedded as an Application Programming Interface (API) call
|
|
| 147 |
|
| 148 |
## Training Dataset
|
| 149 |
|
| 150 |
-
**
|
| 151 |
-
|
| 152 |
-
|
| 153 |
-
** Data
|
| 154 |
-
|
| 155 |
-
**
|
| 156 |
-
* 1 Million to 1 Billion Images <br>
|
| 157 |
-
** Data Collection Method by dataset <br>
|
| 158 |
-
* Automated <br>
|
| 159 |
-
** Labeling Method by dataset <br>
|
| 160 |
-
* Not Applicable (no labels are needed) <br>
|
| 161 |
**Properties:** 700 Million Images <br>
|
| 162 |
|
| 163 |
## Evaluation Datasets
|
| 164 |
|
| 165 |
-
ImageNet
|
| 166 |
-
|
| 167 |
-
|
| 168 |
-
**
|
| 169 |
-
|
| 170 |
-
**
|
| 171 |
-
|
| 172 |
-
**
|
| 173 |
-
* 1,281,167 <br>
|
| 174 |
-
** Validation Images <br>
|
| 175 |
-
* 50,000 <br>
|
| 176 |
-
** Test Images <br>
|
| 177 |
-
* 100,000
|
| 178 |
|
| 179 |
To perform the semantic segmentation evaluation, we use training sets from ADE20K and PascalVOC to train a linear layer, and subsequently performed evaluations on the validation set.
|
| 180 |
See below for further details:
|
| 181 |
|
| 182 |
-
ADE20k
|
| 183 |
-
|
| 184 |
-
|
| 185 |
-
**
|
| 186 |
-
|
| 187 |
-
**
|
| 188 |
-
|
| 189 |
-
|
| 190 |
-
|
| 191 |
-
|
| 192 |
-
|
| 193 |
-
|
| 194 |
-
|
| 195 |
-
**
|
| 196 |
-
|
| 197 |
-
** Data Collection <br>
|
| 198 |
-
* Human <br>
|
| 199 |
-
** Labeling Method <br>
|
| 200 |
-
* Human <br>
|
| 201 |
-
** Training Images <br>
|
| 202 |
-
* 1,464 <br>
|
| 203 |
-
** Validation Images <br>
|
| 204 |
-
* 1,449 <br>
|
| 205 |
|
| 206 |
| Benchmark | C-RADIOv3-B | C-RADIOv3-L | C-RADIOv4-SO400M | C-RADIOv3-H | C-RADIOv4-H |
|
| 207 |
|-----------|-------------|-------------|------------------|-------------|-------------|
|
|
@@ -273,4 +255,4 @@ Field | Response
|
|
| 273 |
Model Application Field(s): | Generation of visual embeddings
|
| 274 |
Describe the life critical impact (if present). | Not Applicable
|
| 275 |
Use Case Restrictions: | Abide by NVIDIA Open Model License Agreement
|
| 276 |
-
Model and dataset restrictions: | The Principle of least privilege (PoLP) is applied limiting access for dataset generation and model development. Restrictions enforce dataset access during training, and dataset license constraints adhered to.
|
|
|
|
| 147 |
|
| 148 |
## Training Dataset
|
| 149 |
|
| 150 |
+
**NV-CC-Img-Text-Dataset**
|
| 151 |
+
|
| 152 |
+
**Data Modality:** Image <br>
|
| 153 |
+
**Image Training Data Size:** 1 Million to 1 Billion Images <br>
|
| 154 |
+
**Data Collection Method by dataset:** Automated <br>
|
| 155 |
+
**Labeling Method by dataset:** Not Applicable (no labels are needed) <br>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 156 |
**Properties:** 700 Million Images <br>
|
| 157 |
|
| 158 |
## Evaluation Datasets
|
| 159 |
|
| 160 |
+
**ImageNet**
|
| 161 |
+
|
| 162 |
+
**Link:** [ImageNet](https://www.image-net.org/) <br>
|
| 163 |
+
**Data Collection:** Automated <br>
|
| 164 |
+
**Labeling Method:** Human <br>
|
| 165 |
+
**Training Images:** 1,281,167 <br>
|
| 166 |
+
**Validation Images:** 50,000 <br>
|
| 167 |
+
**Test Images:** 100,000 <br>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 168 |
|
| 169 |
To perform the semantic segmentation evaluation, we use training sets from ADE20K and PascalVOC to train a linear layer, and subsequently performed evaluations on the validation set.
|
| 170 |
See below for further details:
|
| 171 |
|
| 172 |
+
**ADE20k**
|
| 173 |
+
|
| 174 |
+
**Link:** [ADE20K](https://ade20k.csail.mit.edu/) <br>
|
| 175 |
+
**Data Collection:** Human <br>
|
| 176 |
+
**Labeling Method:** Human <br>
|
| 177 |
+
**Training Images:** 25,574 <br>
|
| 178 |
+
**Validation Images:** 2,000 <br>
|
| 179 |
+
|
| 180 |
+
**Pascal VOC**
|
| 181 |
+
|
| 182 |
+
**Link:** [Pascal VOC](http://host.robots.ox.ac.uk/pascal/VOC/) <br>
|
| 183 |
+
**Data Collection:** Human <br>
|
| 184 |
+
**Labeling Method:** Human <br>
|
| 185 |
+
**Training Images:** 1,464 <br>
|
| 186 |
+
**Validation Images:** 1,449 <br>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 187 |
|
| 188 |
| Benchmark | C-RADIOv3-B | C-RADIOv3-L | C-RADIOv4-SO400M | C-RADIOv3-H | C-RADIOv4-H |
|
| 189 |
|-----------|-------------|-------------|------------------|-------------|-------------|
|
|
|
|
| 255 |
Model Application Field(s): | Generation of visual embeddings
|
| 256 |
Describe the life critical impact (if present). | Not Applicable
|
| 257 |
Use Case Restrictions: | Abide by NVIDIA Open Model License Agreement
|
| 258 |
+
Model and dataset restrictions: | The Principle of least privilege (PoLP) is applied limiting access for dataset generation and model development. Restrictions enforce dataset access during training, and dataset license constraints adhered to.
|