AbstractPhil commited on
Commit
fd95e88
·
verified ·
1 Parent(s): b8e7137

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -26,7 +26,8 @@ base_model:
26
 
27
 
28
 
29
- # Newest: Prepping 12m conceptual-captions bert extractions aka 36m full extractions
 
30
 
31
  The dataset is going to be in pt chunks because they load directly to vram nearly instantly in colab, and the system operates on them quicker than dataloaders.
32
 
 
26
 
27
 
28
 
29
+ # Newest: Prepping 12m conceptual-captions bert extractions aka 36m extractions * 5 models
30
+ So around, 180,000,000 total samples.
31
 
32
  The dataset is going to be in pt chunks because they load directly to vram nearly instantly in colab, and the system operates on them quicker than dataloaders.
33