Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,68 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
---
|
| 4 |
+
|
| 5 |
+
This is the 26 categorical finetune of nomic-bert-2048.
|
| 6 |
+
|
| 7 |
+
130,000,000 - 4-30 masked token samples with 80% mask rate
|
| 8 |
+
253,952,000 - 77 token samples with 20% mask rate
|
| 9 |
+
|
| 10 |
+
The model has learned to categorize certain masked patterns with their categories and special tokens.
|
| 11 |
+
|
| 12 |
+
<subject>
|
| 13 |
+
<subject1>
|
| 14 |
+
<subject2>
|
| 15 |
+
<pose>
|
| 16 |
+
<emotion>
|
| 17 |
+
<surface>
|
| 18 |
+
<lighting>
|
| 19 |
+
<material>
|
| 20 |
+
<accessory>
|
| 21 |
+
<footwear>
|
| 22 |
+
<upper_body_clothing>
|
| 23 |
+
<hair_style>
|
| 24 |
+
<hair_length>
|
| 25 |
+
<headwear>
|
| 26 |
+
<texture>
|
| 27 |
+
<pattern>
|
| 28 |
+
<grid>
|
| 29 |
+
<zone>
|
| 30 |
+
<offset>
|
| 31 |
+
<object_left>
|
| 32 |
+
<object_right>
|
| 33 |
+
<relation>
|
| 34 |
+
<intent>
|
| 35 |
+
<style>
|
| 36 |
+
<fabric>
|
| 37 |
+
<jewelry>
|
| 38 |
+
|
| 39 |
+
With the categorical shunts;
|
| 40 |
+
|
| 41 |
+
[SHUNT_1000000]
|
| 42 |
+
[SHUNT_1000001]
|
| 43 |
+
[SHUNT_1000002]
|
| 44 |
+
[SHUNT_1000003]
|
| 45 |
+
[SHUNT_1000004]
|
| 46 |
+
[SHUNT_1000005]
|
| 47 |
+
[SHUNT_1000006]
|
| 48 |
+
[SHUNT_1000007]
|
| 49 |
+
[SHUNT_1000008]
|
| 50 |
+
[SHUNT_1000009]
|
| 51 |
+
[SHUNT_1000010]
|
| 52 |
+
[SHUNT_1000011]
|
| 53 |
+
[SHUNT_1000012]
|
| 54 |
+
[SHUNT_1000013]
|
| 55 |
+
[SHUNT_1000014]
|
| 56 |
+
[SHUNT_1000015]
|
| 57 |
+
[SHUNT_1000016]
|
| 58 |
+
[SHUNT_1000017]
|
| 59 |
+
[SHUNT_1000018]
|
| 60 |
+
[SHUNT_1000019]
|
| 61 |
+
[SHUNT_1000020]
|
| 62 |
+
[SHUNT_1000021]
|
| 63 |
+
[SHUNT_1000022]
|
| 64 |
+
[SHUNT_1000023]
|
| 65 |
+
[SHUNT_1000024]
|
| 66 |
+
[SHUNT_1000025]
|
| 67 |
+
|
| 68 |
+
Each shunt meant to activate cross-categorical conceptualization within their 77 token window.
|