AbstractPhil commited on
Commit
0f9c5ea
·
verified ·
1 Parent(s): 34ce035

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +66 -0
README.md CHANGED
@@ -4,6 +4,72 @@ base_model:
4
  - nomic-ai/nomic-bert-2048
5
  ---
6
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
7
  # Release - bert-beatrix-2048 v1
8
 
9
  Entirely saturated pretrained masking window, fixated on expanding the masking potential using subject and shunt allocation tokenization systems.
 
4
  - nomic-ai/nomic-bert-2048
5
  ---
6
 
7
+
8
+
9
+
10
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/630cf55b15433862cfc9556f/6iTdKxUiUtzm5MWoZpOdM.png)
11
+
12
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/630cf55b15433862cfc9556f/nffcaOzBsC0ymG80NWudD.png)
13
+
14
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/630cf55b15433862cfc9556f/OKE94fmGlfFhRAKUpubrR.png)
15
+
16
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/630cf55b15433862cfc9556f/gLiYYH9nkvpXyDLF2ssGF.png)
17
+
18
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/630cf55b15433862cfc9556f/tl0bDNnFL_kdSDnhTMf56.png)
19
+
20
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/630cf55b15433862cfc9556f/kz4J3YTnBOaySdgf3-rzp.png)
21
+
22
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/630cf55b15433862cfc9556f/1KEiW_GS5l-BTt4VYb_Tt.png)
23
+
24
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/630cf55b15433862cfc9556f/YopKBUJoKPJVUXSgdl5mX.png)
25
+
26
+
27
+ SEMANTIC TOKEN STATISTICS:
28
+ Average similarity between tokens: 0.232
29
+ Std dev of similarities: 0.043
30
+ Max similarity: 0.368
31
+ Min similarity: 0.082
32
+
33
+ Most similar token pairs:
34
+ <intent> ↔ <style>: 0.368
35
+ <hair_style> ↔ <hair_length>: 0.336
36
+ <grid> ↔ <fabric>: 0.316
37
+ <footwear> ↔ <jewelry>: 0.315
38
+ <grid> ↔ <offset>: 0.315
39
+
40
+
41
+ SHUNT TOKEN STATISTICS:
42
+ Average distance between shunts: 1.146
43
+ Std dev of distances: 0.040
44
+ Min distance: 1.056
45
+ Max distance: 1.262
46
+
47
+
48
+ CATEGORY ANALYSIS:
49
+ Subject/Object:
50
+ Tokens: 5
51
+ Avg within-category similarity: 0.234
52
+ Appearance:
53
+ Tokens: 4
54
+ Avg within-category similarity: 0.260
55
+ Clothing:
56
+ Tokens: 5
57
+ Avg within-category similarity: 0.280
58
+ Material/Texture:
59
+ Tokens: 5
60
+ Avg within-category similarity: 0.251
61
+ Spatial/Style:
62
+ Tokens: 7
63
+ Avg within-category similarity: 0.270
64
+
65
+
66
+ DIMENSIONALITY ANALYSIS:
67
+ Variance explained by first 10 PCs: 37.6%
68
+ Components needed for 90% variance: 1
69
+
70
+ ============================================================
71
+
72
+
73
  # Release - bert-beatrix-2048 v1
74
 
75
  Entirely saturated pretrained masking window, fixated on expanding the masking potential using subject and shunt allocation tokenization systems.