GOD-IAM commited on
Commit
b466016
·
1 Parent(s): feeaa5d

Release vil-encoder-v1.2 (GVL-P v1.2, canon-locked)

Browse files
Files changed (3) hide show
  1. README.md +95 -0
  2. config.json +7 -0
  3. vil-encoder-v1.2.pt +3 -0
README.md ADDED
@@ -0,0 +1,95 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - vision
5
+ - encoder
6
+ - multimodal
7
+ - self-supervised
8
+ - video
9
+ - execution
10
+ - symbolic
11
+ library_name: pytorch
12
+ pipeline_tag: feature-extraction
13
+ datasets:
14
+ - Nine1Eight/vil-canonical-glyph-system
15
+ ---
16
+
17
+ # VIL Encoder v1.2 (GVL-P)
18
+
19
+ **VIL Encoder v1.2** is a glyphmatic vision encoder trained using
20
+ **GVL-P (Glyphmatic Video-Language Pretraining) v1.2**.
21
+
22
+ This model learns **temporal execution structure** from canonical glyph
23
+ sequences derived from text, code, binaries, and other data.
24
+
25
+ > ⚠️ This model does **not tokenize language**.
26
+ > All inputs are compiled into a **canonical glyph IR (base-111)**.
27
+
28
+ ---
29
+
30
+ ## Architecture
31
+
32
+ - **Vision Encoder:** GlyphVisionEncoder
33
+ - **Temporal Head:** TemporalGlyphTransformer
34
+ - **Embedding Dimension:** 768
35
+ - **Canon Size:** 111
36
+ - **Deterministic:** Yes
37
+
38
+ ---
39
+
40
+ ## Training (GVL-P v1.2)
41
+
42
+ Training is **fully self-supervised**:
43
+
44
+ 1. Arbitrary input (text, code, binary)
45
+ 2. Deterministic compilation → glyph indices
46
+ 3. Sliding temporal windows
47
+ 4. Next-step temporal consistency objective
48
+
49
+ No labels, captions, or annotations were used.
50
+
51
+ ---
52
+
53
+ ## Intended Use
54
+
55
+ - Execution-aware embeddings
56
+ - Vision–language research
57
+ - Glyph-based reasoning systems
58
+ - Multimodal IR experiments
59
+
60
+ This is **not** a language model.
61
+
62
+ ---
63
+
64
+ ## Limitations
65
+
66
+ - Requires canonical glyph compilation
67
+ - No text generation
68
+ - No decoding or execution
69
+
70
+ ---
71
+
72
+ ## Weights
73
+
74
+ File:
75
+ vil-encoder-v1.2.pt
76
+ Checkpoint contains:
77
+ - `vision_encoder`
78
+ - `temporal_head`
79
+ - `embed_dim`
80
+ - `canon_size`
81
+ - `gvlp_version = 1.2`
82
+
83
+ ---
84
+
85
+ ## Relationship to VIL
86
+
87
+ Canonical dataset:
88
+ https://huggingface.co/datasets/Nine1Eight/vil-canonical-glyph-system
89
+
90
+ ---
91
+
92
+ ## Author
93
+
94
+ Matthew Blake Ward (Nine1Eight)
95
+ Tulsa, Oklahoma, USA
config.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "model_type": "vil-glyph-encoder",
3
+ "embed_dim": 768,
4
+ "canon_size": 111,
5
+ "gvlp_version": "1.2",
6
+ "deterministic": true
7
+ }
vil-encoder-v1.2.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a3cacca9c36226f6f2de305d7a6b06e50f499123ec86dfeb5ac4fc70876463c3
3
+ size 113431045