spikefly commited on
Commit
683f092
Β·
0 Parent(s):

Initial placeholder model card

Browse files
Files changed (2) hide show
  1. .gitattributes +35 -0
  2. README.md +98 -0
.gitattributes ADDED
@@ -0,0 +1,35 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,98 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ library_name: pytorch
4
+ pipeline_tag: robotics
5
+ tags:
6
+ - robotics
7
+ - vision-language-action
8
+ - vla
9
+ - bridge
10
+ - widowx
11
+ - simpler-env
12
+ - manipulation
13
+ - qwen-vl
14
+ ---
15
+
16
+ # SemanticVLA Β· Bridge (SimplerEnv WidowX)
17
+
18
+ > 🚧 **Placeholder.** The URL is stable; checkpoints will be uploaded incrementally per the [release roadmap](https://github.com/Fei-Ni/SemanticVLA_Offcial/blob/main/docs/ROADMAP.md).
19
+
20
+ [SemanticVLA](https://github.com/Fei-Ni/SemanticVLA_Offcial) finetuned on [BridgeData V2](https://rail-berkeley.github.io/bridgedata/), targeting [SimplerEnv](https://github.com/simpler-env/SimplerEnv) WidowX evaluation (`widowx_spoon_on_towel` / `widowx_carrot_on_plate` / `widowx_stack_cube` / `widowx_put_eggplant_in_basket`).
21
+
22
+ ## Configuration
23
+
24
+ | Field | Value |
25
+ |---|---|
26
+ | Backbone | Qwen3VL-4B (Qwen3VL-GR00T-Bridge-RT-1 init) |
27
+ | Action head | GR00T-style flow-matching expert |
28
+ | Semantic output | `trace_latent` (trace + LAM latent-action token), `none` injection |
29
+ | LM loss weight | 0.10 |
30
+ | Action horizon | 16 |
31
+ | LAM tokenizer | [`SemanticVLA-LAM` β†’ `oxe-bridge-only/v4-step16k`](https://huggingface.co/spikefly/SemanticVLA-LAM) |
32
+ | Training data | `bridge_orig_1.0.0_lerobot` (with dense trace labels via OXE NPY index) |
33
+ | Target | 100,000 steps |
34
+
35
+ ## Headline result
36
+
37
+ SimplerEnv WidowX numbers will be filled in here once the 100k-step training and the 24-episodes-per-task evaluation complete. Training is in flight on Isambard; see the [code repo](https://github.com/Fei-Ni/SemanticVLA_Offcial) for the latest training metrics.
38
+
39
+ ## Planned layout
40
+
41
+ ```
42
+ SemanticVLA-Bridge/
43
+ β”œβ”€β”€ tl-none-lw010-step100k/
44
+ β”‚ β”œβ”€β”€ pytorch_model.pt
45
+ β”‚ β”œβ”€β”€ config.yaml
46
+ β”‚ └── model_card.md
47
+ └── README.md
48
+ ```
49
+
50
+ ## Sibling SemanticVLA checkpoint repos
51
+
52
+ | Repo | Purpose |
53
+ |---|---|
54
+ | πŸ€— [`SemanticVLA-LAM`](https://huggingface.co/spikefly/SemanticVLA-LAM) | LAM tokenizers used by this VLA |
55
+ | πŸ€— [`SemanticVLA-LIBERO`](https://huggingface.co/spikefly/SemanticVLA-LIBERO) | LIBERO-finetuned VLA |
56
+
57
+ ## Related resources
58
+
59
+ - **Code**: https://github.com/Fei-Ni/SemanticVLA_Offcial
60
+ - **Datasets Β· Bridge subset**: https://huggingface.co/datasets/spikefly/SemanticVLA-Bridge-LeRobot
61
+ - **Datasets Β· all**: https://hf.co/collections/spikefly/semanticvla-datasets
62
+ - **Collection Β· Model Zoo**: https://hf.co/collections/spikefly/semanticvla-model-zoo
63
+
64
+ ## How to load (placeholder API)
65
+
66
+ ```python
67
+ from huggingface_hub import hf_hub_download
68
+ import torch
69
+
70
+ ckpt = hf_hub_download(
71
+ repo_id="spikefly/SemanticVLA-Bridge",
72
+ filename="tl-none-lw010-step100k/pytorch_model.pt",
73
+ )
74
+ state = torch.load(ckpt, map_location="cpu")
75
+ # loader will be released with the code repo
76
+ ```
77
+
78
+ ## Citation
79
+
80
+ ```bibtex
81
+ @misc{semanticvla2026,
82
+ title = {SemanticVLA: Semantic Vision-Language-Action Models with Trace-Conditioned Latent Action Tokens},
83
+ author = {SemanticVLA Authors},
84
+ year = {2026},
85
+ url = {https://github.com/Fei-Ni/SemanticVLA_Offcial}
86
+ }
87
+
88
+ @inproceedings{walke2023bridgedata,
89
+ title = {BridgeData V2: A Dataset for Robot Learning at Scale},
90
+ author = {Walke, Homer and Black, Kevin and others},
91
+ booktitle = {Conference on Robot Learning (CoRL)},
92
+ year = {2023}
93
+ }
94
+ ```
95
+
96
+ ## License
97
+
98
+ Released under the [MIT License](https://github.com/Fei-Ni/SemanticVLA_Offcial/blob/main/LICENSE).