Add model card for ViFailback-8B

#1
by nielsr HF Staff - opened
Files changed (1)
  1. README.md +37 -0
README.md ADDED
@@ -0,0 +1,37 @@
+ ---
+ pipeline_tag: image-text-to-text
+ library_name: transformers
+ ---
+
+ # ViFailback-8B
+
+ ViFailback-8B is a Vision-Language Model (VLM) designed to diagnose robotic manipulation failures and provide both textual and visual correction guidance. It is fine-tuned from Qwen3-VL-8B-Instruct as part of the ViFailback framework.
+
+ The model utilizes explicit visual symbols (arrows, crosshairs, state icons) to bridge the gap between failure diagnosis and policy correction, allowing robotic systems to learn from and recover from real-world failures.
+
+ - **Paper:** [Diagnose, Correct, and Learn from Manipulation Failures via Visual Symbols](https://arxiv.org/abs/2512.02787)
+ - **Project Page:** [https://x1nyuzhou.github.io/vifailback.github.io/](https://x1nyuzhou.github.io/vifailback.github.io/)
+ - **Repository:** [https://github.com/x1nyuzhou/ViFailback](https://github.com/x1nyuzhou/ViFailback)
+
+ ## Usage
+
+ To run inference and render the corrective visual symbols, use the `vifailback_infer.py` script provided in the [official GitHub repository](https://github.com/x1nyuzhou/ViFailback):
+
+ ```bash
+ python vifailback_infer.py \
+   --model_path sii-rhos-ai/ViFailback-8B \
+   --json_path ./examples/example_vifailback_infer.json \
+   --dataset_root /path/to/ViFailback-Dataset \
+   --output_dir ./inference_visualizations
+ ```
+
+ ## Citation
+
+ ```bibtex
+ @article{zeng2025diagnose,
+   title={Diagnose, Correct, and Learn from Manipulation Failures via Visual Symbols},
+   author={Zeng, Xianchao and Zhou, Xinyu and Li, Youcheng and Shi, Jiayou and Li, Tianle and Chen, Liangming and Ren, Lei and Li, Yong-Lu},
+   journal={arXiv preprint arXiv:2512.02787},
+   year={2025}
+ }
+ ```
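
For anyone trying the Usage command from the README above, the following is a minimal sketch of a Python wrapper that assembles the same invocation and checks local inputs before launching. It is not part of the PR or the ViFailback repository. The four flag names and the example values are copied from the README; the helper names `build_infer_command` and `missing_inputs` are hypothetical, and the sketch assumes the script is run from a checkout of the repository so that `vifailback_infer.py` is in the working directory.

```python
import shlex
from pathlib import Path

# Script name and flag names are copied from the README's Usage command.
# Everything else in this wrapper is a hypothetical convenience layer.
SCRIPT = "vifailback_infer.py"


def build_infer_command(model_path, json_path, dataset_root, output_dir, python="python"):
    """Return the argv list equivalent to the README's inference command."""
    return [
        python, SCRIPT,
        "--model_path", str(model_path),
        "--json_path", str(json_path),
        "--dataset_root", str(dataset_root),
        "--output_dir", str(output_dir),
    ]


def missing_inputs(json_path, dataset_root, script=SCRIPT):
    """Name the local inputs that do not exist, so a run can fail early."""
    checks = {
        "script": Path(script),
        "json_path": Path(json_path),
        "dataset_root": Path(dataset_root),
    }
    return [name for name, path in checks.items() if not path.exists()]


if __name__ == "__main__":
    json_path = "./examples/example_vifailback_infer.json"
    dataset_root = "/path/to/ViFailback-Dataset"  # placeholder from the README
    cmd = build_infer_command(
        "sii-rhos-ai/ViFailback-8B",  # passed through unchanged, as in the README
        json_path,
        dataset_root,
        "./inference_visualizations",
    )
    missing = missing_inputs(json_path, dataset_root)
    if missing:
        print("Missing before launch:", ", ".join(missing))
    else:
        # Print the command; hand `cmd` to subprocess.run to execute it.
        print(shlex.join(cmd))
```

The model identifier is not checked locally because the README passes it as `sii-rhos-ai/ViFailback-8B` rather than as a file path; how the script resolves it is left to the script.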