Add link to Demo for VL and CV tasks to README

#7
Files changed (1)
  1. README.md +31 -0
README.md CHANGED
@@ -127,6 +127,37 @@ generated_text = outputs[0]
  print(f"Youtu-VL output: {generated_text}")
  ```
 
+ ### Demo for VL and CV tasks
+
+ A quick-start demo covering both VL and CV tasks is available as a [Jupyter notebook](https://github.com/TencentCloudADP/youtu-vl/blob/main/demo/demo.ipynb).
+
+ The core of the demo is the three lines below:
+
+ ```python
+ model_path = "tencent/Youtu-VL-4B-Instruct"
+ youtu_vl = YoutuVL(model_path)
+ response = youtu_vl(prompt, img_path, seg_mode=seg_mode)
+ ```
+
+ ### Qualitative Results
+ * **Task: Grounding**
+ > **Prompt:** Please provide the bounding box coordinate of the region this sentence describes: a black and white cat sitting on the edge of the bathtub
+ >
+ > <img src="https://cdn-uploads.huggingface.co/production/uploads/656312995475849b82c38bc4/349v2vYasfz4GtF_T_D09.png" width="300px">
+
+ * **Task: Object Detection**
+ > **Prompt:** Detect all objects in the provided image.
+ >
+ > <img src="https://cdn-uploads.huggingface.co/production/uploads/656312995475849b82c38bc4/rUJ6PzIjGJWwK4e9owPlY.png" width="300px">
+
+ * **Task: Referring Segmentation**
+ > **Prompt:** Can you segment "hotdog on left" in this image?
+ >
+ > <img src="https://cdn-uploads.huggingface.co/production/uploads/656312995475849b82c38bc4/K-5UG6HSLb28UFGx2pdPX.png" width="300px">
+
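The three prompts above differ only in their wording and in whether they name a target region, so they could be generated from one small helper. The sketch below is illustrative and is not part of this PR or of the demo notebook: `PROMPT_TEMPLATES`, the task keys, and `build_prompt` are hypothetical names, and only the prompt strings themselves are taken from the examples above.

```python
# Hypothetical helper: builds the prompt strings shown in the qualitative examples.
# The task keys and function name are assumptions made for illustration only.
PROMPT_TEMPLATES = {
    "grounding": (
        "Please provide the bounding box coordinate of the region "
        "this sentence describes: {target}"
    ),
    "detection": "Detect all objects in the provided image.",
    "referring_segmentation": 'Can you segment "{target}" in this image?',
}


def build_prompt(task: str, target: str = "") -> str:
    """Return the prompt for `task`, filling in `target` when the template needs one."""
    if task not in PROMPT_TEMPLATES:
        raise ValueError(f"unknown task: {task!r}")
    template = PROMPT_TEMPLATES[task]
    if "{target}" in template and not target:
        raise ValueError(f"task {task!r} requires a target description")
    return template.format(target=target)


# Intended use with the wrapper from the demo notebook (not defined here):
#   prompt = build_prompt("referring_segmentation", "hotdog on left")
#   response = youtu_vl(prompt, img_path, seg_mode=seg_mode)
# The accepted values of seg_mode are not documented in this README,
# so the helper deliberately leaves that argument to the caller.
```

Keeping the templates in one dictionary makes it easy to compare the wording against the notebook if the recommended prompts change.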
+ For more examples, please refer to the paper and the Jupyter notebooks.
+
+
  ## 🎉 Citation
 
  If you find our work useful in your research, please consider citing our paper: