Spaces:
Sleeping
Sleeping
Update README.md
Browse files
README.md
CHANGED
|
@@ -150,4 +150,12 @@ it wan't successfull, the model couln't predict some handwritten recipes, and so
|
|
| 150 |
we decided to try using: Qwen2.5-VL Vision-Language Model and the results were much better!
|
| 151 |
Comparison:
|
| 152 |
|
| 153 |
-

|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 150 |
we decided to try using: Qwen2.5-VL Vision-Language Model and the results were much better!
|
| 151 |
Comparison:
|
| 152 |
|
| 153 |
+

|
| 154 |
+
|
| 155 |
+
|
| 156 |
+
## Challenges:
|
| 157 |
+
- Synthetic Data: Creating a synthetic dataset that accurately mimicked real-world distributions was difficult. We had to carefully tune the generation parameters to ensure the data was diverse and reliable for training.
|
| 158 |
+
|
| 159 |
+
- Hugging Face Deployment & Optimization:
|
| 160 |
+
The Issue: We struggled initially with configuring the Hugging Face environment; the model execution was too slow for a real-time application.
|
| 161 |
+
The Solution: After investigating various deployment strategies, we shifted to using the Hugging Face Inference Client. This transition optimized our inference pipeline, allowing the model to run within acceptable time limits and ensuring a smooth user experience.
|