# Qwen3.5-Vision C++ Web App
A C++ wrapper project around a rich web app for Qwen3.5 and Qwen3-VL models, powered by `pybind11` and an embedded native C++ HTTP server (`httplib`).
It provides a modern GUI for:
- Image queries
- Image captioning
- Point extraction
- Object detection

## Build Instructions (Local)
```bash
mkdir build
cd build
cmake ..
cmake --build . --config Release
```
## Usage
```bash
# Run from the project root. These paths are the Windows (multi-config) build output;
# on Linux, as in the Colab steps below, the binary is build/visionapp.
# By default, runs the Qwen3_5ForConditionalGeneration model prithivMLmods/Qwen3.5-0.8B-Unredacted-MAX
./build/Release/visionapp.exe
# Or specify the model, architecture, and port
./build/Release/visionapp.exe --model_name Qwen/Qwen3-VL-2B-Instruct --arch Qwen3VLForConditionalGeneration --port 7860
```
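After launch, the server may not answer immediately (for example, while the model is still loading). The sketch below is one way to wait until the chosen port accepts TCP connections before opening the GUI in a browser. `wait_for_port` is a hypothetical helper, not part of this project, and it assumes the server is reachable on `127.0.0.1`; the port `7860` is the one used in the example above.

```python
import socket
import time


def wait_for_port(port: int, host: str = "127.0.0.1", timeout: float = 120.0) -> bool:
    """Return True once host:port accepts a TCP connection, False on timeout."""
    deadline = time.monotonic() + timeout
    while True:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=1.0):
                return True
        except OSError:
            if time.monotonic() >= deadline:
                return False
            time.sleep(0.5)  # Not up yet; retry shortly.
```

For instance, calling `wait_for_port(7860)` after starting `visionapp` with `--port 7860` returns `True` as soon as the port is open. It only checks that the port is open, not that the model has finished loading.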
## Running in Google Colab
This C++ application also runs in Google Colab, which already provides the essential tools (Python 3, GCC, and CMake).
1. Open a new [Google Colab Notebook](https://colab.research.google.com/).
2. Create a new code cell and paste the following block to clone, build, and run the project in one go:
```bash
# Clone the repository
!git clone https://github.com/PRITHIVSAKTHIUR/Qwen3.5-Vision.cpp.git
%cd Qwen3.5-Vision.cpp
# Install dependencies (Python dev headers & libraries)
!apt-get update && apt-get install -y python3-dev
!pip install pybind11 torch transformers pillow torchvision
# Build the project (-p keeps the cell re-runnable if build/ already exists)
!mkdir -p build
%cd build
!cmake ..
!cmake --build . --config Release
# Run the executable (replace the arguments with your own model details)
!./visionapp --model_name prithivMLmods/Qwen3.5-0.8B-Unredacted-MAX \
--arch Qwen3_5ForConditionalGeneration \
--port 7860
```
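Running the executable with `!` keeps the cell busy for as long as the server runs. As a rough sketch, the server could instead be started in the background from a Python cell so that later cells stay usable. `build_command` and `launch_in_background` are hypothetical helpers, not part of this project; the flags are the ones shown above, and the default `./visionapp` path assumes the working directory is `build`.

```python
import subprocess
from typing import List


def build_command(model_name: str, arch: str, port: int,
                  exe: str = "./visionapp") -> List[str]:
    """Assemble the argument list using the flags from the command above."""
    return [exe, "--model_name", model_name, "--arch", arch, "--port", str(port)]


def launch_in_background(cmd: List[str],
                         log_path: str = "visionapp.log") -> subprocess.Popen:
    """Start the process without blocking the cell; its output goes to log_path."""
    log = open(log_path, "w")
    # stderr is merged into stdout so all messages end up in one log file.
    return subprocess.Popen(cmd, stdout=log, stderr=subprocess.STDOUT)
```

In a cell, `proc = launch_in_background(build_command("prithivMLmods/Qwen3.5-0.8B-Unredacted-MAX", "Qwen3_5ForConditionalGeneration", 7860))` would start the server, and `proc.terminate()` would stop it. Reaching the port from a browser still requires Colab's port proxying or a tunnel, which this sketch does not cover.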