Buckets:
Qwen3.5-Vision C++ Web App
A C++ project wrapper around a rich Web App for Qwen3.5 and Qwen3-VL models. Powered by pybind11 and an embedded native C++ HTTP server (httplib).
It provides a modern GUI for:
- Image queries
- Image captioning
- Point extraction
- Object detection
Build Instructions (Local)
mkdir build
cd build
cmake ..
cmake --build . --config Release
Usage
# By default runs Qwen3_5ForConditionalGeneration model: prithivMLmods/Qwen3.5-0.8B-Unredacted-MAX
./build/Release/visionapp.exe
# Or use specific architectures
./build/Release/visionapp.exe --model_name Qwen/Qwen3-VL-2B-Instruct --arch Qwen3VLForConditionalGeneration --port 7860
Running in Google Colab
You can easily run this C++ application in Google Colab. Colab already provides the essential tools (Python 3, GCC, and CMake).
- Open a new Google Colab Notebook.
- Create a new code cell and paste the following block to clone, build, and run the project all at once:
# Clone the repository
!git clone https://github.com/PRITHIVSAKTHIUR/Qwen3.5-Vision.cpp.git
%cd Qwen3.5-Vision.cpp
# Install dependencies (Python dev headers & libraries)
!apt-get update && apt-get install -y python3-dev
!pip install pybind11 torch transformers pillow torchvision
# Build the project
!mkdir build
%cd build
!cmake ..
!cmake --build . --config Release
# Run the executable (replace arguments with your specific model details)
!./visionapp --model_name prithivMLmods/Qwen3.5-0.8B-Unredacted-MAX \
--arch Qwen3_5ForConditionalGeneration \
--port 7860
Xet Storage Details
- Size:
- 1.82 kB
- Xet hash:
- c82eef7a04821ba78f29a1598e179043b899061cddcb88944ae365698fa9ae09
·
Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.
