Qwen3.5-Vision C++ Web App

A C++ wrapper around a rich web app for Qwen3.5 and Qwen3-VL models, powered by pybind11 and an embedded native C++ HTTP server (httplib).

It provides a modern GUI for:

  • Image queries
  • Image captioning
  • Point extraction
  • Object detection

Build Instructions (Local)

mkdir build
cd build
cmake ..
cmake --build . --config Release

Usage

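# Paths are relative to the project root. The Release/ subdirectory and .exe suffix
# apply to Windows multi-config builds; on Linux or macOS the binary is typically build/visionapp.

# With no arguments, runs the default model prithivMLmods/Qwen3.5-0.8B-Unredacted-MAX
# (architecture: Qwen3_5ForConditionalGeneration)
./build/Release/visionapp.exe

# Or specify a model, architecture, and port
./build/Release/visionapp.exe --model_name Qwen/Qwen3-VL-2B-Instruct --arch Qwen3VLForConditionalGeneration --port 7860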
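
For readers curious how the `--model_name`, `--arch`, and `--port` flags could map onto defaults, here is a minimal sketch using only the standard library. It is not the project's actual parser: `AppOptions` and `parse_args` are hypothetical names, and the default port of 7860 is an assumption taken from the example command above (the README only states the default model and architecture).

```cpp
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical options struct. The model and architecture defaults come from
// the README; the default port is an assumption based on the example command.
struct AppOptions {
    std::string model_name = "prithivMLmods/Qwen3.5-0.8B-Unredacted-MAX";
    std::string arch = "Qwen3_5ForConditionalGeneration";
    int port = 7860;  // assumed default
};

// Parses "--flag value" pairs (program name already stripped).
// Throws std::invalid_argument for unknown flags, missing values, or a port
// that is non-numeric or outside 1..65535; std::stoi may also throw
// std::out_of_range for values that do not fit in an int.
AppOptions parse_args(const std::vector<std::string>& args) {
    AppOptions opts;
    for (std::size_t i = 0; i < args.size(); ++i) {
        const std::string& flag = args[i];
        if (flag != "--model_name" && flag != "--arch" && flag != "--port") {
            throw std::invalid_argument("unknown flag: " + flag);
        }
        if (i + 1 >= args.size()) {
            throw std::invalid_argument("missing value for " + flag);
        }
        const std::string& value = args[++i];  // consume the flag's value
        if (flag == "--model_name") {
            opts.model_name = value;
        } else if (flag == "--arch") {
            opts.arch = value;
        } else {
            std::size_t used = 0;
            const int port = std::stoi(value, &used);
            if (used != value.size() || port < 1 || port > 65535) {
                throw std::invalid_argument("invalid port: " + value);
            }
            opts.port = port;
        }
    }
    return opts;
}
```

From `main`, this would be called as `parse_args(std::vector<std::string>(argv + 1, argv + argc))`. Rejecting unknown flags up front, rather than ignoring them, makes a mistyped `--arch` fail fast instead of silently loading the default architecture.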

Running in Google Colab

You can easily run this C++ application in Google Colab. Colab already provides the essential tools (Python 3, GCC, and CMake).

  1. Open a new Google Colab Notebook.
  2. Create a new code cell and paste the following block to clone, build, and run the project all at once:
# Clone the repository
!git clone https://github.com/PRITHIVSAKTHIUR/Qwen3.5-Vision.cpp.git
%cd Qwen3.5-Vision.cpp

# Install dependencies (Python dev headers & libraries)
!apt-get update && apt-get install -y python3-dev
!pip install pybind11 torch transformers pillow torchvision

# Build the project
!mkdir build
%cd build
!cmake ..
!cmake --build . --config Release

# Run the executable (replace arguments with your specific model details)
!./visionapp --model_name prithivMLmods/Qwen3.5-0.8B-Unredacted-MAX \
             --arch Qwen3_5ForConditionalGeneration \
             --port 7860
