
	# Qwen3.5-Vision C++ Web App

	A C++ wrapper project around a rich web app for Qwen3.5 and Qwen3-VL models, powered by `pybind11` and an embedded native C++ HTTP server (`httplib`).

	It provides a modern GUI for:
	- Image queries
	- Image captioning
	- Point extraction
	- Object detection


	![1](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/b8ERcd1r5dRgInumQfsUu.png)


	## Build Instructions (Local)

	```bash
	mkdir build
	cd build
	cmake ..
	cmake --build . --config Release
	```

	## Usage

	```bash
	# Run from the project root. With no arguments, the app loads the default
	# Qwen3_5ForConditionalGeneration model: prithivMLmods/Qwen3.5-0.8B-Unredacted-MAX
	./build/Release/visionapp.exe

	# Or specify the model, architecture, and port explicitly
	./build/Release/visionapp.exe --model_name Qwen/Qwen3-VL-2B-Instruct --arch Qwen3VLForConditionalGeneration --port 7860
	```
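
	To make the flag semantics above concrete, here is a minimal, standard-library-only sketch of how `--model_name`, `--arch`, and `--port` could be parsed. It is not taken from the project's source: `AppOptions` and `parse_options` are hypothetical names, and the default port of 7860 is an assumption based on the example command (only the default model and architecture are stated above).

	```cpp
	#include <cstddef>
	#include <stdexcept>
	#include <string>
	#include <vector>

	// Hypothetical option holder. Model and architecture defaults follow the
	// usage notes; the default port is an assumption, not a documented value.
	struct AppOptions {
	    std::string model_name = "prithivMLmods/Qwen3.5-0.8B-Unredacted-MAX";
	    std::string arch = "Qwen3_5ForConditionalGeneration";
	    int port = 7860;
	};

	// Parses "--flag value" pairs. Throws std::invalid_argument on an unknown
	// flag, a missing value, or a port that is not an integer in 1..65535.
	AppOptions parse_options(const std::vector<std::string>& args) {
	    AppOptions opts;
	    for (std::size_t i = 0; i < args.size(); i += 2) {
	        const std::string& flag = args[i];
	        if (i + 1 >= args.size()) {
	            throw std::invalid_argument("missing value for " + flag);
	        }
	        const std::string& value = args[i + 1];
	        if (flag == "--model_name") {
	            opts.model_name = value;
	        } else if (flag == "--arch") {
	            opts.arch = value;
	        } else if (flag == "--port") {
	            std::size_t used = 0;
	            int port = 0;
	            try {
	                port = std::stoi(value, &used);
	            } catch (const std::exception&) {
	                throw std::invalid_argument("invalid port: " + value);
	            }
	            // Reject trailing characters ("80abc") and out-of-range ports.
	            if (used != value.size() || port < 1 || port > 65535) {
	                throw std::invalid_argument("invalid port: " + value);
	            }
	            opts.port = port;
	        } else {
	            throw std::invalid_argument("unknown flag: " + flag);
	        }
	    }
	    return opts;
	}
	```

	In a `main(int argc, char** argv)`, the arguments could be passed as `parse_options(std::vector<std::string>(argv + 1, argv + argc))`. Validating the port up front means a bad value fails with a clear message before any model is loaded.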

	## Running in Google Colab

	This C++ application also runs in Google Colab, which already provides the essential tools (Python 3, GCC, and CMake).

	1. Open a new [Google Colab Notebook](https://colab.research.google.com/).
	2. Create a new code cell and paste the following block to clone, build, and run the project all at once:

	```bash
	# Clone the repository
	!git clone https://github.com/PRITHIVSAKTHIUR/Qwen3.5-Vision.cpp.git
	%cd Qwen3.5-Vision.cpp

	# Install dependencies (Python dev headers & libraries)
	!apt-get update && apt-get install -y python3-dev
	!pip install pybind11 torch transformers pillow torchvision

	# Build the project
	!mkdir build
	%cd build
	!cmake .. -DCMAKE_BUILD_TYPE=Release
	!cmake --build . --config Release

	# Run the executable (replace arguments with your specific model details)
	!./visionapp --model_name prithivMLmods/Qwen3.5-0.8B-Unredacted-MAX \
	--arch Qwen3_5ForConditionalGeneration \
	--port 7860
	```
