About

This repository contains precomputed feature-extraction matrices (vocabulary embeddings) designed for Sigma-Captioner. Because the vocabulary is encoded once and cached, image-to-text alignment and similarity search can run without re-encoding the entire vocabulary on every run.

The files are not tied to a single project, but they are recommended for use with https://github.com/uninterruptedpowersupply3-NEW/Sigma-Captioner/tree/SGLANG

📊 Dataset Sources

📂 Directory Structure & Usage

The embeddings are split into two versions based on reliability:

Path         Contents                 Recommendation
root/        vocab_hybrid_matrix.pt   Base Use: Stable and recommended for general tasks.
DANANDENG/   vocab_hybrid_meta.pt     Experimental: Hybrid Anime + English merge.

Experimental Warning: The Anime tags in the DANANDENG directory are prone to hallucinations regarding specific characters. Use the root files for production/standard tagging and reserve the DANANDENG files for experimental research.

  • Model: Qwen/Qwen3-VL-Embedding-2B
  • Format: PyTorch Tensor (.pt) + JSON Metadata
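
The similarity search described above can be sketched roughly as follows. This is a minimal illustration, not the Sigma-Captioner implementation. It assumes that a matrix file holds a single 2-D tensor of shape (vocab_size, dim) and that a list of labels, one per row, is available from the metadata. The repository does not document either layout, so both are assumptions. The helper names load_matrix and top_k_matches are hypothetical, and a toy matrix stands in for the real cache.

```python
import torch
import torch.nn.functional as F


def load_matrix(path: str) -> torch.Tensor:
    """Load a cached embedding matrix onto the CPU.

    Assumption: the .pt file stores one 2-D tensor (vocab_size, dim).
    weights_only=True avoids unpickling arbitrary objects from a download.
    """
    matrix = torch.load(path, map_location="cpu", weights_only=True)
    return matrix.float()


def top_k_matches(query, vocab_matrix, labels, k=3):
    """Return the k labels whose cached embeddings are closest to the query."""
    # L2-normalize both sides so the dot product equals cosine similarity.
    q = F.normalize(query.float().reshape(1, -1), dim=-1)
    m = F.normalize(vocab_matrix.float(), dim=-1)
    scores = (q @ m.T).squeeze(0)
    k = min(k, scores.numel())
    values, indices = torch.topk(scores, k)
    return [(labels[i], v) for v, i in zip(values.tolist(), indices.tolist())]


# Toy stand-in for the cached vocabulary (hypothetical labels and vectors).
toy_labels = ["cat", "dog", "tree"]
toy_matrix = torch.tensor([[1.0, 0.0], [0.8, 0.6], [0.0, 1.0]])

# In practice the query would be an image embedding from the same model.
toy_query = torch.tensor([2.0, 0.1])
results = top_k_matches(toy_query, toy_matrix, toy_labels, k=2)
print(results)
```

With the real files, the toy matrix would be replaced by something like load_matrix("vocab_hybrid_matrix.pt"), and the query by an image embedding produced with Qwen/Qwen3-VL-Embedding-2B so that both sides live in the same vector space.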