Image Feature Extraction
Transformers
Safetensors
English
Chinese
mingtok
visual-tokenizer
feature-extraction
image-reconstruction
autoregressive
Instructions to use inclusionAI/MingTok-Vision with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use inclusionAI/MingTok-Vision with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("image-feature-extraction", model="inclusionAI/MingTok-Vision")# Load model directly from transformers import MingTok model = MingTok.from_pretrained("inclusionAI/MingTok-Vision", dtype="auto") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -1,6 +1,6 @@
|
|
| 1 |
## MingTok: A Unified Tokenizer for Visual Understanding and Generation without Vector Quantization
|
| 2 |
|
| 3 |
-
<p align="center">📑 <a href="">Technical Report</a> | 📖 <a href="https://inclusionai.github.io/blog/mingtok/">Project Page</a> | 🤗 <a href="https://huggingface.co/inclusionAI/MingTok-Vision">Hugging Face</a> | 🤖 <a href="https://modelscope.cn/models/inclusionAI/MingTok-Vision">ModelScope</a> | 💾 <a href="https://github.com/inclusionAI/Ming-UniVision">GitHub</a></p>
|
| 4 |
|
| 5 |
## Key Features
|
| 6 |
- 🖼️ **First Continuous Unified Vision Tokenizer:** MingTok enables unified vision understanding and generation via a continuous latent space, eliminating quantization while preserving semantic and perceptual fidelity.
|
|
|
|
| 1 |
## MingTok: A Unified Tokenizer for Visual Understanding and Generation without Vector Quantization
|
| 2 |
|
| 3 |
+
<p align="center">📑 <a href="https://inclusionai.github.io/blog/mingtok/">Technical Report</a> | 📖 <a href="https://inclusionai.github.io/blog/mingtok/">Project Page</a> | 🤗 <a href="https://huggingface.co/inclusionAI/MingTok-Vision">Hugging Face</a> | 🤖 <a href="https://modelscope.cn/models/inclusionAI/MingTok-Vision">ModelScope</a> | 💾 <a href="https://github.com/inclusionAI/Ming-UniVision">GitHub</a></p>
|
| 4 |
|
| 5 |
## Key Features
|
| 6 |
- 🖼️ **First Continuous Unified Vision Tokenizer:** MingTok enables unified vision understanding and generation via a continuous latent space, eliminating quantization while preserving semantic and perceptual fidelity.
|