How to use wisdomik/QuiltNet-B-16-PMB with OpenCLIP:
    import open_clip

    model, preprocess_train, preprocess_val = open_clip.create_model_and_transforms('hf-hub:wisdomik/QuiltNet-B-16-PMB')
    tokenizer = open_clip.get_tokenizer('hf-hub:wisdomik/QuiltNet-B-16-PMB')
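Once the model and tokenizer are loaded with OpenCLIP, zero-shot image classification can be sketched roughly as follows. This is a minimal sketch, not the authors' evaluation code: the function name `zero_shot_classify`, the prompt template, the image path, and the labels are all hypothetical, and it assumes `torch`, `open_clip`, and `Pillow` are installed.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def zero_shot_classify(image_path, labels,
                       model_id="hf-hub:wisdomik/QuiltNet-B-16-PMB"):
    """Return {label: probability} for a single image (illustrative helper)."""
    # Heavy dependencies are imported lazily so `softmax` works without them.
    import torch
    import open_clip
    from PIL import Image

    model, _, preprocess_val = open_clip.create_model_and_transforms(model_id)
    tokenizer = open_clip.get_tokenizer(model_id)
    model.eval()

    # Hypothetical prompt template; it is not taken from the model card.
    prompts = [f"a histopathology image of {label}" for label in labels]

    image = preprocess_val(Image.open(image_path).convert("RGB")).unsqueeze(0)
    text = tokenizer(prompts)

    with torch.no_grad():
        image_features = model.encode_image(image)
        text_features = model.encode_text(text)
        # L2-normalize so the dot product is a cosine similarity.
        image_features = image_features / image_features.norm(dim=-1, keepdim=True)
        text_features = text_features / text_features.norm(dim=-1, keepdim=True)
        # Scale similarities by the model's learned temperature.
        logits = (model.logit_scale.exp() * image_features @ text_features.T)[0]

    return dict(zip(labels, softmax(logits.tolist())))
```

A call might look like `zero_shot_classify("patch.png", ["adenocarcinoma", "benign tissue"])`, where both the file name and the label set are placeholders. The inference transform `preprocess_val` is used rather than `preprocess_train`, since the latter applies training-time augmentation.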
Update README.md
README.md CHANGED
@@ -27,7 +27,7 @@ pipeline_tag: zero-shot-image-classification
 
 
 ## QuiltNet-B-16-PMB Description
-QuiltNet-B-32/PMB is a ViT-B/16 image tower and PubMedBERT text tower vision-language foundation model trained on the [Quilt-1M](https://quilt1m.github.io/) dataset curated from representative histopathology videos.
+[QuiltNet-B-32/PMB](https://github.com/wisdomikezogwo/quilt1m/) is a ViT-B/16 image tower and PubMedBERT text tower vision-language foundation model trained on the [Quilt-1M](https://quilt1m.github.io/) dataset curated from representative histopathology videos.
 It can perform various vision-language processing (VLP) tasks such as cross-modal retrieval, image classification, and visual question answering.
 QuiltNet establishes new state of the art in a wide range of standard datasets, and substantially outperforms prior VLP approaches:
 