# MedCLIP: Fine-tuning a CLIP model on the ROCO medical dataset

<h3 align="center">
<!-- <p>MedCLIP</p> -->
<img src="./assets/logo.png" alt="huggingface-medclip" width="250" height="250">
</h3>

## Summary
This repository contains the code for fine-tuning a CLIP model on the [ROCO dataset](https://github.com/razorx89/roco-dataset), a dataset of radiology images paired with text captions.
This work was done as part of the [**Flax/Jax community week**](https://github.com/huggingface/transformers/blob/master/examples/research_projects/jax-projects/README.md#quickstart-flax-and-jax-in-transformers) organized by Hugging Face and Google.

### Demo
You can try a Streamlit demo app that uses this model on [🤗 Spaces](https://huggingface.co/spaces/kaushalya/medclip-roco). You may have to sign up for the 🤗 Spaces private beta to access this app (screenshot shown below).

🤗 Hub Model card: https://huggingface.co/flax-community/medclip-roco

## Dataset

Each image is accompanied by a text caption. The caption length varies from a few characters (a single word) to 2,000 characters. During preprocessing we remove all images that have a caption shorter than 10 characters.

- Training set: 57,780 images with their captions
- Validation set: 7,200 images
- Test set: 7,650 images
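
For concreteness, here is a minimal sketch of that filtering step (the variable and field names are hypothetical, not the repo's actual preprocessing code):

```python
MIN_CAPTION_LEN = 10  # threshold described above

def filter_short_captions(examples):
    """Keep only image-caption pairs whose caption has at least 10 characters."""
    return [ex for ex in examples if len(ex["caption"]) >= MIN_CAPTION_LEN]
```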

- [ ] Give an example

## Installation 💽
This repo depends on the master branch of the [Hugging Face Transformers library](https://github.com/huggingface/transformers). First clone the Transformers repository, then install it locally (preferably inside a virtual environment) with `pip install -e ".[flax]"`, as shown below.
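
A minimal sketch of those steps (assuming the standard clone URL and an already-activated virtual environment):

```bash
git clone https://github.com/huggingface/transformers.git
cd transformers
pip install -e ".[flax]"  # editable install with the Flax extras
```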

## Model
You can load the pretrained model from the Hugging Face Hub with:

```python
from medclip.modeling_hybrid_clip import FlaxHybridCLIP

model = FlaxHybridCLIP.from_pretrained("flax-community/medclip-roco")
```
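
As a quick sanity check, here is a hedged usage sketch. It assumes the Hub checkpoint ships tokenizer files loadable with `AutoTokenizer` and that `FlaxHybridCLIP` exposes `get_text_features` mirroring transformers' Flax CLIP models; adjust if the repo's API differs.

```python
import jax.numpy as jnp
from transformers import AutoTokenizer

from medclip.modeling_hybrid_clip import FlaxHybridCLIP

# Assumption: the Hub repo includes tokenizer files for the text tower.
tokenizer = AutoTokenizer.from_pretrained("flax-community/medclip-roco")
model = FlaxHybridCLIP.from_pretrained("flax-community/medclip-roco")

# Embed two candidate captions and compare them by cosine similarity.
inputs = tokenizer(["chest x-ray", "brain MRI"], padding=True, return_tensors="np")
# Assumption: get_text_features mirrors transformers' Flax CLIP API.
text_embeds = model.get_text_features(
    inputs["input_ids"], attention_mask=inputs["attention_mask"]
)
text_embeds = text_embeds / jnp.linalg.norm(text_embeds, axis=-1, keepdims=True)
print(text_embeds @ text_embeds.T)  # pairwise caption similarities
```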

## Training
The model is trained using Flax/JAX on a Cloud TPU v3-8.
You can fine-tune the CLIP model implemented in Flax by simply running `sh run_medclip.sh`.
This is the validation loss curve we observed when we trained the model using the `run_medclip.sh` script.


## TODO
- [ ] Evaluation on downstream tasks
- [ ] Zero-shot learning performance
- [ ] Merge the demo app