Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,70 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
datasets:
|
| 4 |
+
- eltorio/ROCO-radiology
|
| 5 |
+
language:
|
| 6 |
+
- en
|
| 7 |
+
- fr
|
| 8 |
+
base_model:
|
| 9 |
+
- HuggingFaceM4/Idefics3-8B-Llama3
|
| 10 |
+
---
|
| 11 |
+
|
| 12 |
+
# IDEFICS3_ROCO
|
| 13 |
+
|
| 14 |
+
[](https://colab.research.google.com/#fileId=https://huggingface.co/eltorio/IDEFICS3_ROCO/blob/main/ROCO-idefics3.ipynb)
|
| 15 |
+
|
| 16 |
+
## A Fine-tuned Radiology-focused Model based on Hugging Face's Idefics3 Model
|
| 17 |
+
|
| 18 |
+
This repository contains a fine-tuned version of the Hugging Face [Idefics3-8B-Llama3](https://huggingface.co/HuggingFaceM4/Idefics3-8B-Llama3) model, built on top of the Meta 3.1 8B architecture. Our model, `IDEFICS3_ROCO`, has been fine-tuned on the [Radiology Objects in Context (ROCO)](https://huggingface.co/datasets/eltorio/ROCO-radiology) dataset, a large-scale medical and multimodal imaging collection.
|
| 19 |
+
|
| 20 |
+
### Model Information
|
| 21 |
+
|
| 22 |
+
* **Base Model:** Idefics3-8B-Llama3
|
| 23 |
+
* **Fine-tuning Dataset:** Radiology Objects in Context (ROCO)
|
| 24 |
+
* **License:** Apache-2.0
|
| 25 |
+
* **Current Status:** Fine-tuning process is currently halted at checkpoint 640 (out of 24,000) due to limitations with Colab Free T4 GPU unit. Contributions to complete the fine-tuning process are welcome!
|
| 26 |
+
|
| 27 |
+
### Training Progress Status
|
| 28 |
+
|
| 29 |
+
* Current checkpoint: 620-640/24000 (~2.7% completed)
|
| 30 |
+
* Estimated remaining GPU time: ~57 hours
|
| 31 |
+
* Hardware requirements: T4 GPU with >16GB VRAM
|
| 32 |
+
* Last update: november, 7th 2021
|
| 33 |
+
|
| 34 |
+
### Fine-tuning Code
|
| 35 |
+
|
| 36 |
+
The fine-tuning code is available as a Jupyter Notebook in the [ROCO-radiology dataset repository](https://huggingface.co/datasets/eltorio/ROCO-radiology) on Hugging Face:
|
| 37 |
+
|
| 38 |
+
* [ROCO-idefics3.ipynb](https://huggingface.co/eltorio/IDEFICS3_ROCO/blob/main/ROCO-idefics3.ipynb)
|
| 39 |
+
|
| 40 |
+
The [Junyper Notebook](https://colab.research.google.com/#fileId=https%3A//huggingface.co/eltorio/IDEFICS3_ROCO/blob/main/ROCO-idefics3.ipynb) [](https://colab.research.google.com/#fileId=https://huggingface.co/eltorio/IDEFICS3_ROCO/blob/main/ROCO-idefics3.ipynb) contains the code to fine-tune the Idefics3-8B-Llama3 model on the ROCO dataset. The fine-tuning process is currently halted at checkpoint 640 (out of 24,000) due to limitations with Colab Free T4 GPU unit. Contributions to complete the fine-tuning process are welcome!
|
| 41 |
+
|
| 42 |
+
### Contributions Welcome
|
| 43 |
+
|
| 44 |
+
If you have the resources to complete the fine-tuning process, we would appreciate your contribution. Please fork this repository, finish the fine-tuning process, and submit a pull request with your updates.
|
| 45 |
+
|
| 46 |
+
### Citation
|
| 47 |
+
|
| 48 |
+
If you use this model in your work, please cite the original Idefics3 model and our fine-tuned model:
|
| 49 |
+
|
| 50 |
+
* [Idefics3-8B-Llama3](https://huggingface.co/HuggingFaceM4/Idefics3-8B-Llama3)
|
| 51 |
+
* [IDEFICS3_ROCO](https://huggingface.co/eltorio/IDEFICS3_ROCO)
|
| 52 |
+
|
| 53 |
+
### Contribution Guide
|
| 54 |
+
|
| 55 |
+
1. **Technical Requirements**
|
| 56 |
+
* Access to powerful GPU (T4, V100, A100 or equivalent)
|
| 57 |
+
* Python environment with PyTorch
|
| 58 |
+
* Disk space: ~50GB
|
| 59 |
+
|
| 60 |
+
2. **Getting Started**
|
| 61 |
+
* Fork the repository
|
| 62 |
+
* Resume from checkpoint 640
|
| 63 |
+
* Follow instructions in [ROCO-idefics3.ipynb](https://huggingface.co/eltorio/IDEFICS3_ROCO/blob/main/ROCO-idefics3.ipynb) [](https://colab.research.google.com/#fileId=https://huggingface.co/eltorio/IDEFICS3_ROCO/blob/main/ROCO-idefics3.ipynb)
|
| 64 |
+
|
| 65 |
+
3. **Contact**
|
| 66 |
+
* For questions: [link to issues/discussions]
|
| 67 |
+
|
| 68 |
+
### Acknowledgments
|
| 69 |
+
|
| 70 |
+
This work was made possible by the [Hugging Face Transformers](https://huggingface.co/) library and the [ROCO-radiology dataset](https://huggingface.co/datasets/eltorio/ROCO-radiology).
|