zmgong commited on
Commit
a285463
·
1 Parent(s): 90df42a

Add README.md

Browse files
Files changed (1) hide show
  1. README.md +49 -0
README.md ADDED
@@ -0,0 +1,49 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Model Card for DuoduoCLIP
2
+
3
+ In this model repo we provide the official pretrained models used in the paper **CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale.**
4
+ The model usage and code can be found in the [github repo](https://github.com/bioscan-ml/clibd).
5
+
6
+ ***Note: We provide the main model in the initial release, we will soon upload the other models used in the paper.***
7
+
8
+ ## Model Details
9
+
10
+ ### Model Description
11
+
12
+ - **Finetuned from model:**
13
+ -image: timm model (["vit_base_patch16_224"](https://huggingface.co/3dlg-hcvc/DuoduoCLIP))
14
+ -DNA barcode: BarcodeBERT ["bioscanr/barcodeBERT pre-trained on CANADA-1.5M"](https://huggingface.co/bioscan-ml/bioscan-clibd/tree/main/ckpt/BarcodeBERT/5_mer)
15
+ -text: Pre-trained BERT model (["prajjwal1/bert-small"](https://huggingface.co/prajjwal1/bert-small))
16
+ ### Model Sources
17
+
18
+ - **Repository:** https://github.com/bioscan-ml/clibd
19
+ - **Paper:** https://arxiv.org/abs/2405.17537
20
+
21
+ ### Model Checkpoints
22
+
23
+ - **ckpt/bioscan_clip/final_experiments/image_dna_4gpu_50epoch/best.pth:** The model trained on the BIOSCAN-1M dataset by aligning images and DNA.
24
+ - **ckpt/bioscan_clip/final_experiments/image_dna_text_4gpu_50epoch/best.pth:** The model trained on the BIOSCAN-1M dataset by aligning images, DNA, and taxonomy labels.
25
+ - **ckpt/bioscan_clip/new_5M_training/image_dna_4gpu_50epoch/best.pth:** The model trained on the BIOSCAN-5M dataset by aligning images and DNA.
26
+ - **ckpt/bioscan_clip/new_5M_training/image_dna_text_4gpu_50epoch/best.pth:** The model trained on the BIOSCAN-5M dataset by aligning images, DNA, and taxonomy labels.
27
+
28
+ ## Training Data
29
+ -[BIOSCAN-1M](https://huggingface.co/datasets/bioscan-ml/BIOSCAN-1M).
30
+ -[BIOSCAN-5M](https://huggingface.co/datasets/bioscan-ml/BIOSCAN-5M).
31
+ You can also find the processed data from [here](https://huggingface.co/datasets/bioscan-ml/bioscan-clibd)
32
+ **BibTeX:**
33
+ ```bibtex
34
+ @article{gong2024clibd,
35
+ title={{CLIBD}: Bridging Vision and Genomics for Biodiversity Monitoring at Scale},
36
+ author={Gong, ZeMing and Wang, Austin T. and Huo, Xiaoliang and Haurum, Joakim Bruslund and Lowe, Scott C. and Taylor, Graham W. and Chang, Angel X.},
37
+ journal={arXiv preprint arXiv:2405.17537},
38
+ year={2024},
39
+ eprint={2405.17537},
40
+ archivePrefix={arXiv},
41
+ primaryClass={cs.AI},
42
+ doi={10.48550/arxiv.2405.17537},
43
+ }
44
+ ```
45
+ We would like to express our gratitude for the use of the INSECT dataset, which played a pivotal role in the completion of our experiments. Additionally, we acknowledge the use and modification of code from the [Fine-Grained-ZSL-with-DNA](https://github.com/sbadirli/Fine-Grained-ZSL-with-DNA) repository, which facilitated part of our experimental work. The contributions of these resources have been invaluable to our project, and we appreciate the efforts of all developers and researchers involved.
46
+
47
+ This reseach was supported by the Government of Canada’s New Frontiers in Research Fund (NFRF) [NFRFT-2020-00073],
48
+ Canada CIFAR AI Chair grants, and the Pioneer Centre for AI (DNRF grant number P1).
49
+ This research was also enabled in part by support provided by the Digital Research Alliance of Canada (alliancecan.ca).