File size: 3,072 Bytes
8d648d8
a285463
 
 
 
 
 
 
 
 
23a9c0c
448b2b6
23a9c0c
a285463
23a9c0c
448b2b6
a285463
 
 
 
 
 
 
 
 
 
 
 
 
3373760
da86ca9
3373760
da86ca9
3373760
b803b44
3373760
a285463
 
 
 
 
 
 
 
 
 
 
 
 
d5ca635
 
 
a285463
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
# Model Card for CLIBD

In this model repo we provide the official pretrained models used in the paper **CLIBD: Bridging Vision and Genomics for Biodiversity Monitoring at Scale.**
The model usage and code can be found in the [github repo](https://github.com/bioscan-ml/clibd).

## Model Details

### Model Description

- **Finetuned from model:**

    -Image: timm model (["vit_base_patch16_224"](https://huggingface.co/timm/vit_base_patch16_224.mae))

    -DNA barcode: BarcodeBERT ["bioscanr/barcodeBERT pre-trained on CANADA-1.5M"](https://huggingface.co/bioscan-ml/bioscan-clibd/tree/main/ckpt/BarcodeBERT/5_mer)

    -Text: Pre-trained BERT model  (["prajjwal1/bert-small"](https://huggingface.co/prajjwal1/bert-small))
### Model Sources

- **Repository:** https://github.com/bioscan-ml/clibd
- **Paper:** https://arxiv.org/abs/2405.17537

### Model Checkpoints

- **ckpt/bioscan_clip/final_experiments/image_dna_4gpu_50epoch/best.pth:** The model trained on the BIOSCAN-1M dataset by aligning images and DNA.
- **ckpt/bioscan_clip/final_experiments/image_dna_text_4gpu_50epoch/best.pth:** The model trained on the BIOSCAN-1M dataset by aligning images, DNA, and taxonomy labels.
- **ckpt/bioscan_clip/new_5M_training/image_dna_4gpu_50epoch/best.pth:** The model trained on the BIOSCAN-5M dataset by aligning images and DNA.
- **ckpt/bioscan_clip/new_5M_training/image_dna_text_4gpu_50epoch/best.pth:** The model trained on the BIOSCAN-5M dataset by aligning images, DNA, and taxonomy labels.

## Training Data

-[BIOSCAN-1M](https://huggingface.co/datasets/bioscan-ml/BIOSCAN-1M).

-[BIOSCAN-5M](https://huggingface.co/datasets/bioscan-ml/BIOSCAN-5M).

You can also find the processed data from [here](https://huggingface.co/datasets/bioscan-ml/bioscan-clibd).

**BibTeX:**
```bibtex
@article{gong2024clibd,
  title={{CLIBD}: Bridging Vision and Genomics for Biodiversity Monitoring at Scale},
  author={Gong, ZeMing and Wang, Austin T. and Huo, Xiaoliang and Haurum, Joakim Bruslund and Lowe, Scott C. and Taylor, Graham W. and Chang, Angel X.},
  journal={arXiv preprint arXiv:2405.17537},
  year={2024},
  eprint={2405.17537},
  archivePrefix={arXiv},
  primaryClass={cs.AI},
  doi={10.48550/arxiv.2405.17537},
}
```

## Acknowledgement

We would like to express our gratitude for the use of the INSECT dataset, which played a pivotal role in the completion of our experiments. Additionally, we acknowledge the use and modification of code from the [Fine-Grained-ZSL-with-DNA](https://github.com/sbadirli/Fine-Grained-ZSL-with-DNA) repository, which facilitated part of our experimental work. The contributions of these resources have been invaluable to our project, and we appreciate the efforts of all developers and researchers involved.

This reseach was supported by the Government of Canada’s New Frontiers in Research Fund (NFRF) [NFRFT-2020-00073],
Canada CIFAR AI Chair grants, and the Pioneer Centre for AI (DNRF grant number P1).
This research was also enabled in part by support provided by the Digital Research Alliance of Canada (alliancecan.ca).