English
File size: 3,704 Bytes
890a890
 
35cfd3b
 
890a890
c8ed2de
35cfd3b
a1aef1d
 
 
c8ed2de
35cfd3b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
c8ed2de
35cfd3b
 
c8ed2de
 
35cfd3b
 
 
8ca3ad4
 
 
35cfd3b
 
 
a9ed331
8ca3ad4
35cfd3b
 
d6d27e6
8ca3ad4
35cfd3b
 
8ca3ad4
 
35cfd3b
 
8ca3ad4
35cfd3b
d6d27e6
 
 
 
35cfd3b
 
 
a9ed331
35cfd3b
 
a9ed331
35cfd3b
 
 
a9ed331
35cfd3b
 
 
a9ed331
35cfd3b
a9ed331
35cfd3b
a9ed331
35cfd3b
 
a9ed331
35cfd3b
a9ed331
35cfd3b
 
 
a9ed331
35cfd3b
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
---
license: cc-by-sa-4.0
language:
- en
---
# phytoClassUCSC - A phytoplankton classifier for IFCB data

__TRY IT OUT HERE:__  
__https://colab.research.google.com/drive/1mv4xs8NHyyqls9OMfZ74HpzCLi9GlkTX?usp=sharing__

Note: Sections and prompts from the [model cards paper](https://arxiv.org/abs/1810.03993), v2.

Jump to section:

- [Model details](#model-details)
- [Intended use](#intended-use)
- [Factors](#factors)
- [Metrics](#metrics)
- [Evaluation data](#evaluation-data)
- [Training data](#training-data)
- [Quantitative analyses](#quantitative-analyses)
- [Ethical considerations](#ethical-considerations)
- [Caveats and recommendations](#caveats-and-recommendations)

## Model details

- Developed by the Kudela Lab from the Ocean Sciences Department at University of California, Santa Cruz.
- Current version trained in February, 2023.
- Version 1.0
- phytoClassUCSC is a depthwise- CNN based on the Xception architecture [Chollet, F., 2017](https://arxiv.org/abs/1610.02357) with 134 layers using weights pretrained on ImageNet.
- An average pooling layer is used.
- Licensed under CC-BY-SA-4.0
- For Questions email Patrick Daniel ([pcdaniel@ucsc.edu](pcdaniel@ucsc.edu))

## Intended use

This model was designed and trained to work with IFCB data generated in Monterey Bay. While that does not mean it may not perform well in other locations, the distribution of training images reflects common phytoplankton observed at the Santa Cruz Wharf and Power Buoy locations.

Independent model validation should be used when applying the model to other sites.

### Primary intended uses

Generalized micro-phytoplankton classifier for common taxa found in the Monterey Bay.

### Primary intended users

Researchers intersted in a general.

### Out-of-scope use cases

Observing and identifying rare or non-endemic taxa.

## Factors

Model classes were chosen based on common and resolvable phytoplankton taxa. Taxonomic groupings were chosen based on what researchers in the lab felt groups that could be confidently identified, given the expertise and research intersts of the lab.

### Instrument

Model was trained on images from Imaging FlowCytobot (IFCB) instruments primary deployed at the Santa Cruz Wharf and the Monterey Bay Aquarium Research Institute (MBARI) Power Buoy. The Santa Cruz Wharf IFCB (#104) is an early generation 


## Metrics

_Deployed model performance will vary with the natural variabilability in the observed phytoplankton communities over different time scales (seasonality). As such model performance should be evaluated throughout IFCb deployments using independently labled images._

### Model performance measures
Training model performace was evaluated using a held-back validation training set. F1-scores were calcuated for each class. [See Results here](https://stage-habdac-streamlit.srv.axds.co/Model_Metrics)

### Approaches to uncertainty and variability

Uncertainty is addressed by applying a set of class-specific thresholds for each prediction. This works reasonably well for out-of-distribution images.

## Training data

To Be Described

## Ethical considerations

None


## Caveats and recommendations

This model was developed as in interation of previous classification efforts and as such is subject to a history of decision making that is not captured here. For that reasons this classifier is not a panacea for all phytoplankton image data, but was specifically developed for looking at phytoplankton communities in Monterey Bay.



IFCB collected data are very context specific and subject to both observation configurations and small-scale variability. 


Review section 4.9 of the [model cards paper](https://arxiv.org/abs/1810.03993).