File size: 1,546 Bytes
eb73d88
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9195330
9fe4ec5
eb73d88
 
9fe4ec5
 
eb73d88
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e60910c
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
---
license: mit
tags:
- chemistry
- molecular-property-prediction
- drug-discovery
datasets:
- natelgrw/ReTiNA
pipeline_tag: tabular-regression
---

# ReTiNA Models: Molecular Retention Time Prediction

A collection of machine learning models for predicting the retention time of chemical compounds in various LC-MS. These models use molecular descriptors and method encodings to predict chemical retention times, useful for automated compound identification.

Source code for the ReTiNA model collection is available at this [Github Repository](https://github.com/natelgrw/retina_models).

The ReTiNA dataset is available at this [Hugging Face Repository](https://huggingface.co/datasets/natelgrw/ReTiNA). 

## 🤖 Available Models

In retention time prediction, we recommend using **ReTiNA_XGB1**, as it has the highest overall prediction accuracy.

| Model | Architecture | Overall RMSE (s) | Overall MAE (s) | Overall R<sup>2</sup> |
|-----|-----|-----|-----|-----|
| **ReTiNA_XGB1** | XGBoost | 182.81 | 119.30 | 0.659 |
| **ReTiNA_MLP1** | PyTorch Residual MLP | 202.67 | 141.79 | 0.516 |

All models were evaluated across rigorous scaffold, cluster, and method splits.

## 📄 Citation

If you use a ReTiNA prediction model in your research, please cite:

```
@modelcollection{retinamodels,
  title={ReTiNA-Models: Machine Learning Models for LC-MS Retention Time Prediction},
  author={Leung, Nathan},
  institution={Coley Research Group @ MIT}
  year={2025},
  howpublished={\url{https://huggingface.co/natelgrw/ReTiNA-Models}},
}
```