BioGeek committed on
Commit 6e34abf · verified · 1 Parent(s): d70bef6

Fix code, update citation references

Files changed (1)
  1. README.md +41 -5
README.md CHANGED
@@ -38,9 +38,10 @@ from winnow.scripts.main import filter_dataset
 from winnow.fdr.nonparametric import NonParametricFDRControl
 
 # 1) Download model files
+general_model = Path("general_model")
 snapshot_download(
     repo_id="InstaDeepAI/winnow-general-model",
-    allow_patterns=["*.pkl"]),
+    allow_patterns=["*.pkl"],
     repo_type="model",
     local_dir=general_model,
 )
@@ -50,8 +51,8 @@ calibrator = ProbabilityCalibrator.load(general_model)
 
 # 3) Load your dataset (InstaNovo-style config)
 dataset = InstaNovoDatasetLoader().load(
-    "path_to_spectrum_data.parquet",
-    "path_to_instanovo_predictions.csv",
+    data_path="path_to_spectrum_data.parquet",
+    predictions_path="path_to_instanovo_predictions.csv",
 )
 dataset = filter_dataset(dataset) # standard Winnow filtering
 
@@ -125,5 +126,40 @@ winnow predict \
 
 ## Citation
 
-If you use Winnow or this model, please cite:
-TODO
+If you use `winnow` in your research, please cite our preprint: [De novo peptide sequencing rescoring and FDR estimation with Winnow](https://arxiv.org/abs/2509.24952)
+
+```bibtex
+@article{mabona2025novopeptidesequencingrescoring,
+  title={De novo peptide sequencing rescoring and FDR estimation with Winnow},
+  author={Amandla Mabona and Jemma Daniel and Henrik Servais Janssen Knudsen and Rachel Catzel
+    and Kevin Michael Eloff and Erwin M. Schoof and Nicolas Lopez Carranza and Timothy P. Jenkins
+    and Jeroen Van Goey and Konstantinos Kalogeropoulos},
+  year={2025},
+  eprint={2509.24952},
+  archivePrefix={arXiv},
+  primaryClass={q-bio.QM},
+  url={https://arxiv.org/abs/2509.24952},
+}
+```
+
+If you use the `InstaNovo` model to generate predictions, please also cite: [InstaNovo enables diffusion-powered de novo peptide sequencing in large-scale proteomics experiments](https://doi.org/10.1038/s42256-025-01019-5)
+
+```bibtex
+@article{eloff_kalogeropoulos_2025_instanovo,
+  title = {InstaNovo enables diffusion-powered de novo peptide sequencing in large-scale
+    proteomics experiments},
+  author = {Eloff, Kevin and Kalogeropoulos, Konstantinos and Mabona, Amandla and Morell,
+    Oliver and Catzel, Rachel and Rivera-de-Torre, Esperanza and Berg Jespersen,
+    Jakob and Williams, Wesley and van Beljouw, Sam P. B. and Skwark, Marcin J.
+    and Laustsen, Andreas Hougaard and Brouns, Stan J. J. and Ljungars,
+    Anne and Schoof, Erwin M. and Van Goey, Jeroen and auf dem Keller, Ulrich and
+    Beguir, Karim and Lopez Carranza, Nicolas and Jenkins, Timothy P.},
+  year = 2025,
+  month = {Mar},
+  day = 31,
+  journal = {Nature Machine Intelligence},
+  doi = {10.1038/s42256-025-01019-5},
+  issn = {2522-5839},
+  url = {https://doi.org/10.1038/s42256-025-01019-5}
+}
+```
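
Review note: the first hunk moves a misplaced closing parenthesis in the `snapshot_download(...)` call, which previously made the README snippet unparseable. The sketch below is one way to check that, using only the standard-library `ast` module. The helper name `parses` and the 4-space indentation inside the two strings are assumptions for illustration, since the diff view does not preserve whitespace. Parsing does not resolve names, so this check cannot show whether `Path` is imported elsewhere in the README, and nothing is downloaded.

```python
import ast

# Snippet as it stood before this commit (indentation is an assumption).
OLD_SNIPPET = '''
snapshot_download(
    repo_id="InstaDeepAI/winnow-general-model",
    allow_patterns=["*.pkl"]),
    repo_type="model",
    local_dir=general_model,
)
'''

# Snippet after this commit (indentation is an assumption).
NEW_SNIPPET = '''
general_model = Path("general_model")
snapshot_download(
    repo_id="InstaDeepAI/winnow-general-model",
    allow_patterns=["*.pkl"],
    repo_type="model",
    local_dir=general_model,
)
'''


def parses(source: str) -> bool:
    """Return True if `source` is syntactically valid Python."""
    try:
        ast.parse(source)
    except SyntaxError:  # IndentationError is a subclass of SyntaxError
        return False
    return True


if __name__ == "__main__":
    print("old snippet parses:", parses(OLD_SNIPPET))
    print("new snippet parses:", parses(NEW_SNIPPET))
```

The old snippet is expected to fail here because the stray `)` ends the call early and leaves the remaining arguments as stray statements, while the new snippet should parse cleanly.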