---
license: mit
library_name: fasttext
pipeline_tag: text-classification
---

This is the fastText pretraining data filter that targets the LAMBADA DE task. It is discussed in the main text of the Perplexity Correlations paper: https://arxiv.org/abs/2409.05816

Code: https://github.com/TristanThrush/perplexity-correlations
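
As a rough illustration of how a fastText filter like this one might be applied to candidate pretraining documents, here is a minimal sketch. It is not taken from the paper or the repository: the label name `__label__include`, the 0.5 threshold, and the helper names `include_score`, `filter_documents`, and `load_filter` are assumptions made for the example. Check `model.get_labels()` on the downloaded model for the real label names, and see the code repository above for the authors' own usage.

```python
from typing import Iterable, List

# Assumption: name of the "keep this document" label. Verify it with
# model.get_labels() on the actual model before relying on it.
INCLUDE_LABEL = "__label__include"


def include_score(model, text: str, include_label: str = INCLUDE_LABEL) -> float:
    """Return the probability the model assigns to include_label for one document."""
    # fastText's predict() rejects strings containing newlines,
    # so collapse all whitespace into single spaces first.
    single_line = " ".join(text.split())
    # k=-1 asks fastText for every label, not only the top one.
    labels, probs = model.predict(single_line, k=-1)
    for label, prob in zip(labels, probs):
        if label == include_label:
            return float(prob)
    return 0.0


def filter_documents(model, docs: Iterable[str], threshold: float = 0.5) -> List[str]:
    """Keep the documents whose include score reaches the (hypothetical) threshold."""
    return [doc for doc in docs if include_score(model, doc) >= threshold]


def load_filter(path: str):
    """Load a fastText model from a local file path (requires `pip install fasttext`)."""
    import fasttext  # imported lazily so the helpers above work without it

    return fasttext.load_model(path)
```

To use it, download the model file from this repository, pass its local path to `load_filter`, and call `filter_documents(model, docs)` on a list of strings. The threshold is a tuning knob. A selection pipeline could just as well rank documents by `include_score` and keep the top fraction.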