almanach/gaperon-quality-classifier
ALMAnaCH (Inria)
Task: Text Classification
Libraries: Transformers, ONNX, Safetensors
Datasets: togethercomputer/RedPajama-Data-V2, LLM360/TxT360
Languages: French, English
Tags: xlm-roberta, gaperon, quality-classifier, document-quality, data-curation
arXiv: 2510.25771
License: apache-2.0
Branch: main
Path: gaperon-quality-classifier/onnx
Total size: 3.19 GB
Contributors: 1
History: 1 commit
Latest commit: a385183 (verified) by wissamantoun, "Upload folder using huggingface_hub", 26 days ago
File                      Size       Flags      Last commit                          Updated
config.json               1.21 kB               Upload folder using huggingface_hub  26 days ago
model.onnx                182 kB     xet        Upload folder using huggingface_hub  26 days ago
model.onnx.data           3.11 GB    xet        Upload folder using huggingface_hub  26 days ago
ort_config.json           1.21 kB    Safe       Upload folder using huggingface_hub  26 days ago
sentencepiece.bpe.model   18.2 MB    xet        Upload folder using huggingface_hub  26 days ago
special_tokens_map.json   964 Bytes  Safe       Upload folder using huggingface_hub  26 days ago
tokenizer.json            61.3 MB    Safe, xet  Upload folder using huggingface_hub  26 days ago
tokenizer_config.json     1.18 kB               Upload folder using huggingface_hub  26 days ago