roberta-eloquent / data /README.md
protagonist's picture
eloquent26: 5x5x66 generation corpus + scores
c1667ab verified
# eloquent26 generations
5 generators Γ— 5 strategies Γ— 66 topics = 1,650+ generations from the
ELOQUENT 2026 Voight-Kampff factor-isolation experiment.
## Files
- `generations.tar.gz` β€” full `out/` tree:
- `{generator}/{strategy}/{topic_id}.txt` β€” the texts
- `_references/{set}/*.txt` β€” human controls
- `scores/{detector}/{generator}/{strategy}/{topic_id}.json` β€” per-text scores
- `manifest.jsonl`, `analysis/*.csv`
- `scores.parquet` β€” 9.6k-row aggregated detector scores (for quick inspection)
## Strategies
`vanilla`, `imperfection`, `roundtrip`, `roundtrip_imperf`, `lost_in_translation`.
Round-trip language: Hindi (closed frontier), Chinese (Qwen family).
## License
For research use. Contact the author for any commercial use.
Repo: `protagonist/roberta-eloquent`.