A collection of datasets curated by the Ontocord team.
AI & ML interests
Ontology + Concordance: The meeting of meaning
Organization Card
About Ontocord.AI
We are an open-source volunteer organization dedicated to creating safer, smaller, and higher-performance AI.
Our Recent Work
- MixtureVitae
- CulturaY
- OpenAssistant
- Aurora-m v1
- Vistral-7b-chat
- Red Pajama v1
- OIG and OIG-moderation
models 89
ontocord/1.7b-MixtureVitae-web_curated-100BT
2B • Updated • 275
ontocord/1.7b-MixtureVitae-curated-80BT
2B • Updated • 54
ontocord/1.7b-Comma0.1-300BT
2B • Updated • 936
ontocord/1.7b-MixtureVitae-300BT-v1-decontaminated-16k
Feature Extraction • 2B • Updated • 50
ontocord/1.7b-MixtureVitae-300BT-v1-decontaminated-16k-SFT-openthoughts30k
Feature Extraction • 2B • Updated • 38
ontocord/1.3b-Comma0.1-300BT
1B • Updated • 198
ontocord/0.4b-Comma0.1-300BT
0.4B • Updated • 327
ontocord/0.13b-Comma0.1-300BT
Updated
ontocord/1.7b_data-common_corpus-eng-300BT
2B • Updated • 319
ontocord/1.3b_data-common_corpus-eng-300BT
1B • Updated • 248
datasets 33
ontocord/synthetic-prompt-common-pile-annotated
Viewer • Updated • 202k
ontocord/MixtureVitae-v1-decontaminated
Viewer • Updated • 534M • 93
ontocord/MV_Qwen-Magpie
Viewer • Updated • 1.05k • 41
ontocord/Dolci-Think-RL-7B-decontaminated
Viewer • Updated • 101k • 71
ontocord/Dolci-Think-DPO-7B-decontaminated
Viewer • Updated • 150k • 877
ontocord/Dolci-Instruct-DPO-decontaminated
Viewer • Updated • 258k • 51
ontocord/Dolci-Instruct-SFT-decontaminated
Viewer • Updated • 2.14M • 766
ontocord/Dolci-Think-SFT-7B-decontaminated
Viewer • Updated • 2.26M • 514
ontocord/person2rel
Updated • 3
ontocord/MixtureVitae-v1
Viewer • Updated • 59.9M • 1.08k • 16