The training datasets used for training the ChEmbed family of text embedding models
AI & ML interests
None defined yet.
Recent Activity
Organization Card
Edit this README.md markdown file to author your organization card.
models 7
BASF-AI/ChEmbed-prog
Feature Extraction • 0.1B • Updated
• 22
BASF-AI/ChEmbed-vanilla
Feature Extraction • 0.1B • Updated
• 9
BASF-AI/ChEmbed-plug
Feature Extraction • 0.1B • Updated
BASF-AI/ChEmbed-full
Feature Extraction • 0.1B • Updated
• 16 • 1
BASF-AI/ChemVocab
Updated
BASF-AI/nomic-bert-2048
0.1B • Updated
• 2
BASF-AI/nomic-embed-text-v1.5
Sentence Similarity • 0.1B • Updated
• 35
datasets 76
BASF-AI/ChemRxivRetrieval
Viewer
• Updated
• 79.5k • 32 • 1
BASF-AI/uspto-title-abs-chem
Viewer
• Updated
• 75.8k • 19
BASF-AI/uspto-synth-query-abs-chem
Viewer
• Updated
• 75.8k • 18
BASF-AI/PlantCAD2_virtual_hackathon
Viewer
• Updated
• 9 • 15
BASF-AI/dolma-pes2o-chemistry
Viewer
• Updated
• 361k • 33 • 1
BASF-AI/ChemRxiv-Papers
Viewer
• Updated
• 30.4k • 18 • 1
BASF-AI/ChemRxiv-Paragraphs
Viewer
• Updated
• 209k • 15 • 2
BASF-AI/ChemRxiv-Train-CC-BY
Viewer
• Updated
• 139k • 14 • 1
BASF-AI/dolma-chem-only-query-generated
Viewer
• Updated
• 1.17M • 9
BASF-AI/ChemRxiv-Train-CC-BY-v2
Viewer
• Updated
• 138k • 4 • 2