EmbedKombinat is an open-source project building the largest LLM-verified embedding training dataset. Embedding models are held back by false negatives in their training data we want to fix this by crowdsourcing LLM verification.
No public activity