Sentence Similarity
sentence-transformers
PyTorch
German
bert
Eval Results (legacy)
text-embeddings-inference
Instructions to use and-effect/musterdatenkatalog_clf with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- sentence-transformers
How to use and-effect/musterdatenkatalog_clf with sentence-transformers:
from sentence_transformers import SentenceTransformer model = SentenceTransformer("and-effect/musterdatenkatalog_clf") sentences = [ "Bebauungspläne, vorhabenbezogene Bebauungspläne (Geltungsbereiche)", "Fachkräfte für Glücksspielsuchtprävention und -beratung", "Tagespflege Altenhilfe", "Bebauungsplan der Innenentwicklung gem. § 13a BauGB - Ortskern Rütenbrock" ] embeddings = model.encode(sentences) similarities = model.similarity(embeddings, embeddings) print(similarities.shape) # [4, 4] - Notebooks
- Google Colab
- Kaggle
add github Repo
Browse files
README.md
CHANGED
|
@@ -67,7 +67,7 @@ license: cc-by-4.0
|
|
| 67 |
|
| 68 |
## Model Sources
|
| 69 |
|
| 70 |
-
- **Repository**:
|
| 71 |
- **Demo**: [Spaces App](https://huggingface.co/spaces/and-effect/Musterdatenkatalog)
|
| 72 |
|
| 73 |
This model is based on [bert-base-german-cased](https://huggingface.co/bert-base-cased) and fine-tuned on [and-effect/mdk_gov_data_titles_clf](https://huggingface.co/datasets/and-effect/mdk_gov_data_titles_clf). The model was created as part of the [Bertelsmann Foundation's Musterdatenkatalog (MDK)](https://www.bertelsmann-stiftung.de/de/unsere-projekte/smart-country/musterdatenkatalog) project. The model is intended to classify open source dataset titles from german municipalities. This can help municipalities in Germany, as well as data analysts and journalists, to see which cities have already published data sets and what might be missing. The model is specifically tailored for this task and uses a specific taxonomy. It thus has a clear intended downstream task and should be used with the mentioned taxonomy.
|
|
|
|
| 67 |
|
| 68 |
## Model Sources
|
| 69 |
|
| 70 |
+
- **Repository**: https://github.com/bertelsmannstift/Musterdatenkatalog-V4
|
| 71 |
- **Demo**: [Spaces App](https://huggingface.co/spaces/and-effect/Musterdatenkatalog)
|
| 72 |
|
| 73 |
This model is based on [bert-base-german-cased](https://huggingface.co/bert-base-cased) and fine-tuned on [and-effect/mdk_gov_data_titles_clf](https://huggingface.co/datasets/and-effect/mdk_gov_data_titles_clf). The model was created as part of the [Bertelsmann Foundation's Musterdatenkatalog (MDK)](https://www.bertelsmann-stiftung.de/de/unsere-projekte/smart-country/musterdatenkatalog) project. The model is intended to classify open source dataset titles from german municipalities. This can help municipalities in Germany, as well as data analysts and journalists, to see which cities have already published data sets and what might be missing. The model is specifically tailored for this task and uses a specific taxonomy. It thus has a clear intended downstream task and should be used with the mentioned taxonomy.
|