Add library_name and pipeline_tag metadata (#1)
- Add library_name and pipeline_tag metadata (fe072c4fe8ad0375cc4bf07f9af1b1172c911798)
Co-authored-by: Niels Rogge <nielsr@users.noreply.huggingface.co>
README.md CHANGED

--- a/README.md
+++ b/README.md
@@ -1,19 +1,24 @@
 ---
+datasets:
+- allenai/peS2o
+language:
+- en
+- nl
 license: other
 license_name: rail-share
 license_link: LICENSE
-language:
-- en
 metrics:
 - perplexity
-
-
+library_name: transformers
+pipeline_tag: text-generation
 ---
 
 # Model Card for SHARE-14B
 
 SHARE-14B (Social-Humanities AI for Research and Education) is a 14-billion-parameter decoder-only causal language model pretrained exclusively on content relevant to the social sciences and humanities (SSH). It is intended as a domain-specific base model for SSH research and education, and is designed to be used through the MIRROR interface, which surfaces token-level surprisal rather than generating new text.
 
+More information can be found in the paper [SHARE: Social-Humanities AI for Research and Education](https://huggingface.co/papers/2604.11152).
+
 **Note:** This is an intermediate checkpoint released after ~15% of planned pretraining (96B tokens of a target ~630B). It is a base (pretrained-only) model with no SFT, DPO, or RLHF. This base model is not suitable to chat applications.
 
 ## Model Details
@@ -31,8 +36,7 @@ SHARE-14B is the first causal language model fully pretrained by and for the SSH
 ### Model Sources
 
 - **Repository:** https://github.com/Joaoffg/SHARE
-- **Paper:** SHARE
-- **Demo (MIRROR interface):** [Add link]
+- **Paper:** [SHARE: Social-Humanities AI for Research and Education](https://arxiv.org/abs/2604.11152)
 - **Contact:** ferreiragoncalves@eshcc.eur.nl
 
 ## Uses
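The card text says the model is meant to surface token-level surprisal through the MIRROR interface rather than generate text. As a minimal sketch of what that computation looks like with a causal LM, the helper below turns next-token logits into per-token surprisal in bits; the `Joaoffg/SHARE-14B` repo id in the commented usage is a hypothetical placeholder, not taken from this commit.

```python
# Sketch: token-level surprisal from a causal LM's logits.
# The model/repo id in the usage comment below is hypothetical.
import torch
import torch.nn.functional as F


def token_surprisal(logits: torch.Tensor, input_ids: torch.Tensor) -> torch.Tensor:
    """Surprisal (negative log2-probability) of each token given its prefix.

    logits: (seq_len, vocab) from a causal LM run on input_ids.
    input_ids: (seq_len,) token ids.
    Returns surprisal in bits for tokens 1..seq_len-1 (the first token
    has no prefix, so it gets no score).
    """
    # Position t's logits predict token t+1, so drop the last position
    # and score tokens 1..seq_len-1 against them.
    log_probs = F.log_softmax(logits[:-1], dim=-1)
    nll_nats = -log_probs.gather(1, input_ids[1:, None]).squeeze(1)
    return nll_nats / torch.log(torch.tensor(2.0))  # nats -> bits


# Usage with transformers (not executed here; repo id is a placeholder):
# from transformers import AutoModelForCausalLM, AutoTokenizer
# tok = AutoTokenizer.from_pretrained("Joaoffg/SHARE-14B")
# model = AutoModelForCausalLM.from_pretrained("Joaoffg/SHARE-14B")
# ids = tok("An example sentence.", return_tensors="pt").input_ids[0]
# logits = model(ids[None]).logits[0]
# bits = token_surprisal(logits, ids)
```

The `library_name: transformers` and `pipeline_tag: text-generation` metadata added in this commit are what let the Hub attach a standard `transformers` loading snippet like the one sketched in the comments.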