davanstrien HF Staff commited on
Commit
d292a6e
Β·
verified Β·
1 Parent(s): 2684d85

Expand org README: community + open-foundations framing

Browse files

Hi @wjbmattingly β€” opening as a PR rather than a direct commit since you're co-maintainer and this changes the landing-page framing meaningfully. Veto/edit anything you want.

Current README is the placeholder ("Lightweight AI models for cultural heritage institutions / More coming soon"). With 14 models + 9 datasets in the org now (plus a couple of book chapters explaining the recipes), it felt time to flesh it out.

Changes:
- Kept the hero gif and frontmatter unchanged.
- Replaced the "more coming soon" stub with a landing-page body.
- Framing: the org as a community relay β€” open foundations + community datasets β†’ small task-specific models β†’ extensible by the next institution. Named YOLO/DETR/BERT/Qwen-VL, BigLAM, and the index-card-detector-v5 extension of NLS's detector as the worked example.
- Linked AI Patterns for GLAM (incl. Boring AI + Beyond Chatbots) for the why.
- CTA: a "share a model, or suggest one" section pointing at Discussions, with the idea of curating contributed models into an org collection.
- Maintainer line names both of us + flags wider community contributions.

Happy to take this in, partially merge, or pass on it.

Files changed (1) hide show
  1. README.md +21 -17
README.md CHANGED
@@ -6,24 +6,28 @@ colorTo: indigo
6
  sdk: static
7
  pinned: false
8
  ---
9
- <br>
10
- <br>
11
- <br>
12
  <div align="center">
13
- <h2 align="center">SMALL MODELS FOR GLAM</h2>
14
- <br>
15
 
16
  ![Demo](https://cdn-uploads.huggingface.co/production/uploads/60107b385ac3e86b3ea4fc34/yfg8gNmfri4XSS5Oazkbm.gif)
17
 
18
- <br>
19
- <br>
20
- <p align="center">
21
- Lightweight AI models for cultural heritage institutions
22
- </p>
23
- <br>
24
- <br>
25
- <br>
26
- <p align="center">
27
- <sub>More coming soon. Follow this organization to get notified.</sub>
28
- </p>
29
- </div>
 
 
 
 
 
 
 
 
6
  sdk: static
7
  pinned: false
8
  ---
9
+
 
 
10
  <div align="center">
11
+ <h1>Small Models for GLAM</h1>
 
12
 
13
  ![Demo](https://cdn-uploads.huggingface.co/production/uploads/60107b385ac3e86b3ea4fc34/yfg8gNmfri4XSS5Oazkbm.gif)
14
 
15
+ </div>
16
+
17
+ Most of what gets done in libraries, archives and museums runs on a long tail of small, repetitive jobs β€” backlogs to clear, scans to make searchable, metadata to tidy. A good chunk of that work can be handled by small, task-specific models, and the people who know what those tasks are are the people working in those institutions.
18
+
19
+ This org is a place to put the models that come out of that work, so the next institution facing the same problem doesn't start from scratch.
20
+
21
+ Each model here builds on something. Most are fine-tunes of open foundation models β€” YOLO, DETR, BERT, Qwen-VL β€” trained on community datasets, often from [BigLAM](https://huggingface.co/biglam) or contributed by individual institutions. Several extend existing community-trained models for new collections rather than starting over: [index-card-detector-v5](https://huggingface.co/small-models-for-glam/index-card-detector-v5) takes the National Library of Scotland's archival card detector and extends it to three additional archives. That extension pattern matters β€” it's how this kind of work gets cheaper for everyone over time.
22
+
23
+ Recipes for most of the models live in [AI Patterns for GLAM](https://danielvanstrien.xyz/ai-patterns-for-glam/); [The Case for Boring AI](https://danielvanstrien.xyz/ai-patterns-for-glam/discovery/boring-ai.html) and [Beyond Chatbots](https://danielvanstrien.xyz/ai-patterns-for-glam/discovery/beyond-chatbots.html) set out the why.
24
+
25
+ ## How the models get built
26
+
27
+ Mostly with agentic workflows: an agent handles the data prep, training, and packaging; a human stays in the loop for the parts that matter β€” label review, evaluation, deciding whether something is good enough to release.
28
+
29
+ ## Share a model, or suggest one
30
+
31
+ If you've trained a small task-specific model for your own collection, share it in [Discussions](https://huggingface.co/spaces/small-models-for-glam/README/discussions) and we'll add good ones to a curated collection so other institutions can find them. Suggestions for tasks you'd like to see covered are welcome there too.
32
+
33
+ Maintained by [Daniel van Strien](https://huggingface.co/davanstrien) and [William Mattingly](https://huggingface.co/wjbmattingly), with contributions and datasets from across the GLAM ML community.