File size: 1,525 Bytes
7d376a7
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
df809ba
 
 
7d376a7
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
ab04a8f
7d376a7
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
# 🥪 OpenFacedSandwich

Welcome to **OpenFacedSandwich** — a slightly absurd collection of Danish AI models, tools, and experiments… layered like a proper smørrebrød.

We build things that are:
- open 🧑‍🍳  
- layered 🧠  
- occasionally overengineered 😏  

## 🍞 What’s on the menu?

### 🧠 DaMorph
A collection of experimental models exploring **morphological segmentation for Danish NLP**.

👉 Because Danish words are long… and deserve to be sliced properly.

**Includes:**
- DaMorph
- DaMorph Tokenizers (yes, we slice at every layer)
- DaMedSum

### 🏥 DaMedSum
Danish medical summarization models trained on **LUMI HPC**.

👉 Turning long, complicated medical text into something (slightly) more digestible.

**Includes:**
- T5-large
- large / base / small variants  

### 🔪 Tokenizers (a.k.a. precision slicing)
Because no good sandwich starts without proper slicing.

- Morphological tokenizers for Danish  
- Built to explore how structure impacts understanding  

## 🥓 Philosophy

Why “OpenFacedSandwich”?

Because:
- **Open-faced** → everything is visible → open source  
- **Layers** → models, tokens, systems  
- **Stacking** → how we actually build AI  

👉 We don’t just build models  
👉 we **assemble sandwiches**

## 🤔 What to expect

- Serious experiments 🤓  
- Slightly cursed ideas 😈  
- Danish NLP in all its glory 🇩🇰  
- Things that *probably* shouldn’t work… but do  

## ⚡ Slogan

**Open source never tasted this good.**