File size: 1,224 Bytes
3a9b2f4
7d376a7
3a9b2f4
7d376a7
 
 
 
 
 
 
 
 
 
 
 
 
 
df809ba
 
 
7d376a7
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
ab04a8f
7d376a7
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
# 🥪 Rye AI

Welcome to **Rye AI** — a slightly absurd collection of Danish AI models, tools, and experiments… layered like a proper smørrebrød.

We build things that are:
- open 🧑‍🍳  
- layered 🧠  
- occasionally overengineered 😏  

## 🍞 What’s on the menu?

### 🧠 DaMorph
A collection of experimental models exploring **morphological segmentation for Danish NLP**.

👉 Because Danish words are long… and deserve to be sliced properly.

**Includes:**
- DaMorph
- DaMorph Tokenizers (yes, we slice at every layer)
- DaMedSum

### 🏥 DaMedSum
Danish medical summarization models trained on **LUMI HPC**.

👉 Turning long, complicated medical text into something (slightly) more digestible.

**Includes:**
- T5-large
- large / base / small variants  

### 🔪 Tokenizers (a.k.a. precision slicing)
Because no good sandwich starts without proper slicing.

- Morphological tokenizers for Danish  
- Built to explore how structure impacts understanding  

## 🤔 What to expect

- Serious experiments 🤓  
- Slightly cursed ideas 😈  
- Danish NLP in all its glory 🇩🇰  
- Things that *probably* shouldn’t work… but do  

## ⚡ Slogan

**Open source never tasted this good.**