dominic-fischer commited on
Commit
df279ab
·
1 Parent(s): 784e739

update hf demo

Browse files
Files changed (2) hide show
  1. LICENSE.txt +21 -0
  2. README.md +29 -1
LICENSE.txt ADDED
@@ -0,0 +1,21 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ MIT License
2
+
3
+ Copyright (c) 2025, University of Zurich (UZH)
4
+
5
+ Permission is hereby granted, free of charge, to any person obtaining a copy
6
+ of this software and associated documentation files (the "Software"), to deal
7
+ in the Software without restriction, including without limitation the rights
8
+ to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
9
+ copies of the Software, and to permit persons to whom the Software is
10
+ furnished to do so, subject to the following conditions:
11
+
12
+ The above copyright notice and this permission notice shall be included in all
13
+ copies or substantial portions of the Software.
14
+
15
+ THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16
+ IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17
+ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18
+ AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19
+ LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20
+ OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21
+ SOFTWARE.
README.md CHANGED
@@ -5,8 +5,36 @@ colorFrom: purple
5
  colorTo: yellow
6
  sdk: gradio
7
  sdk_version: 5.43.1
 
 
8
  app_file: app.py
9
  pinned: false
10
  ---
11
 
12
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
  colorTo: yellow
6
  sdk: gradio
7
  sdk_version: 5.43.1
8
+ tags:
9
+ - beta
10
  app_file: app.py
11
  pinned: false
12
  ---
13
 
14
+ # Basic Lemmatizer for Romansh Varieties <span style="color:gray">(Beta)</span>: Demo
15
+
16
+ This demo visualises the functionalities of the package "romansh_lemmatizer", available at:
17
+
18
+ https://github.com/ZurichNLP/romansh_lemmatizer
19
+
20
+ The underlying Python package presents a basic dictionary-based lemmatizer for the Romansh language.
21
+ Provided a Romansh text, the lemmatizer splits it into words and looks up each word in the [Pledari Grond](https://pledarigrond.ch/) dictionaries for the five standard Romansh idioms: Sursilvan, Sutsilvan, Surmiran, Puter, and Vallader, as well as the dictionary for Rumantsch Grischun.
22
+
23
+ For example, if a Romansh text contains the word _lavuraiva_, the lemmatizer traces the word back to the Vallader and Puter dictionaries:
24
+
25
+ <img src="illustration.png" alt="illustration" width="400"/>
26
+
27
+ Typical use cases for the lemmatizer include:
28
+ - Accessing potential German translations (glosses) of Romansh words
29
+ - Automatically detecting the variety of a Romansh text, based on how many words are found in the respective dictionaries
30
+
31
+ A limitation of the current version is that the lemmatizer does not disambiguate between multiple possible ways of lemmatizing a word. Specifically:
32
+ 1. If a word has multiple dictionary entries, all the dictionary entries are returned, irrespective of the context in which the word occurs.
33
+ 2. If there are multiple ways of morphologically analysing a given word form, all possible analyses are returned.
34
+
35
+ ## Acknowledgements and Data Rights
36
+ This demo incorporates dictionary data from the [Pledari Grond](https://pledarigrond.ch/) project.
37
+
38
+ - The dictionaries for Rumantsch Grischun, Surmiran, Sursilvan and Sutsilvan are openly licensed. © **Lia Rumantscha** 1980 – 2025
39
+ - The dictionaries for Vallader and Puter are kindly provided by [**Uniun dals Grischs**](https://www.udg.ch/dicziunari) and may only be used in the context of this lemmatizer. © Uniun dals Grischs. All rights reserved.
40
+