sbompolas commited on
Commit
a1f828f
·
verified ·
1 Parent(s): 4906b00

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +37 -48
README.md CHANGED
@@ -10,66 +10,55 @@ pinned: false
10
  license: cc-by-4.0
11
  ---
12
 
13
- # Stanza Parser with CoNLL-U Viewer
14
 
15
- A comprehensive linguistic analysis tool powered by Stanford's Stanza library that provides sentence parsing with multiple output formats.
16
 
17
  ## Features
18
 
19
- - **Multi-language Support**: Parse text in English, Spanish, French, German, Chinese, Russian, and Arabic
20
- - **CoNLL-U Output**: Get standard linguistic annotation format output
21
- - **Interactive Data Table**: Browse parsed tokens with all linguistic features
22
- - **Dependency Visualization**: Text-based visualization of dependency relationships
23
- - **Copy-friendly Output**: Easy to copy results for use in other tools
24
 
25
- ## What is CoNLL-U?
26
 
27
- CoNLL-U is a standard format for representing linguistic annotations that includes:
 
 
 
 
28
 
29
- - **Tokenization**: Word and sentence boundaries
30
- - **Part-of-Speech Tagging**: Universal and language-specific POS tags
31
- - **Lemmatization**: Base forms of words
32
- - **Morphological Features**: Grammatical attributes
33
- - **Dependency Parsing**: Syntactic relationships between words
34
 
35
- ## How to Use
 
 
 
 
 
 
 
 
 
 
36
 
37
- 1. Enter your text in the input box
38
- 2. Select the appropriate language
39
- 3. Click "Parse Text" or press Enter
40
- 4. View results in three formats:
41
- - Raw CoNLL-U format (copy-paste ready)
42
- - Interactive data table
43
- - Dependency structure visualization
44
 
45
- ## Example Output
46
-
47
- For the sentence "The cat sits on the mat", you'll get:
48
-
49
- - **CoNLL-U format**: Standard 10-column format with all linguistic features
50
- - **Data table**: Interactive view of each token's properties
51
- - **Dependencies**: "cat --nsubj--> sits", "mat --nmod--> sits", etc.
52
-
53
- ## Use Cases
54
-
55
- - **Linguistic Research**: Analyze sentence structure and grammatical relationships
56
- - **NLP Development**: Generate training data or test parsing models
57
- - **Educational**: Learn about syntactic analysis and dependency grammar
58
- - **Text Processing**: Prepare annotated data for downstream tasks
59
 
60
  ## Technical Details
61
 
62
- This space uses:
63
- - **Stanza**: Stanford's multilingual NLP toolkit
64
- - **Gradio**: For the interactive web interface
65
- - **Pandas**: For data table visualization
66
 
67
- The models are automatically downloaded and cached when the space starts up.
68
-
69
- ## Supported Languages
70
-
71
- Currently supports: English (en), Spanish (es), French (fr), German (de), Chinese (zh), Russian (ru), Arabic (ar)
72
-
73
- ---
74
 
75
- *Powered by Stanford Stanza - https://stanfordnlp.github.io/stanza/*
 
 
 
10
  license: cc-by-4.0
11
  ---
12
 
13
+ # Greek Text Parser with spaCy displaCy
14
 
15
+ This Hugging Face Space provides a web interface for parsing Greek text using spaCy and visualizing the results in CoNLL-U format or as interactive dependency trees.
16
 
17
  ## Features
18
 
19
+ - **Greek Language Processing**: Uses spaCy's Greek language model (`el_core_news_sm`)
20
+ - **CoNLL-U Output**: Generate standard CoNLL-U format for dependency parsing
21
+ - **Visual Dependencies**: Interactive dependency tree visualization using displaCy
22
+ - **Download Options**: Save HTML visualizations for offline viewing
 
23
 
24
+ ## Usage
25
 
26
+ 1. Enter Greek text in the input field
27
+ 2. Choose your output format:
28
+ - **HTML**: Interactive dependency visualization
29
+ - **CoNLL-U**: Standard dependency parsing format
30
+ 3. Click "Parse Text" to process
31
 
32
+ ## CoNLL-U Format
 
 
 
 
33
 
34
+ The CoNLL-U format includes:
35
+ - Token ID
36
+ - Word form
37
+ - Lemma
38
+ - Universal POS tag
39
+ - Language-specific POS tag
40
+ - Morphological features
41
+ - Head token ID
42
+ - Dependency relation
43
+ - Enhanced dependencies
44
+ - Miscellaneous annotations
45
 
46
+ ## Examples
 
 
 
 
 
 
47
 
48
+ Try these Greek sentences:
49
+ - `Ο γάτος κοιμάται στον καναπέ.` (The cat sleeps on the sofa)
50
+ - Μαρία διαβάζει ένα βιβλίο στη βιβλιοθήκη.` (Maria reads a book in the library)
51
+ - `Τα παιδιά παίζουν ποδόσφαιρο στην αυλή.` (The children play football in the yard)
 
 
 
 
 
 
 
 
 
 
52
 
53
  ## Technical Details
54
 
55
+ - Built with Gradio for the web interface
56
+ - Uses spaCy 3.4+ with Greek language support
57
+ - Generates standard CoNLL-U format output
58
+ - Provides downloadable HTML visualizations
59
 
60
+ ## Requirements
 
 
 
 
 
 
61
 
62
+ - Python 3.8+
63
+ - spaCy with Greek model
64
+ - Gradio for web interface