Spaces:

UniversalCEFR
/

README

Configuration error

josephimperial commited on May 26, 2025

Commit

5856947

verified ·

1 Parent(s): 694f252

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -2,19 +2,7 @@
 UniversalCEFR is a largescale, multilingual, multidimensional dataset comprising of texts annotated according to the [CEFR (Common European Framework of Reference)](https://www.coe.int/en/web/common-european-framework-reference-languages/level-descriptions). The collection comprises of a total of 505,807 CEFR-labeled texts in 13 languages as listed below:
- - English (en)
- - Spanish (es)
- - German (de)
- - Dutch (nl)
- - Czech (cs)
- - Italian (it)
- - French (fr)
- - Estonian (et)
- - Portuguese (pt)
- - Arabic (ar)
- - Hindi (hi)
- - Russian (ru)
- - Welsh (cy)
 ## UniversalCEFR Data Format / Schema
 To ensure interoperability, transformation, and machine readability, adopted **standardised JSON format** for each CEFR-labeled text. These fields include the source dataset, language, granularity (document, paragraph, sentence, discourse), production category (learner or reference), and license.

 UniversalCEFR is a largescale, multilingual, multidimensional dataset comprising of texts annotated according to the [CEFR (Common European Framework of Reference)](https://www.coe.int/en/web/common-european-framework-reference-languages/level-descriptions). The collection comprises of a total of 505,807 CEFR-labeled texts in 13 languages as listed below:
+English (en), Spanish (es), German (de), Dutch (nl), Czech (cs), Italian (it), French (fr), Estonian (et), Portuguese (pt), Arabic (ar), Hindi (hi), Russian (ru), Welsh (cy)
 ## UniversalCEFR Data Format / Schema
 To ensure interoperability, transformation, and machine readability, adopted **standardised JSON format** for each CEFR-labeled text. These fields include the source dataset, language, granularity (document, paragraph, sentence, discourse), production category (learner or reference), and license.