beyond commited on
Commit
82a0d21
·
1 Parent(s): a9ab535

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -9
README.md CHANGED
@@ -31,17 +31,22 @@ inference:
31
 
32
  # SEGA-large model
33
 
34
- **SEGA: SkEtch-based Generative Augmentation**
 
35
 
36
  **SEGA** is a **general text augmentation model** that can be used for data augmentation for **various NLP tasks** (including sentiment analysis, topic classification, NER, and QA). SEGA uses an encoder-decoder structure (based on the BART architecture) and is pre-trained on the `C4-realnewslike` corpus.
37
 
38
 
39
  ![sega-illustration](https://cdn.jsdelivr.net/gh/beyondguo/mdnice_pictures/typora/sega-main-illustration.png)
40
 
41
- - Paper: [this paper](to_be_added)
42
- - Github: [this repository](to_be_added).
43
-
44
 
 
 
 
 
 
45
 
46
  ### How to use
47
  ```python
@@ -65,11 +70,11 @@ Output:
65
  | Model | #params | Language |
66
  |------------------------|--------------------------------|-------|
67
  | [`sega-large`]() | xM | English |
68
- | [`sega-base`]() | xM | English |
69
- | [`sega-small`]() | xM | English |
70
- | [`sega-large-chinese`]() | xM | Chinese |
71
- | [`sega-base-chinese`]() | xM | Chinese |
72
- | [`sega-small-chinese`]() | xM | Chinese |
73
 
74
 
75
  ## Data Augmentation for Text Classification Tasks:
 
31
 
32
  # SEGA-large model
33
 
34
+ **SEGA: SkEtch-based Generative Augmentation** \
35
+ **基于草稿的生成式增强模型**
36
 
37
  **SEGA** is a **general text augmentation model** that can be used for data augmentation for **various NLP tasks** (including sentiment analysis, topic classification, NER, and QA). SEGA uses an encoder-decoder structure (based on the BART architecture) and is pre-trained on the `C4-realnewslike` corpus.
38
 
39
 
40
  ![sega-illustration](https://cdn.jsdelivr.net/gh/beyondguo/mdnice_pictures/typora/sega-main-illustration.png)
41
 
42
+ - Paper: [coming soon](to_be_added)
43
+ - GitHub: [coming soon](to_be_added).
 
44
 
45
+ **SEGA** is able to write complete paragraphs given a sketch (or framework), which can be composed of:
46
+ - keywords /key-phrases, like [NLP | AI | computer science]
47
+ - spans, like [Conference on Empirical Methods | submission of research papers]
48
+ - sentences, like [I really like machine learning | I work at Google since last year]
49
+ - all mixup~
50
 
51
  ### How to use
52
  ```python
 
70
  | Model | #params | Language |
71
  |------------------------|--------------------------------|-------|
72
  | [`sega-large`]() | xM | English |
73
+ | [`sega-base`(coming soon)]() | xM | English |
74
+ | [`sega-small`(coming soon)]() | xM | English |
75
+ | [`sega-large-chinese`(coming soon)]() | xM | Chinese |
76
+ | [`sega-base-chinese`(coming soon)]() | xM | Chinese |
77
+ | [`sega-small-chinese`(coming soon)]() | xM | Chinese |
78
 
79
 
80
  ## Data Augmentation for Text Classification Tasks: