beyond
/

genius-large

text2text-generation

conditional text generation

sketch-based text generation

data augmentation

Model card Files Files and versions

beyond commited on Nov 4, 2022

Commit

82a0d21

·

1 Parent(s): a9ab535

Update README.md

Files changed (1) hide show

README.md +14 -9

README.md CHANGED Viewed

@@ -31,17 +31,22 @@ inference:
 # SEGA-large model
-**SEGA: SkEtch-based Generative Augmentation**
 **SEGA** is a **general text augmentation model** that can be used for data augmentation for **various NLP tasks** (including sentiment analysis, topic classification, NER, and QA). SEGA uses an encoder-decoder structure (based on the BART architecture) and is pre-trained on the `C4-realnewslike` corpus.
 ![sega-illustration](https://cdn.jsdelivr.net/gh/beyondguo/mdnice_pictures/typora/sega-main-illustration.png)
-- Paper: [this paper](to_be_added)
-- Github: [this repository](to_be_added).
 ### How to use
 ```python
@@ -65,11 +70,11 @@ Output:
 | Model | #params | Language |
 |------------------------|--------------------------------|-------|
 | [`sega-large`]() | xM   | English |
-| [`sega-base`]()  | xM    | English |
-| [`sega-small`]()        | xM    | English |
-| [`sega-large-chinese`]() | xM    |  Chinese |
-| [`sega-base-chinese`]() | xM    | Chinese |
-| [`sega-small-chinese`]() | xM | Chinese |
 ## Data Augmentation for Text Classification Tasks:

 # SEGA-large model
+**SEGA: SkEtch-based Generative Augmentation** \
+**基于草稿的生成式增强模型**
 **SEGA** is a **general text augmentation model** that can be used for data augmentation for **various NLP tasks** (including sentiment analysis, topic classification, NER, and QA). SEGA uses an encoder-decoder structure (based on the BART architecture) and is pre-trained on the `C4-realnewslike` corpus.
 ![sega-illustration](https://cdn.jsdelivr.net/gh/beyondguo/mdnice_pictures/typora/sega-main-illustration.png)
+- Paper: [coming soon](to_be_added)
+- GitHub: [coming soon](to_be_added).
+**SEGA** is able to write complete paragraphs given a sketch (or framework), which can be composed of:
+- keywords /key-phrases, like [NLP | AI | computer science]
+- spans, like [Conference on Empirical Methods | submission of research papers]
+- sentences, like [I really like machine learning | I work at Google since last year]
+- all mixup~
 ### How to use
 ```python
 | Model | #params | Language |
 |------------------------|--------------------------------|-------|
 | [`sega-large`]() | xM   | English |
+| [`sega-base`(coming soon)]()  | xM    | English |
+| [`sega-small`(coming soon)]()        | xM    | English |
+| [`sega-large-chinese`(coming soon)]() | xM    |  Chinese |
+| [`sega-base-chinese`(coming soon)]() | xM    | Chinese |
+| [`sega-small-chinese`(coming soon)]() | xM | Chinese |
 ## Data Augmentation for Text Classification Tasks: