Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,38 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# Text Summarization
|
| 2 |
+
This is a assignment of Applied Deep Learning which is a course of National Taiwan University(NTU).
|
| 3 |
+
### Task Description:Chinese News Summarization (Title Generation)
|
| 4 |
+
input(news content):
|
| 5 |
+
```
|
| 6 |
+
從小就很會念書的李悅寧, 在眾人殷殷期盼下,以榜首之姿進入臺大醫學院, 但始終忘不了對天文的熱情。大學四年級一場遠行後,她決心遠赴法國攻讀天文博士。 從小沒想過當老師的她,再度跌破眾人眼鏡返台任教,......
|
| 7 |
+
```
|
| 8 |
+
output(news title):
|
| 9 |
+
```
|
| 10 |
+
榜首進台大醫科卻休學 、27歲拿到法國天文博士 李悅寧跌破眾人眼鏡返台任教
|
| 11 |
+
```
|
| 12 |
+
### Objective
|
| 13 |
+
- Fine-tune a pre-trained model:[google/mt5-small](https://huggingface.co/google/mt5-small) to pass the baseline.
|
| 14 |
+
- Compare the difference between beam search, top k sampling, top p sampling, temperature.
|
| 15 |
+
```
|
| 16 |
+
Baseline(f1-score):rouge-1: 22.0, rouge-2: 8.5, rouge-L: 20.5
|
| 17 |
+
```
|
| 18 |
+
### Experiments
|
| 19 |
+
- Greedy
|
| 20 |
+
After the model generate the probility of every token as result, Greedy is the simplest way to choose the next word with most probable word(argmax).
|
| 21 |
+
However, there is a problem that it's easy to choose the duplicate word with Greedy strategy.
|
| 22 |
+
```
|
| 23 |
+
Greedy Result(f1-score):rouge-1: 15.7, rouge-2: 4.9, rouge-L: 14.8
|
| 24 |
+
```
|
| 25 |
+
- Beam Search
|
| 26 |
+
Beam Search strategy is keeping track of the k most probable sentences and finding the best one as a result.
|
| 27 |
+
Therefore, if beam size is setting as 1, it becomes Greedy. We can say that beam search kind of solves the problem of Greedy.
|
| 28 |
+
However, if beam size is too large, the result will turn into too generic and less relevant though the result is safe and "correct".
|
| 29 |
+
For example
|
| 30 |
+
```
|
| 31 |
+
I love to listen Taylor Swift's songs so I decide to participate the concert of Taylor.
|
| 32 |
+
```
|
| 33 |
+
- Top k Sampling
|
| 34 |
+
|
| 35 |
+
- Top p Sampling
|
| 36 |
+
|
| 37 |
+
- Temperature
|
| 38 |
+
|