teppei727 commited on
Commit
fb52b4c
·
1 Parent(s): 4fe7a78

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +59 -1
README.md CHANGED
@@ -9,7 +9,65 @@ metrics:
9
  library_name: transformers
10
  ---
11
 
 
12
 
13
  This model can classify the relation between the sentence pair of input.
14
 
15
- Now we are working on preparing the Model card. Please wait for a few days.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  library_name: transformers
10
  ---
11
 
12
+ # Descliption
13
 
14
  This model can classify the relation between the sentence pair of input.
15
 
16
+ Now we are working on preparing the Model card. Please wait for a few days.
17
+
18
+
19
+ The model trained from [bert-large-uncased](https://huggingface.co/bert-large-uncased]) on the dataset published in the paper;[Automatic Prediction of Discourse Connectives](https://arxiv.org/abs/1702.00992).
20
+
21
+ The dataset to make this model is based on English Wikipedia data and has 20 labels. However, this model will classify into 14 labels. This is because the 20-class data set was restructured to 14 classes to suit our research objective of "automatic slide generation. This distribution is shown below.
22
+
23
+
24
+ |Level 1|Level 2|Level 3|Connectives (20)|
25
+ |-------------|-----------------|------------------|--------------------|
26
+ | Temporal | Synchronous | | meanwhile |
27
+ | Temporal | Asynchronous | Precedence | then, |
28
+ | Temporal | Asynchronous | Precedence | finally, |
29
+ | Temporal | Asynchronous | Succession | by then |
30
+ | Contingency | Cause | Result | therefore |
31
+ | Comparison | Concession | Arg2-as-denier | however, |
32
+ | Comparison | Concession | Arg2-as-denier | nevertheless |
33
+ | Comparison | Contrast | | on the other hand, |
34
+ | Comparison | Contrast | | by contrast, |
35
+ | Expansion | Conjunction | | and |
36
+ | Expansion | Conjunction | | moreover |
37
+ | Expansion | Conjunction | | indeed |
38
+ | Expansion | Equivalence | | in other words |
39
+ | Expansion | Exception | Arg1-as-excpt | otherwise |
40
+ | Expansion | Instantiation | Arg2-as-instance | for example, |
41
+ | Expansion | Level-of-detail | Arg1-as-detail | overall, |
42
+ | Expansion | Level-of-detail | Arg2-as-detail | in particular, |
43
+ | Expansion | Substitution | Arg2-as-subst | instead |
44
+ | Expansion | Substitution | Arg2-as-subst | rather |
45
+
46
+ # Training
47
+
48
+ The model was trained using AutoModelForSequenceClassification.from_pretrained
49
+
50
+ ```
51
+ training_args = TrainingArguments(
52
+ output_dir = output_dir,
53
+ save_strategy="epoch",
54
+ num_train_epochs = 5,
55
+ per_device_train_batch_size=16,
56
+ per_device_eval_batch_size=32,
57
+ warmup_steps=0,
58
+ weight_decay=0.01,
59
+ logging_dir="./logs",
60
+ evaluation_strategy="epoch",
61
+ learning_rate=2e-5,
62
+ metric_for_best_model="f1",
63
+ load_best_model_at_end=True
64
+ )
65
+ ```
66
+
67
+ # Evaluation (14 labels and original 20 labels classification) using the dataset test split gives:
68
+
69
+ | Model | Macro F1 | Accuracy | Precision | Recall |
70
+ |--------------------------|-----------------|-----------------|------------------|---------------|
71
+ | 14 labels classification | 0.586 | 0.589 | 0.630 | 0.591 |
72
+ | 20 labels classification | 0.478 | 0.488 | 0.536 | 0.488 |
73
+