File size: 2,015 Bytes
d7f85a0
 
4816edf
 
14ce5f0
4816edf
 
 
1222f83
 
 
 
 
 
 
 
 
198b690
d3c5c64
198b690
 
 
 
d3c5c64
 
198b690
d3c5c64
 
d2d6537
d3c5c64
 
14ce5f0
d2d6537
 
 
 
d3c5c64
d2d6537
d3c5c64
d2d6537
 
 
 
198b690
d3c5c64
d2d6537
 
d3c5c64
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9ac3ec4
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
---
license: mit
metrics:
- accuracy
- matthews_correlation
pipeline_tag: text-classification
datasets:
- 19kmunz/iot-23-preprocessed-minimumcolumns
widget:
- text: "8081 tcp S0 2 80 0"
  example_title: "malicious, label_1"
- text: "37215 tcp S0 2 80 0"
  example_title: "malicious, label_1"
- text: "67 udp S0 11 3608 0"
  example_title: "Benign, label_0"
- text: "0 icmp OTH 9 844 0"
  example_title: "Benign, label_0"
---
## introduction
This is an undergraduate course project in computer security.

The task is to fine tune the large model to achieve malicious network flow data detection.

## base model
bert-base-uncased

## dataset:
19kmunz/iot-23-preprocessed-minimumcolumns

## example prompt:
```markdown
8081 tcp S0 2 80 0
37215 tcp S0 2 80 0
52869 tcp S0 2 80 0
8080 tcp S0 2 80 0
80 tcp S0 2 80 0
```
The above are "malicious", which is "label_1".
```markdown
67 udp S0 11 3608 0
0 icmp OTH 9 844 0
136 icmp OTH 3 216 0
0 icmp OTH 8 648 0
134 icmp OTH 2 96 0
```
The above are "Benign", which is "label_0".

## accuracy
```markdown
	Training Loss	Valid. Loss	Valid. Accur.
epoch					
1	0.288545	0.190351	0.929988
2	0.147658	0.154426	0.943510
3	0.108059	0.173112	0.943510
4	0.092468	0.161035	0.947416
```

## MCC score: 0.816

The "Total MCC" refers to the Matthews Correlation Coefficient (MCC), typically used to assess the quality of predictions in binary classification problems. 

The MCC value ranges from -1 to 1, where 1 signifies perfect predictions, 0 indicates predictions similar to random chance, and -1 denotes completely opposite predictions.

A model with an MCC value of 0.816 can be considered quite good. This value being close to 1 implies that the model has a high predictive capability and can classify samples with considerable accuracy. A higher MCC value closer to 1 indicates stronger predictive ability in the model.

In summary, an MCC value of 0.816 indicates that the model demonstrates a high level of accuracy and predictive capability in binary classification tasks.