HagalazAI committed on
Commit
b05cac6
·
verified ·
1 Parent(s): 9283895

Update README.md

Files changed (1)
  1. README.md +13 -13
README.md CHANGED
@@ -44,8 +44,6 @@ Detects **technical red-team / offensive security** text (English).
44
  | Other | 130 000 |
45
  | **Total** | **180 296** |
46
 
47
- Source: [Primus-FineWeb](https://huggingface.co/datasets/trendmicro-ailab/Primus-FineWeb) (filtered & hand-labelled).
48
-
49
  ---
50
 
51
  ## Model details
@@ -60,6 +58,19 @@ Source: [Primus-FineWeb](https://huggingface.co/datasets/trendmicro-ailab/Primus
60
 
61
  ---
62
 
63
  ## Quick start
64
 
65
  ```python
@@ -77,14 +88,3 @@ print(f"P(offensive) = {prob:.3f}")
77
 
78
  is_red = prob >= 0.515 # ← recommended threshold
79
  print("is_red:", is_red)
80
-
81
- ## Source & License
82
-
83
- This dataset is built from the Primus-FineWeb collection (trendmicro-ailab/Primus-FineWeb), which is itself made available under the Open Data Commons Attribution License 1.0 (ODC-By-1.0). When you redistribute or build on this data you **must**:
84
-
85
- 1. Include an attribution statement, e.g.:
86
- > “Contains data from trendmicro-ailab/Primus-FineWeb, used under ODC-By-1.0 (http://opendatacommons.org/licenses/by/1-0/).”
87
- 2. Keep any existing copyright or license notices intact.
88
- 3. Abide by [Common Crawl’s Terms of Use](https://commoncrawl.org/terms-of-use/) for the underlying crawled content (e.g. don’t use it for illegal or harmful activities).
89
-
90
- If you’re republishing any of the “hacking” or offensive-security tutorials, you’re perfectly free to do so—just follow those three steps above.
 
44
  | Other | 130 000 |
45
  | **Total** | **180 296** |
46
 
47
  ---
48
 
49
  ## Model details
 
58
 
59
  ---
60
 
61
+ ## Source & License
62
+
63
+ This dataset is built from the Primus-FineWeb collection (trendmicro-ailab/Primus-FineWeb), which is itself made available under the Open Data Commons Attribution License 1.0 (ODC-By-1.0). When you redistribute or build on this data you **must**:
64
+
65
+ 1. Include an attribution statement, e.g.:
66
+ > “Contains data from trendmicro-ailab/Primus-FineWeb, used under ODC-By-1.0 (http://opendatacommons.org/licenses/by/1-0/).”
67
+ 2. Keep any existing copyright or license notices intact.
68
+ 3. Abide by [Common Crawl’s Terms of Use](https://commoncrawl.org/terms-of-use/) for the underlying crawled content (e.g. don’t use it for illegal or harmful activities).
69
+
70
+ If you’re republishing any of the “hacking” or offensive-security tutorials, you’re perfectly free to do so—just follow those three steps above.
71
+
72
+ ---
73
+
74
  ## Quick start
75
 
76
  ```python
 
88
 
89
  is_red = prob >= 0.515 # ← recommended threshold
90
  print("is_red:", is_red)