exploitintel commited on
Commit
4dc95b5
·
verified ·
1 Parent(s): 8697889

Update self-references to exploitintel (username change)

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -2,7 +2,7 @@
2
  license: apache-2.0
3
  base_model: Qwen/Qwen3-8B
4
  datasets:
5
- - eiphuggincve/cve-cwe-consensus
6
  language:
7
  - en
8
  tags:
@@ -24,7 +24,7 @@ ID(s)** it corresponds to. The LoRA adapter is merged into the base and released
24
  loads directly with `transformers`.
25
 
26
  Trained only on labels where **NVD and the CNA agree** after roll-up to **CWE View-1003** — see the
27
- [`cve-cwe-consensus`](https://huggingface.co/datasets/eiphuggincve/cve-cwe-consensus) dataset.
28
 
29
  ## Results (held-out test split, 6,802 rows)
30
 
@@ -55,7 +55,7 @@ actually learned rather than drowned out.
55
  import torch
56
  from transformers import AutoModelForCausalLM, AutoTokenizer
57
 
58
- mid = "eiphuggincve/cve-cwe-qwen3-8b"
59
  tok = AutoTokenizer.from_pretrained(mid)
60
  model = AutoModelForCausalLM.from_pretrained(mid, torch_dtype="auto", device_map="auto")
61
 
@@ -75,7 +75,7 @@ print(tok.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))
75
 
76
  - **Base:** `Qwen/Qwen3-8B` (trained 4-bit via `unsloth/qwen3-8b-unsloth-bnb-4bit`)
77
  - **Method:** QLoRA (4-bit) with Unsloth, merged to 16-bit · released checkpoint: **checkpoint-960** (final; eval loss declined monotonically through training)
78
- - **Dataset:** [`eiphuggincve/cve-cwe-consensus`](https://huggingface.co/datasets/eiphuggincve/cve-cwe-consensus) — 69,386 rows (55,810 / 6,774 / 6,802), majority CWEs capped at 2,500
79
  - **Epochs:** 2 · **Context:** 512 · **LR:** 2e-4 · **Optimizer:** AdamW 8-bit · **Scheduler:** linear · **Batch:** 32 · **Weight decay:** 0.01 · **Seed:** 3407
80
  - **LoRA:** rank 16 / alpha 32 / dropout 0 · **Packing:** on · **Train-on-completions-only:** off
81
 
 
2
  license: apache-2.0
3
  base_model: Qwen/Qwen3-8B
4
  datasets:
5
+ - exploitintel/cve-cwe-consensus
6
  language:
7
  - en
8
  tags:
 
24
  loads directly with `transformers`.
25
 
26
  Trained only on labels where **NVD and the CNA agree** after roll-up to **CWE View-1003** — see the
27
+ [`cve-cwe-consensus`](https://huggingface.co/datasets/exploitintel/cve-cwe-consensus) dataset.
28
 
29
  ## Results (held-out test split, 6,802 rows)
30
 
 
55
  import torch
56
  from transformers import AutoModelForCausalLM, AutoTokenizer
57
 
58
+ mid = "exploitintel/cve-cwe-qwen3-8b"
59
  tok = AutoTokenizer.from_pretrained(mid)
60
  model = AutoModelForCausalLM.from_pretrained(mid, torch_dtype="auto", device_map="auto")
61
 
 
75
 
76
  - **Base:** `Qwen/Qwen3-8B` (trained 4-bit via `unsloth/qwen3-8b-unsloth-bnb-4bit`)
77
  - **Method:** QLoRA (4-bit) with Unsloth, merged to 16-bit · released checkpoint: **checkpoint-960** (final; eval loss declined monotonically through training)
78
+ - **Dataset:** [`exploitintel/cve-cwe-consensus`](https://huggingface.co/datasets/exploitintel/cve-cwe-consensus) — 69,386 rows (55,810 / 6,774 / 6,802), majority CWEs capped at 2,500
79
  - **Epochs:** 2 · **Context:** 512 · **LR:** 2e-4 · **Optimizer:** AdamW 8-bit · **Scheduler:** linear · **Batch:** 32 · **Weight decay:** 0.01 · **Seed:** 3407
80
  - **LoRA:** rank 16 / alpha 32 / dropout 0 · **Packing:** on · **Train-on-completions-only:** off
81