cedricbonhomme committed · Commit e60200d · verified · 1 Parent(s): bd40f72

End of training

Files changed (3)
  1. README.md +26 -45
  2. emissions.csv +2 -2
  3. model.safetensors +1 -1
README.md CHANGED
@@ -1,51 +1,37 @@
  ---
- base_model: hfl/chinese-macbert-base
- datasets:
- - CIRCL/Vulnerability-CNVD
  library_name: transformers
  license: apache-2.0
- metrics:
- - accuracy
+ base_model: hfl/chinese-macbert-base
  tags:
  - generated_from_trainer
- - text-classification
- - classification
- - nlp
- - chinese
- - vulnerability
- pipeline_tag: text-classification
- language: zh
+ metrics:
+ - accuracy
  model-index:
  - name: vulnerability-severity-classification-chinese-macbert-base
    results: []
  ---

- # VLAI: A RoBERTa-Based Model for Automated Vulnerability Severity Classification (Chinese Text)
-
- This model is a fine-tuned version of [hfl/chinese-macbert-base](https://huggingface.co/hfl/chinese-macbert-base) on the dataset [CIRCL/Vulnerability-CNVD](https://huggingface.co/datasets/CIRCL/Vulnerability-CNVD).
-
- For more information, visit the [Vulnerability-Lookup project page](https://vulnerability.circl.lu) or the [ML-Gateway GitHub repository](https://github.com/vulnerability-lookup/ML-Gateway), which demonstrates its usage in a FastAPI server.
-
- ## How to use
-
- You can use this model directly with the Hugging Face `transformers` library for text classification:
-
- ```python
- from transformers import pipeline
-
- classifier = pipeline(
-     "text-classification",
-     model="CIRCL/vulnerability-severity-classification-chinese-macbert-base"
- )
-
- # Example usage for a Chinese vulnerability description
- description_chinese = "TOTOLINK A3600R是中国吉翁电子(TOTOLINK)公司的一款6天线1200M无线路由器。TOTOLINK A3600R存在缓冲区溢出漏洞,该漏洞源于/cgi-bin/cstecgi.cgi文件的UploadCustomModule函数中的File参数未能正确验证输入数据的长度大小,攻击者可利用该漏洞在系统上执行任意代码或者导致拒绝服务。"
- result_chinese = classifier(description_chinese)
- print(result_chinese)
- # Expected output example: [{'label': '高', 'score': 0.9802}]
- ```
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # vulnerability-severity-classification-chinese-macbert-base
+
+ This model is a fine-tuned version of [hfl/chinese-macbert-base](https://huggingface.co/hfl/chinese-macbert-base) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.6086
+ - Accuracy: 0.7746
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed

  ## Training procedure
@@ -53,27 +39,22 @@ print(result_chinese)

  The following hyperparameters were used during training:
  - learning_rate: 3e-05
- - train_batch_size: 64
- - eval_batch_size: 64
+ - train_batch_size: 32
+ - eval_batch_size: 32
  - seed: 42
  - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
  - num_epochs: 5

- It achieves the following results on the evaluation set:
- - Loss: 0.6059
- - Accuracy: 0.7771
-
-
  ### Training results

- | Training Loss | Epoch | Step | Validation Loss | Accuracy |
- |:-------------:|:-----:|:----:|:---------------:|:--------:|
- | 0.5764        | 1.0   | 1772 | 0.6157          | 0.7462   |
- | 0.5644        | 2.0   | 3544 | 0.5618          | 0.7663   |
- | 0.4589        | 3.0   | 5316 | 0.5615          | 0.7781   |
- | 0.3881        | 4.0   | 7088 | 0.5791          | 0.7823   |
- | 0.3433        | 5.0   | 8860 | 0.6059          | 0.7771   |
+ | Training Loss | Epoch | Step  | Validation Loss | Accuracy |
+ |:-------------:|:-----:|:-----:|:---------------:|:--------:|
+ | 0.6059        | 1.0   | 3548  | 0.5956          | 0.7432   |
+ | 0.5083        | 2.0   | 7096  | 0.5697          | 0.7664   |
+ | 0.491         | 3.0   | 10644 | 0.5535          | 0.7730   |
+ | 0.4476        | 4.0   | 14192 | 0.5666          | 0.7790   |
+ | 0.3577        | 5.0   | 17740 | 0.6086          | 0.7746   |


  ### Framework versions
@@ -81,4 +62,4 @@ It achieves the following results on the evaluation set:
  - Transformers 4.57.3
  - Pytorch 2.9.1+cu128
  - Datasets 4.4.2
- - Tokenizers 0.22.1
+ - Tokenizers 0.22.2
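The hyperparameters listed in the card map one-to-one onto `transformers.TrainingArguments`. A minimal sketch of an equivalent configuration, assuming the standard `Trainer` API; the `output_dir` and the evaluation strategy are illustrative assumptions, not values taken from the actual training script:

```python
from transformers import TrainingArguments

# Mirrors the hyperparameters reported in the model card
# (assumption: everything else was left at Trainer defaults).
training_args = TrainingArguments(
    output_dir="vulnerability-severity-classification-chinese-macbert-base",  # illustrative
    learning_rate=3e-5,
    per_device_train_batch_size=32,
    per_device_eval_batch_size=32,
    seed=42,
    optim="adamw_torch_fused",   # OptimizerNames.ADAMW_TORCH_FUSED
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    num_train_epochs=5,
    eval_strategy="epoch",       # the card reports one validation pass per epoch
)
```

The step counts in the two results tables are consistent with the same training set: 64 × 1772 ≈ 32 × 3548 ≈ 113,500 examples per epoch, assuming the reported batch size is the effective one (no gradient accumulation).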
emissions.csv CHANGED
@@ -1,2 +1,2 @@
- timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
- 2026-01-03T12:51:21,codecarbon,b34798ef-0b78-4cee-90c2-8e8713063c5a,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,2547.9638344610576,0.127837422724597,5.0172385100449054e-05,42.5,635.5354141556119,755.7507977485657,0.030019676328265212,0.6507438997613697,0.5336937614800426,1.2144573375696777,Luxembourg,LUX,,,,Linux-6.8.0-90-generic-x86_64-with-glibc2.39,3.12.3,2.8.4,224,Intel(R) Xeon(R) Platinum 8480+,4,4 x NVIDIA L40S,6.1661,49.7498,2015.3354606628418,machine,N,1.0
+ timestamp,project_name,run_id,experiment_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,water_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,cpu_utilization_percent,gpu_utilization_percent,ram_utilization_percent,ram_used_gb,on_cloud,pue,wue
+ 2026-01-09T10:30:02,codecarbon,10fef713-6914-4471-bbfb-6264fe2c2ca7,5b0fa12a-3dd7-45bb-9766-cc326314d9f1,3486.109665968921,0.08144179050348038,2.3361798195423276e-05,70.0001582310907,663.0685817263877,70.0,0.06543725066806824,0.6428308792642952,0.06543004039795555,0.7736981703303191,0.0,Luxembourg,LUX,,,,Linux-6.8.0-88-generic-x86_64-with-glibc2.39,3.12.3,3.2.1,224,Intel(R) Xeon(R) Platinum 8480+,2,2 x NVIDIA H100 NVL,6.1661,49.7498,2015.3354606628418,machine,0.9989925158318941,73.37708693149108,1.1,21.73262945207015,N,1.0,0.0
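This file is written by codecarbon (upgraded here from 2.8.4 to 3.2.1, which is why the header gains the `water_consumed`, utilization, and `wue` columns). A minimal sketch of how such a row is typically produced, with `train()` as a hypothetical stand-in for the fine-tuning loop:

```python
from codecarbon import EmissionsTracker

# project_name and output_file mirror the values visible in the CSV above.
tracker = EmissionsTracker(project_name="codecarbon", output_file="emissions.csv")
tracker.start()
try:
    train()  # hypothetical stand-in for the actual training run
finally:
    emissions = tracker.stop()  # estimated kg CO2-eq; a row is appended to emissions.csv
    print(f"Estimated emissions: {emissions} kg CO2-eq")
```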
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:5f7155a033821ee610bc068c70a623f4d7b5cc045751bef79b5a4264f3b6e459
+ oid sha256:690b132fc5b94e270503cd6e31880decad98d2ba478d1c41b693c6c22d31787c
  size 409103316
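The lines above are Git LFS pointer fields: the repository stores only the SHA-256 (`oid`) and byte size of `model.safetensors`, while the blob itself lives in LFS. A downloaded copy can be checked against the new pointer; a minimal sketch:

```python
import hashlib

# Stream the file so the ~409 MB blob is never held in memory at once.
h = hashlib.sha256()
with open("model.safetensors", "rb") as f:
    for chunk in iter(lambda: f.read(1 << 20), b""):
        h.update(chunk)

expected = "690b132fc5b94e270503cd6e31880decad98d2ba478d1c41b693c6c22d31787c"
print("match:", h.hexdigest() == expected)
```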