EunsuKim commited on
Commit
a8e34e7
·
verified ·
1 Parent(s): adf1b40

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +37 -19
README.md CHANGED
@@ -1,3 +1,6 @@
 
 
 
1
  ---
2
  license: apache-2.0
3
  language:
@@ -15,20 +18,14 @@ pipeline_tag: text-classification
15
 
16
  # BenchHub-Cat-7b
17
 
18
- **BenchHub-Cat-7b** is a 7B parameter instruction-tuned language model that performs structured classification of natural language queries into three dimensions:
19
-
20
- - `subject`: Topic domain of the query (e.g., law, health, travel)
21
- - `skill`: Type of skill or task (e.g., reasoning, explanation, comparison)
22
- - `target`: General or cultural-specific target audience
23
-
24
- It is based on the Qwen2.5-7B-Instruct architecture and trained on a mixture of synthetic and GPT-generated instruction data.
25
 
26
  ## 🔧 Model Details
27
 
28
- - **Base Model**: Qwen2.5-7B-Instruct
29
- - **Task**: Structured triple-label classification
30
- - **Prompt Format**: Instruction-style with output structure
31
- - **Training Framework**: Axolotl + DeepSpeed ZeRO-3
32
 
33
  ## 🧪 Training Configuration
34
 
@@ -41,19 +38,40 @@ It is based on the Qwen2.5-7B-Instruct architecture and trained on a mixture of
41
  | Scheduler | Cosine Decay |
42
  | Warmup Ratio | 0.05 |
43
  | Optimizer | Method from [19] |
 
44
  | Hardware | 4× A6000 48GB GPUs |
45
  | Training Time | ~5 hours per run |
46
 
47
  ## 🧠 Intended Use
48
 
49
- **Input**: Open-ended natural language queries
50
- **Output**: Structured classification result with 3 fields
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
51
 
52
- ### Example Categories:
53
- - `subject`: education, health, history, law, etc.
54
- - `skill`: reasoning, recall, summarization, etc.
55
- - `target`: general, cultural-specific
56
 
57
- ### ✨ Example Prompt & Output
58
 
59
- #### 📝 Prompt
 
1
+
2
+
3
+ ````markdown
4
  ---
5
  license: apache-2.0
6
  language:
 
18
 
19
  # BenchHub-Cat-7b
20
 
21
+ **BenchHub-Cat-7b** is a category classification model based on **Qwen2.5-7B**, fine-tuned to assign natural language queries to structured category triplets: `(subject, skill, target)`.
 
 
 
 
 
 
22
 
23
  ## 🔧 Model Details
24
 
25
+ - **Base Model**: [Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct)
26
+ - **Task**: Structured multi-label classification (triple: subject, skill, target)
27
+ - **Prompting Style**: Instruction-style with expected format output
28
+ - **Training Framework**: [Axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) + DeepSpeed ZeRO-3
29
 
30
  ## 🧪 Training Configuration
31
 
 
38
  | Scheduler | Cosine Decay |
39
  | Warmup Ratio | 0.05 |
40
  | Optimizer | Method from [19] |
41
+ | Trainer | DeepSpeed ZeRO-3 |
42
  | Hardware | 4× A6000 48GB GPUs |
43
  | Training Time | ~5 hours per run |
44
 
45
  ## 🧠 Intended Use
46
 
47
+ **Input**: Natural language question or instruction
48
+ **Output**: Triplet `(subject, skill, target)`, such as:
49
+
50
+ ```yaml
51
+ { "subject_type": "history",
52
+ "task_type": "reasoning",
53
+ "target_type": "korea"}
54
+ ````
55
+
56
+
57
+ ## ✨ Prompt Example
58
+
59
+ ```
60
+ ### Instruction:
61
+ Classify the following query into subject, skill, and target.
62
+
63
+ ### Query:
64
+ How did Confucianism shape education in East Asia?
65
+
66
+ ### Output:
67
+ { "subject_type": "history",
68
+ "task_type": "reasoning",
69
+ "target_type": "korea"}
70
+ ```
71
+
72
+ ## 📜 License
73
 
74
+ Apache 2.0
 
 
 
75
 
 
76
 
77
+ ````