Cooolder commited on
Commit
0250675
·
verified ·
1 Parent(s): 7113230

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -8
README.md CHANGED
@@ -1,26 +1,25 @@
1
  ---
2
  license: apache-2.0
3
  language:
4
- - en
5
  base_model:
6
  - Qwen/Qwen3-4B-Instruct-2507
7
  pipeline_tag: text-generation
8
  tags:
9
- - performance-prediction
10
- - llm-evaluation
11
- - meta-learning
12
  ---
13
 
14
- # SCOPE: LLM Performance Prediction Model
15
 
16
  SCOPE is a specialized model that predicts how a target LLM will perform on a given question. Given a target question and a set of anchor questions with known performance results, SCOPE predicts the **output length** and **correctness** of the target model's response.
17
 
18
  ## Model Description
19
 
20
  - **Task**: Performance prediction for LLMs
21
- - **Base Model**: Qwen3-4B
22
- - **Training**: Supervised Fine-Tuning (SFT) + Reinforcement Learning with Chain-of-Thought reasoning
23
- - **Input**: Target question + 5 anchor questions with performance data
24
  - **Output**: Predicted length (tokens) and correctness (yes/no)
25
 
26
  ## Intended Use
 
1
  ---
2
  license: apache-2.0
3
  language:
4
+ - multilingual
5
  base_model:
6
  - Qwen/Qwen3-4B-Instruct-2507
7
  pipeline_tag: text-generation
8
  tags:
9
+ - Model Routing
10
+ - LLM reasoning
 
11
  ---
12
 
13
+ # SCOPE: Scalable and Controllable Outcome Performance Estimator
14
 
15
  SCOPE is a specialized model that predicts how a target LLM will perform on a given question. Given a target question and a set of anchor questions with known performance results, SCOPE predicts the **output length** and **correctness** of the target model's response.
16
 
17
  ## Model Description
18
 
19
  - **Task**: Performance prediction for LLMs
20
+ - **Base Model**: Qwen/Qwen3-4B-Instruct-2507
21
+ - **Training**: Supervised Fine-Tuning (SFT) + Reinforcement Learning (GRPO)
22
+ - **Input**: Target question + k anchor questions with performance data
23
  - **Output**: Predicted length (tokens) and correctness (yes/no)
24
 
25
  ## Intended Use