Wesleythu commited on
Commit
a2301d9
·
verified ·
1 Parent(s): 6563ff4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -31,7 +31,7 @@ WildReward is trained using **ordinal regression** (CORAL-like approach) on the
31
  - **Source:** WildChat - large-scale human-LLM interactions
32
  - **Labeling:** 5-point ordinal scale based on user satisfaction signals
33
  - **Filtering:** Two-stage refinement including implicit feedback mining and refusal validation
34
- - **License:** [Specify your dataset license]
35
 
36
  ## Usage
37
 
 
31
  - **Source:** WildChat - large-scale human-LLM interactions
32
  - **Labeling:** 5-point ordinal scale based on user satisfaction signals
33
  - **Filtering:** Two-stage refinement including implicit feedback mining and refusal validation
34
+ - **License:** MIT
35
 
36
  ## Usage
37