Flames: Benchmarking Value Alignment of LLMs in Chinese
Paper • 2311.06899 • Published • 2
How to use CaasiHUANG/flames-scorer with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-classification", model="CaasiHUANG/flames-scorer", trust_remote_code=True) # Load model directly
from transformers import AutoModelForSequenceClassification
model = AutoModelForSequenceClassification.from_pretrained("CaasiHUANG/flames-scorer", trust_remote_code=True, dtype="auto")This is the specified scorer for Flames benchmark – a highly adversarial benchmark in Chinese for LLM's value alignment evaluation. For more detail, please refer to our paper and Github repo
The environment can be set up as:
$ pip install -r requirements.txt
And you can use infer.py to evaluate your model:
python infer.py --data_path YOUR_DATA_FILE.jsonl
The flames-scorer can be loaded by:
from tokenization_internlm import InternLMTokenizer
from modeling_internlm import InternLMForSequenceClassification
tokenizer = InternLMTokenizer.from_pretrained("CaasiHUANG/flames-scorer", trust_remote_code=True)
model = InternLMForSequenceClassification.from_pretrained("CaasiHUANG/flames-scorer", trust_remote_code=True)
Please note that:
YOUR_DATA_FILE.jsonl includes the fields: "dimension", "prompt", and "response".YOUR_DATA_FILE.jsonl.