Create model card
Browse files
README.md
ADDED
|
@@ -0,0 +1,134 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
pipeline_tag: tabular-classification
|
| 3 |
+
|
| 4 |
+
library_name: sklearn
|
| 5 |
+
|
| 6 |
+
tags:
|
| 7 |
+
- european-values
|
| 8 |
+
|
| 9 |
+
license: apache-2.0
|
| 10 |
+
|
| 11 |
+
---
|
| 12 |
+
|
| 13 |
+
# European Values Evaluation Pipeline
|
| 14 |
+
|
| 15 |
+
This repository contains a trained scikit-learn pipeline for evaluating the European
|
| 16 |
+
values of a large language model, and has been trained on data from the [European Values
|
| 17 |
+
Study](https://europeanvaluesstudy.eu/).
|
| 18 |
+
|
| 19 |
+
|
| 20 |
+
## Usage
|
| 21 |
+
|
| 22 |
+
You can use this pipeline to evaluate the European values of a large language model by
|
| 23 |
+
passing the survey responses to the `transform` method of the pipeline. The output will
|
| 24 |
+
be a score between 0% and 100%, where 100% indicates a perfect match with the European
|
| 25 |
+
values.
|
| 26 |
+
|
| 27 |
+
|
| 28 |
+
### Example
|
| 29 |
+
|
| 30 |
+
```python
|
| 31 |
+
import cloudpickle
|
| 32 |
+
from huggingface_hub import snapshot_download
|
| 33 |
+
|
| 34 |
+
pipeline_dir = snapshot_download(repo_id="EuroEval/european-values-pipeline")
|
| 35 |
+
with open(f"{pipeline_dir}/pipeline.pkl", "rb") as f:
|
| 36 |
+
pipeline = cloudpickle.load(f)
|
| 37 |
+
survey_response = [1, 5, 2, ..., 4] # Example survey response to 53 questions
|
| 38 |
+
score = pipeline.transform([survey_response])[0].item()
|
| 39 |
+
print(f'European values score: {score:.2%}')
|
| 40 |
+
```
|
| 41 |
+
|
| 42 |
+
|
| 43 |
+
## Questions Used
|
| 44 |
+
|
| 45 |
+
The pipeline has been trained on 53 selected questions from the European Values Study,
|
| 46 |
+
which has been chosen based on an optimisation procedure that maximises the agreement on
|
| 47 |
+
the questions across the EU countries. The question IDs are as follows:
|
| 48 |
+
|
| 49 |
+
| Question ID | Choice | Question Title |
|
| 50 |
+
|-------------|--------|----------------|
|
| 51 |
+
| F025 | 1 | Religious denomination: Major groups |
|
| 52 |
+
| F025 | 5 | Religious denomination: Major groups |
|
| 53 |
+
| A124_09 | NA | Neighbours: Homosexuals |
|
| 54 |
+
| F025 | 3 | Religious denomination: Major groups |
|
| 55 |
+
| F118 | NA | Justifiable: Homosexuality |
|
| 56 |
+
| D081 | NA | Homosexual couples are as good parents as other couples |
|
| 57 |
+
| C001_01 | 1 | Jobs scarce: Men should have more right to a job than women (5-point scale) |
|
| 58 |
+
| F122 | NA | Justifiable: Euthanasia |
|
| 59 |
+
| E025 | NA | Political action: Signing a petition |
|
| 60 |
+
| D059 | NA | Men make better political leaders than women do |
|
| 61 |
+
| D054 | NA | One of main goals in life has been to make my parents proud |
|
| 62 |
+
| D078 | NA | Men make better business executives than women do |
|
| 63 |
+
| D026_05 | NA | It is child's duty to take care of ill parent |
|
| 64 |
+
| E069_01 | NA | Confidence: Churches |
|
| 65 |
+
| C041 | NA | Work should come first even if it means less spare time |
|
| 66 |
+
| E003 | 4 | Aims of respondent: First choice |
|
| 67 |
+
| E116 | NA | Political system: Having the army rule |
|
| 68 |
+
| G007_36B | NA | Trust: People of another nationality (b) |
|
| 69 |
+
| G007_35B | NA | Trust: People of another religion (b) |
|
| 70 |
+
| E228 | NA | Democracy: The army takes over when government is incompetent |
|
| 71 |
+
| E001 | 2 | Aims of country: First choice |
|
| 72 |
+
| E265_08 | NA | How often in country’s elections: Voters are threatened with violence at the polls |
|
| 73 |
+
| E114 | NA | Political system: Having a strong leader |
|
| 74 |
+
| E265_01 | NA | How often in country’s elections: Votes are counted fairly |
|
| 75 |
+
| C039 | NA | Work is a duty towards society |
|
| 76 |
+
| E233 | NA | Democracy: Women have the same rights as men |
|
| 77 |
+
| E233B | NA | Democracy: People obey their rulers |
|
| 78 |
+
| G062 | NA | How close you feel: Continent (e.g., Europe, Asia, etc.) |
|
| 79 |
+
| E028 | NA | Political action: Joining unofficial strikes |
|
| 80 |
+
| E265_07 | NA | How often in country’s election: Rich people buy elections |
|
| 81 |
+
| E265_06 | NA | How often in country’s elections: Election officials are fair |
|
| 82 |
+
| E265_02 | NA | How often in country’s elections: Opposition candidates are prevented from running |
|
| 83 |
+
| A080_01 | NA | Member: Belong to humanitarian or charitable organization |
|
| 84 |
+
| E069_02 | NA | Confidence: Armed forces |
|
| 85 |
+
| A080_02 | NA | Member: Belong to self-help group or mutual aid group |
|
| 86 |
+
| G052 | NA | Evaluate the impact of immigrants on the development of your country |
|
| 87 |
+
| E037 | NA | Government responsibility |
|
| 88 |
+
| A072 | NA | Member: Belong to professional associations |
|
| 89 |
+
| G005 | NA | Citizen of: Country |
|
| 90 |
+
| G063 | NA | How close you feel: World |
|
| 91 |
+
| A068 | NA | Member: Belong to political parties |
|
| 92 |
+
| A078 | NA | Member: Belong to consumer groups |
|
| 93 |
+
| A079 | NA | Member: Belong to other groups |
|
| 94 |
+
| E036 | NA | Private vs state ownership of business |
|
| 95 |
+
| A003 | NA | Important in life: Leisure time |
|
| 96 |
+
| G257 | NA | How close do you feel: To country |
|
| 97 |
+
| D001_B | NA | How much do you trust your family (4-point scale) |
|
| 98 |
+
| F025 | 8 | Religious denomination: Major groups |
|
| 99 |
+
| F025 | 7 | Religious denomination: Major groups |
|
| 100 |
+
| E264 | 4 | Vote in elections: National level |
|
| 101 |
+
| A009 | NA | State of health: Subjective |
|
| 102 |
+
| E001 | 4 | Aims of country: First choice |
|
| 103 |
+
| F025 | 4 | Religious denomination: Major groups |
|
| 104 |
+
|
| 105 |
+
|
| 106 |
+
## Pipeline Components
|
| 107 |
+
|
| 108 |
+
- **Scaler**: MinMaxScaler for normalising the input data to the range [0, 1].
|
| 109 |
+
- **Model**: KernelDensity model that has been fitted to the EU training data and can
|
| 110 |
+
measure the log-likelihood of a scaled survey response.
|
| 111 |
+
- **Scorer**: A custom SigmoidTransformer component which transforms the log-likelihoods
|
| 112 |
+
into a score between 0% and 100%, which is a parametrised sigmoid function (slope and
|
| 113 |
+
center fitted on the validation data).
|
| 114 |
+
|
| 115 |
+
|
| 116 |
+
## License
|
| 117 |
+
|
| 118 |
+
This pipeline is licensed under the Apache License 2.0. You can use it for both personal
|
| 119 |
+
and commercial purposes, but you must include the license file in any distribution of
|
| 120 |
+
the pipeline.
|
| 121 |
+
|
| 122 |
+
|
| 123 |
+
## Citation
|
| 124 |
+
|
| 125 |
+
If you use this pipeline in your research, please cite the following paper:
|
| 126 |
+
|
| 127 |
+
```bibtex
|
| 128 |
+
@article{simonsen2025european,
|
| 129 |
+
title={Evaluating European Values in Large Language Models},
|
| 130 |
+
author={Simonsen, Annika and Müller-Eberstein, Maximilian and van der Goot, Rob and Einarsson, Hafsteinn and Smart, Dan Saattrup},
|
| 131 |
+
journal={arXiv preprint arXiv:XXXX.XXXXX},
|
| 132 |
+
year={2025}
|
| 133 |
+
}
|
| 134 |
+
```
|