Text Classification
Transformers
Safetensors
English
bert
question-answering
evaluation
text
text-embeddings-inference
Instructions to use zli12321/answer_equivalence_tiny_bert with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use zli12321/answer_equivalence_tiny_bert with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="zli12321/answer_equivalence_tiny_bert")# Load model directly from transformers import AutoTokenizer, AutoModelForSequenceClassification tokenizer = AutoTokenizer.from_pretrained("zli12321/answer_equivalence_tiny_bert") model = AutoModelForSequenceClassification.from_pretrained("zli12321/answer_equivalence_tiny_bert") - Notebooks
- Google Colab
- Kaggle
Update README.md
Browse files
README.md
CHANGED
|
@@ -286,14 +286,25 @@ Our fine-tuned models are available on Huggingface:
|
|
| 286 |
## 📄 Citation
|
| 287 |
|
| 288 |
```bibtex
|
| 289 |
-
@
|
| 290 |
-
|
| 291 |
-
|
| 292 |
-
|
| 293 |
-
|
| 294 |
-
|
| 295 |
-
|
| 296 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 297 |
}
|
| 298 |
```
|
| 299 |
|
|
|
|
| 286 |
## 📄 Citation
|
| 287 |
|
| 288 |
```bibtex
|
| 289 |
+
@inproceedings{li-etal-2024-pedants,
|
| 290 |
+
title = "{PEDANTS}: Cheap but Effective and Interpretable Answer Equivalence",
|
| 291 |
+
author = "Li, Zongxia and
|
| 292 |
+
Mondal, Ishani and
|
| 293 |
+
Nghiem, Huy and
|
| 294 |
+
Liang, Yijun and
|
| 295 |
+
Boyd-Graber, Jordan Lee",
|
| 296 |
+
editor = "Al-Onaizan, Yaser and
|
| 297 |
+
Bansal, Mohit and
|
| 298 |
+
Chen, Yun-Nung",
|
| 299 |
+
booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2024",
|
| 300 |
+
month = nov,
|
| 301 |
+
year = "2024",
|
| 302 |
+
address = "Miami, Florida, USA",
|
| 303 |
+
publisher = "Association for Computational Linguistics",
|
| 304 |
+
url = "https://aclanthology.org/2024.findings-emnlp.548/",
|
| 305 |
+
doi = "10.18653/v1/2024.findings-emnlp.548",
|
| 306 |
+
pages = "9373--9398",
|
| 307 |
+
abstract = "Question answering (QA) can only make progress if we know if an answer is correct, but current answer correctness (AC) metrics struggle with verbose, free-form answers from large language models (LLMs). There are two challenges with current short-form QA evaluations: a lack of diverse styles of evaluation data and an over-reliance on expensive and slow LLMs. LLM-based scorers correlate better with humans, but this expensive task has only been tested on limited QA datasets. We rectify these issues by providing rubrics and datasets for evaluating machine QA adopted from the Trivia community. We also propose an efficient, and interpretable QA evaluation that is more stable than an exact match and neural methods (BERTScore)."
|
| 308 |
}
|
| 309 |
```
|
| 310 |
|