zfj1998
/

A2Search-3B-Instruct

Question Answering

reinforcement-learning

Model card Files Files and versions

A2Search-3B-Instruct / README.md

zfj1998's picture

Update README.md

7bb786e verified 4 months ago

|

history blame contribute delete

442 Bytes

	---
	license: apache-2.0
	datasets:
	- RUC-NLPIR/FlashRAG_datasets
	language:
	- en
	metrics:
	- f1
	- recall
	base_model:
	- Qwen/Qwen2.5-3B-Instruct
	pipeline_tag: question-answering
	tags:
	- ambiguity
	- reinforcement-learning
	- agent
	---

	- This repository contains the RL-trained model accompanying our paper, A^2Search: Ambiguity-Aware Question Answering with Reinforcement Learning. More details are available at https://github.com/zfj1998/A2Search