Llama-3.2-3B-Instruct — Multi-Hop BES

Paper Link: https://arxiv.org/abs/2605.28814

Multi-hop retrieval-augmented QA agent fine-tuned from meta-llama/Llama-3.2-3B-Instruct with Bidirectional Evolutionary Search (BES) on the 3–4-hop subset of MuSiQue.

For the 8B variant see Xkev/Llama-3.1-8B-Instruct-multihop-BES.

Training

  • Base model: meta-llama/Llama-3.2-3B-Instruct
  • Dataset: 3–4-hop subset of MuSiQue (~5.5k examples)
  • Retrieval: E5 + FAISS over wiki-18

Intended use

Research on multi-hop reasoning and post-training. Not intended for general dialog or production.

License

Llama 3.2 Community License for fine-tuning artifacts. Base model meta-llama/Llama-3.2-3B-Instruct is governed by Meta's Llama 3.2 License, which still applies to derived weights.

Downloads last month
-
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Xkev/Llama-3.2-3B-Instruct-multihop-BES

Finetuned
(1600)
this model

Dataset used to train Xkev/Llama-3.2-3B-Instruct-multihop-BES

Collection including Xkev/Llama-3.2-3B-Instruct-multihop-BES

Paper for Xkev/Llama-3.2-3B-Instruct-multihop-BES