--- base_model: meta-llama/Llama-3.1-8B-Instruct library_name: transformers license: llama3.1 pipeline_tag: text-generation tags: - drug-combination - relation-extraction - biomedical - llama - chain-of-thought --- # RexDrug-Base This is the SFT (Supervised Fine-Tuning) base model for **RexDrug**, a chain-of-thought reasoning model for biomedical drug combination relation extraction. For more details, please refer to the paper: [RexDrug: Reliable Multi-Drug Combination Extraction through Reasoning-Enhanced LLMs](https://huggingface.co/papers/2603.08166). **Official Code:** [DUTIR-BioNLP/RexDrug](https://github.com/DUTIR-BioNLP/RexDrug) ## Model Details - **Base architecture**: [Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) - **Fine-tuning method**: SFT with LoRA (merged) - **Task**: Drug combination relation extraction from biomedical literature - **Relation types**: POS (beneficial), NEG (harmful), COMB (neutral/mixed), NO_COMB (no combination) ## Usage This model is intended to be used with the [RexDrug-adapter](https://huggingface.co/dlutIR/RexDrug-adapter) (LoRA adapter trained via GRPO). See the adapter repository for the full quick start guide. ```python from transformers import AutoTokenizer, AutoModelForCausalLM from peft import PeftModel import torch model = AutoModelForCausalLM.from_pretrained( "dlutIR/RexDrug-base", torch_dtype=torch.bfloat16, device_map="auto", ) model = PeftModel.from_pretrained(model, "dlutIR/RexDrug-adapter") ``` ## License This model is built upon Llama 3.1 and is subject to the [Llama 3.1 Community License Agreement](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE).