File size: 1,453 Bytes
741f011
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e72bb13
741f011
 
 
 
 
 
 
6a9392e
741f011
 
 
6a9392e
741f011
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
---
license: llama3.1
base_model: meta-llama/Llama-3.1-8B-Instruct
library_name: transformers
tags:
  - drug-combination
  - relation-extraction
  - biomedical
  - llama
  - chain-of-thought
---

# RexDrug-Base

This is the SFT (Supervised Fine-Tuning) base model for **RexDrug**, a chain-of-thought reasoning model for biomedical drug combination relation extraction.

## Model Details

- **Base architecture**: [Llama-3.1-8B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct)
- **Fine-tuning method**: SFT with LoRA (merged)
- **Task**: Drug combination relation extraction from biomedical literature
- **Relation types**: POS (beneficial), NEG (harmful), COMB (neutral/mixed), NO_COMB (no combination)

## Usage

This model is intended to be used with the [RexDrug-adapter](https://huggingface.co/DUTIR-BioNLP/RexDrug-adapter) (LoRA adapter trained via GRPO). See the adapter repository for the full quick start guide.

```python
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel
import torch

model = AutoModelForCausalLM.from_pretrained(
    "DUTIR-BioNLP/RexDrug-base",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
model = PeftModel.from_pretrained(model, "DUTIR-BioNLP/RexDrug-adapter")
```

## License

This model is built upon Llama 3.1 and is subject to the [Llama 3.1 Community License Agreement](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/LICENSE).