RobustRDP / README.md
Jingcz's picture
Upload README.md with huggingface_hub
7d7ccef verified
|
Raw
History Blame Contribute Delete
1.93 kB
---
license: other
library_name: transformers
tags:
- chemistry
- molecular-structure
- reaction-diagram
- robustrdp
base_model: Qwen/Qwen2.5-VL-3B-Instruct
pipeline_tag: image-text-to-text
---
# RobustRDP
Fine-tuned model for the paper: *RobustRDP: Advancing Reaction Diagram Parsing via Synthetic-to-Real Data Scaling and Robustness-Oriented Training*.
## Description
This model is a fine-tuned checkpoint based on **Qwen2.5-VL-3B-Instruct**, trained with a three-stage pipeline:
1. **Pretrain stage**: Synthetic data pretraining on large-scale synthetic reaction diagrams.
2. **SFT stage**: Supervised fine-tuning with three specialized tasks:
- **Vanilla Reaction Parsing (VRP)**: Standard reaction diagram parsing
- **Region-Guided Reaction Parsing (RGRP)**: Region-aware parsing with spatial guidance
- **Prefix-Perturbed Reaction Parsing (PPRP)**: Robustness-oriented parsing with prefix perturbations
3. **DPO stage**: Direct Preference Optimization to further align model outputs with ground-truth annotations.
## Training Details
| Config | Pretrain | SFT | DPO |
|--------|----------|-----|-----|
| Base Model | Qwen2.5-VL-3B-Instruct | Qwen2.5-VL-3B-Instruct | Qwen2.5-VL-3B-Instruct |
| Learning Rate | 1.0×10⁻⁶ | 1.0×10⁻⁵ | 3.0×10⁻⁷ |
| Batch Size | 16 | 4 | 64 |
| Epochs | 1 | 1 | 1 |
| Scheduler | Cosine (warmup 0.03) | Cosine (warmup 0.03) | Cosine (warmup 0.03) |
| Optimizer | AdamW | AdamW | AdamW |
| Trainable Params | LLM only | Full (vision + LLM) | LLM only |
## Usage
```python
from transformers import AutoModelForCausalLM, AutoTokenizer
model_name = "Jingcz/RobustRDP"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
```
## Related Resources
- **Dataset**: [RxnLabelData](https://huggingface.co/datasets/Jingcz/RxnLabelData)
- **Annotation Platform**: [RxnLabel](https://github.com/jaydetang/RxnLabel)