| --- |
| license: apache-2.0 |
| tags: |
| - qwen2.5 |
| - lora |
| - negotiation |
| --- |
| |
| # Negotiation Models Repository |
|
|
| This repository contains various fine-tuned models for negotiation message reconstruction. |
|
|
| ## Available Models |
|
|
| ### Qwen3_4B_LORA_1k |
| - **Location**: `Qwen3_4B_LORA_1k/` |
| - **Base model**: Qwen/Qwen2.5-4B-Instruct |
| - **Method**: LoRA (Low-Rank Adaptation) |
| - **Training samples**: ~1k negotiation examples |
| - **Hardware**: 2x A100 GPUs |
| - **Size**: ~505MB (LoRA adapter only) |
|
|
| ### Qwen3_4B_Full_1k |
| - **Location**: `Qwen3_4B_Full_1k/` |
| - **Base model**: Qwen/Qwen2.5-4B-Instruct |
| - **Method**: Full fine-tuning (all parameters) |
| - **Training samples**: ~1k negotiation examples |
| - **Hardware**: 2x A100 GPUs with DeepSpeed ZeRO-3 |
| - **Size**: ~7.6GB (full model weights) |
|
|
| ## Usage |
|
|
| ### Using LoRA model |
| ```python |
| from transformers import AutoModelForCausalLM, AutoTokenizer |
| from peft import PeftModel |
| |
| # Load base model |
| base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-4B-Instruct") |
| tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-4B-Instruct") |
| |
| # Load LoRA adapter |
| model = PeftModel.from_pretrained(base_model, "JLiangHe/negotiation_clone", subfolder="Qwen3_4B_LORA_1k") |
| ``` |
|
|
| ### Using Full fine-tuned model |
| ```python |
| from transformers import AutoModelForCausalLM, AutoTokenizer |
| |
| # Load full fine-tuned model directly |
| model = AutoModelForCausalLM.from_pretrained("JLiangHe/negotiation_clone", subfolder="Qwen3_4B_Full_1k") |
| tokenizer = AutoTokenizer.from_pretrained("JLiangHe/negotiation_clone", subfolder="Qwen3_4B_Full_1k") |
| ``` |
|
|
| ## Model Comparison |
|
|
| | Model | Method | Size | Memory | Performance | |
| |-------|--------|------|--------|-------------| |
| | Qwen3_4B_LORA_1k | LoRA | 505MB | Low (base + adapter) | Good for inference efficiency | |
| | Qwen3_4B_Full_1k | Full FT | 7.6GB | High (full model) | May have better task adaptation | |
|
|