Update README.md
README.md
CHANGED
@@ -12,7 +12,7 @@ library_name: transformers
 language:
 - en
 pipeline_tag: text-generation
-base_model: deepseek-ai/deepseek-llm-1.5b-chat
+base_model: deepseek-ai/deepseek-llm-1.5b-chat # Or deepseek-ai/deepseek-llm-7b-chat
 ---
 
 # Emotion-Therapy Chatbot Based on DeepSeek LLM (1.5B)
@@ -25,7 +25,7 @@ This model is part of the project **"Designing an Emotion-Therapy Chatbot Based
 
 ## 🔧 Model Training Details
 
-- **Base Model**: `
+- **Base Model**: `Deepseek R1-1.5B - Distill` or `Deepseek R1-7B - Distill`
 - **Platform**: AutoDL with a single NVIDIA RTX 4090 GPU instance
 - **Fine-tuning Method**: LoRA (Low-Rank Adaptation) using [LLaMA Factory](https://github.com/hiyouga/LLaMA-Factory)
 - **Objective**: Improve model performance on empathetic responses, emotional understanding, and mental support