lastmass/llama3.1-Medical-Assistant

lastmass committed on Jun 24, 2025

Commit 4eb1adb (verified) · 1 Parent(s): 400e75b

Update README.md

Files changed (1)

README.md +3 -0

@@ -12,10 +12,13 @@ library_name: transformers
 pipeline_tag: text-generation
 ---
 # Chinese medical dialogue model based on Meta-Llama-3.1-8B-Instruct
 This model was obtained by supervised fine-tuning (SFT) of the `meta-llama/Llama-3.1-8B` base model on the `Flmc/DISC-Med-SFT` dataset. It is intended to provide users with medical dialogue support.
+# For the medical reasoning model trained with GRPO, see [https://huggingface.co/lastmass/Qwen3_Medical_GRPO]
 ## Model architecture
 This model uses LoRA (Low-Rank Adaptation); the trained LoRA adapter weights are stored in the `adapter_model.safetensors` file.
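
Since the README says the LoRA weights ship as `adapter_model.safetensors`, a reader would typically load the base model first and attach the adapter on top. The sketch below shows one way to do that with the `transformers` and `peft` libraries. It is not taken from the repository: the adapter repo ID is an assumption (that the adapter files live in `lastmass/llama3.1-Medical-Assistant` itself), and the helper names `load_medical_assistant` and `generate_reply` are hypothetical.

```python
# Base model named in the README.
BASE_MODEL_ID = "meta-llama/Llama-3.1-8B"
# Assumption: the LoRA adapter files are hosted in this model repo.
ADAPTER_REPO_ID = "lastmass/llama3.1-Medical-Assistant"


def load_medical_assistant(base_id=BASE_MODEL_ID, adapter_id=ADAPTER_REPO_ID):
    """Load the base model and attach the LoRA adapter (hypothetical helper)."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype="auto")
    # Reads the adapter config and weights from adapter_id and wraps the base model.
    model = PeftModel.from_pretrained(base, adapter_id)
    return tokenizer, model


def generate_reply(tokenizer, model, prompt, max_new_tokens=128):
    """Generate a text continuation for a prompt (hypothetical helper)."""
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `load_medical_assistant()` and passing the result to `generate_reply` with a medical question would download both the base weights and the adapter, so it needs network access and enough memory for an 8B-parameter model.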