aerovane0's picture
Improve model card: link paper and code, update license, and add tags (#1)
a5e8ef6
---
base_model: Qwen/Qwen3-8B
library_name: peft
license: mit
pipeline_tag: text-generation
tags:
- base_model:Qwen/Qwen3-8B
- llama-factory
- lora
- transformers
- medical
- public-health
model-index:
- name: sft100
results: []
---
# GlobalHealthAtlas Public Model
This repository contains the fine-tuned LoRA adapter for the **GlobalHealthAtlas Public Model**, introduced in the paper [From Knowledge to Inference: Formalizing Specialized Public Health Reasoning on GlobalHealthAtlas](https://huggingface.co/papers/2602.00491).
The model is designed to provide informative, context-aware answers to public health–related queries across 15 domains and 17 languages. It was fine-tuned from [Qwen/Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) on the GlobalHealthAtlas dataset.
- **Paper:** [From Knowledge to Inference: Formalizing Specialized Public Health Reasoning on GlobalHealthAtlas](https://huggingface.co/papers/2602.00491)
- **Repository:** [Jan8217/GlobalHealthAtlas](https://github.com/Jan8217/GlobalHealthAtlas)
## Overview
Public health reasoning requires population-level inference grounded in scientific evidence, expert consensus, and safety constraints. This model addresses the scarcity of supervised signals in this domain. It was trained on the GlobalHealthAtlas dataset, a large-scale multilingual corpus of 280,210 instances spanning 15 domains including infectious disease prevention, health policy, and vaccination.
## Intended Uses & Limitations
This model is intended to be used as a question-answering and information retrieval component for research and public health queries.
**Important Disclaimer:** This model is **NOT** intended for clinical diagnosis, medical advice, or other high‑stakes decision-making without human review by domain experts.
## Training Procedure
The model was fine-tuned using LoRA (Low-Rank Adaptation) on the Qwen3-8B base model via the [LLaMA Factory](https://github.com/hiyouga/LLaMA-Factory) framework.
### Training Hyperparameters
The following hyperparameters were used during training:
- **Learning rate:** 5e-05
- **Train batch size:** 1
- **Gradient accumulation steps:** 8
- **Total train batch size:** 8
- **Optimizer:** AdamW
- **LR scheduler type:** cosine
- **LR scheduler warmup ratio:** 0.1
- **Num epochs:** 2.0
- **Seed:** 42
### Framework Versions
- PEFT 0.15.1
- Transformers 4.51.3
- Pytorch 2.3.0+cu121
- Datasets 3.2.0
- Tokenizers 0.21.0
## Citation
If you use this model or the GlobalHealthAtlas dataset, please cite:
```bibtex
@article{globalhealthatlas2026,
title={From Knowledge to Inference: Formalizing Specialized Public Health Reasoning on GlobalHealthAtlas},
author={GlobalHealthAtlas Team},
year={2026}
}
```