Improve model card: link paper and code, update license, and add tags

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +72 -112
README.md CHANGED
@@ -1,112 +1,72 @@
1
- ---
2
- library_name: peft
3
- license: other
4
- base_model: Qwen/Qwen3-8B
5
- tags:
6
- - base_model:Qwen/Qwen3-8B
7
- - llama-factory
8
- - lora
9
- - transformers
10
- pipeline_tag: text-generation
11
- model-index:
12
- - name: sft100
13
- results: []
14
- ---
15
-
16
-
17
- # GlobalHealthAtlas Public Model
18
-
19
- This repository contains the code and resources for the GlobalHealthAtlas Public Model project. The project hosts a fine-tuned question-answering model designed to provide helpful answers to public health–related queries.
20
-
21
- ## Overview
22
-
23
- The GlobalHealthAtlas Public Model is designed to provide informative, context-aware answers to questions about public health documents and topics. It includes the fine-tuned adapter model, tokenizer files, example prompts, and scripts for running inference in a Q&A or conversational setting.
24
-
25
- ## Files and Directories
26
-
27
- - **adapter_config.json**: Configuration for the adapter model.
28
- - **adapter_model.safetensors**: Pre-trained adapter model.
29
- - **added_tokens.json**: Tokens added to the tokenizer.
30
- - **all_results.json**: Example outputs and sample Q&A responses from inference and evaluation.
31
- - **merges.txt**: Merges for the tokenizer.
32
- - **README.md**: This file.
33
- - **special_tokens_map.json**: Mapping of special tokens.
34
- - **tokenizer_config.json**: Configuration for the tokenizer.
35
- - **tokenizer.json**: Tokenizer file.
36
- - **train_results.json**: Results from training.
37
- - **trainer_log.jsonl**: Log of the training process.
38
- - **trainer_state.json**: State of the trainer.
39
- - **checkpoint-990/**: Directory containing checkpoint files for the model.
40
-
41
- ## Usage
42
-
43
- To use the GlobalHealthAtlas Public Model, clone the repository and follow the instructions in the README files. Ensure that all dependencies are installed and configured correctly. Load `adapter_model.safetensors` and the tokenizer, then run the provided inference scripts or your own prompt loop to perform question-answering. This model is intended for research and information purposes and should not be used for clinical diagnosis or other high‑stakes decisions without human review.
44
-
45
- ## License
46
-
47
- This project is licensed under the MIT License. See the LICENSE file for details.
48
-
49
- ## Acknowledgments
50
-
51
- We would like to thank the contributors and the community for their support in developing this project.
52
-
53
- ## References
54
-
55
- For more information, please refer to the related publications and documentation in the Public Health directory.
56
-
57
- ## Model description
58
-
59
- This model is a fine-tuned version of [/root/Qwen3-8B](https://huggingface.co//root/Qwen3-8B) on the GlobalHealthAtlas dataset.
60
-
61
-
62
- ## Intended uses & limitations
63
-
64
- This model is intended to be used as a question-answering and information retrieval component for the Global Health Atlas project and related public health queries. It is designed to provide helpful, evidence-informed answers but is NOT intended for clinical diagnosis, medical advice, or other high‑stakes decision-making. Users should validate performance on their own data prior to deployment, monitor for potential biases introduced by the fine-tuning data, and review outputs with domain experts when used for operational tasks.
65
-
66
- ## Training and evaluation data
67
-
68
- The model was fine-tuned on the `GlobalHealthAtlas` dataset to adapt the Qwen/Qwen3-8B base model for question-answering and response generation. Evaluation used held-out splits and standard validation procedures. Detailed evaluation metrics, per-epoch results, and sample outputs are available in `train_results.json` and `all_results.json` in this repository. To reproduce results, use the same data splits and training hyperparameters listed below. For additional context about dataset preparation and task setup, consult the `Public_health/` directory and the Global Health Atlas repository: https://github.com/Jan8217/GlobalHealthAtlas.
69
-
70
-
71
- This model is a fine-tuned version of [/root/Qwen3-8B](https://huggingface.co//root/Qwen3-8B) on the GlobalHealthAtlas dataset
72
-
73
- ## Model description
74
-
75
- More information needed
76
-
77
- ## Intended uses & limitations
78
-
79
- More information needed
80
-
81
- ## Training and evaluation data
82
-
83
- More information needed
84
-
85
- ## Training procedure
86
-
87
- ### Training hyperparameters
88
-
89
- The following hyperparameters were used during training:
90
- - learning_rate: 5e-05
91
- - train_batch_size: 1
92
- - eval_batch_size: 8
93
- - seed: 42
94
- - gradient_accumulation_steps: 8
95
- - total_train_batch_size: 8
96
- - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
97
- - lr_scheduler_type: cosine
98
- - lr_scheduler_warmup_ratio: 0.1
99
- - num_epochs: 2.0
100
-
101
- ### Training results
102
-
103
- Training completed for 2 epochs using the hyperparameters listed above. Final evaluation metrics (validation loss and task-specific scores) and per-epoch logs are stored in `train_results.json` and the training log is available in `trainer_log.jsonl`. Please refer to those files for exact numeric results and per-example breakdowns.
104
-
105
-
106
- ### Framework versions
107
-
108
- - PEFT 0.15.1
109
- - Transformers 4.51.3
110
- - Pytorch 2.3.0+cu121
111
- - Datasets 3.2.0
112
- - Tokenizers 0.21.0
 
1
+ ---
2
+ base_model: Qwen/Qwen3-8B
3
+ library_name: peft
4
+ license: mit
5
+ pipeline_tag: text-generation
6
+ tags:
7
+ - base_model:Qwen/Qwen3-8B
8
+ - llama-factory
9
+ - lora
10
+ - transformers
11
+ - medical
12
+ - public-health
13
+ model-index:
14
+ - name: sft100
15
+ results: []
16
+ ---
17
+
18
+ # GlobalHealthAtlas Public Model
19
+
20
+ This repository contains the fine-tuned LoRA adapter for the **GlobalHealthAtlas Public Model**, introduced in the paper [From Knowledge to Inference: Formalizing Specialized Public Health Reasoning on GlobalHealthAtlas](https://huggingface.co/papers/2602.00491).
21
+
22
+ The model is designed to provide informative, context-aware answers to public health–related queries across 15 domains and 17 languages. It was fine-tuned from [Qwen/Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) on the GlobalHealthAtlas dataset.
23
+
24
+ - **Paper:** [From Knowledge to Inference: Formalizing Specialized Public Health Reasoning on GlobalHealthAtlas](https://huggingface.co/papers/2602.00491)
25
+ - **Repository:** [Jan8217/GlobalHealthAtlas](https://github.com/Jan8217/GlobalHealthAtlas)
26
+
27
+ ## Overview
28
+
29
+ Public health reasoning requires population-level inference grounded in scientific evidence, expert consensus, and safety constraints. This model addresses the scarcity of supervised signals in this domain. It was trained on the GlobalHealthAtlas dataset, a large-scale multilingual corpus of 280,210 instances spanning 15 domains including infectious disease prevention, health policy, and vaccination.
30
+
31
+ ## Intended Uses & Limitations
32
+
33
+ This model is intended to be used as a question-answering and information retrieval component for research and public health queries.
34
+
35
+ **Important Disclaimer:** This model is **NOT** intended for clinical diagnosis, medical advice, or other high‑stakes decision-making without human review by domain experts.
36
+
37
+ ## Training Procedure
38
+
39
+ The model was fine-tuned using LoRA (Low-Rank Adaptation) on the Qwen3-8B base model via the [LLaMA Factory](https://github.com/hiyouga/LLaMA-Factory) framework.
40
+
41
+ ### Training Hyperparameters
42
+
43
+ The following hyperparameters were used during training:
44
+ - **Learning rate:** 5e-05
45
+ - **Train batch size:** 1
46
+ - **Gradient accumulation steps:** 8
47
+ - **Total train batch size:** 8
48
+ - **Optimizer:** AdamW
49
+ - **LR scheduler type:** cosine
50
+ - **LR scheduler warmup ratio:** 0.1
51
+ - **Num epochs:** 2.0
52
+ - **Seed:** 42
53
+
54
+ ### Framework Versions
55
+
56
+ - PEFT 0.15.1
57
+ - Transformers 4.51.3
58
+ - Pytorch 2.3.0+cu121
59
+ - Datasets 3.2.0
60
+ - Tokenizers 0.21.0
61
+
62
+ ## Citation
63
+
64
+ If you use this model or the GlobalHealthAtlas dataset, please cite:
65
+
66
+ ```bibtex
67
+ @article{globalhealthatlas2026,
68
+ title={From Knowledge to Inference: Formalizing Specialized Public Health Reasoning on GlobalHealthAtlas},
69
+ author={GlobalHealthAtlas Team},
70
+ year={2026}
71
+ }
72
+ ```