TrialPanorama
/

LLaMA-3-8B-TP

 ---
+license: apache-2.0
+base_model: meta-llama/Meta-Llama-3-8B
+tags:
+- trialpanorama
+- clinical-trials
+- sample-size-estimation
+- rlvr
+- reinforcement-learning
+- llama-3
+language:
+- en
+pipeline_tag: text-generation
 ---
+# LLaMA-3-8B-TP
+This model is fine-tuned from [Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) by using [TrialPanorama dataset](https://huggingface.co/datasets/TrialPanorama/Dataset) for clinical trials.
+## Model Details
+- **Base Model**: Meta-Llama-3-8B-Instruct
+- **Fine-tuning Method**: Two-stage training
+  - Stage 1: Supervised Fine-Tuning (SFT) for knowledge injection
+  - Stage 2: RLVR (Reinforcement Learning with Verifiable Reward)
+## Usage
+### Basic Usage with Transformers
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+import torch
+# Load model and tokenizer
+model_name = "TrialPanorama/LLaMA-3-8B-TP"
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForCausalLM.from_pretrained(
+    model_name,
+    torch_dtype=torch.bfloat16,
+    device_map="auto"
+)
+# Prepare input (a toy example)
+prompt = """Given the following clinical trial information, estimate the required sample size:
+[Input Information]
+Please provide the estimated sample size and reasoning."""
+# Generate response
+inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
+outputs = model.generate(
+    **inputs,
+    max_new_tokens=512,
+    temperature=0.6,
+    top_p=0.95,
+    do_sample=True
+)
+response = tokenizer.decode(outputs[0], skip_special_tokens=True)
+print(response)
+```
+### Usage with vLLM (Recommended for Production)
+```python
+from vllm import LLM, SamplingParams
+# Initialize vLLM
+llm = LLM(
+    model="TrialPanorama/LLaMA-3-8B-TP",
+    tensor_parallel_size=1,
+    dtype="bfloat16"
+)
+# Set sampling parameters
+sampling_params = SamplingParams(
+    temperature=0.6,
+    top_p=0.95,
+    max_tokens=512
+)
+# Generate
+prompts = ["Your sample size estimation prompt here"]
+outputs = llm.generate(prompts, sampling_params)
+for output in outputs:
+    print(output.outputs[0].text)
+```
+## Citation
+If you use this model in your research, please cite:
+```bibtex
+@article{wang2025trialpanorama,
+  title     = {Developing Large Language Models for Clinical Research Using One Million Clinical Trials},
+  author    = {Wang, Zifeng and Lin, Jiacheng and Jin, Qiao and Gao, Junyi and Pradeepkumar, Jathurshan and Jiang, Pengcheng and Lu, Zhiyong and Sun, Jimeng},
+  journal   = {arXiv preprint arXiv:2505.16097},
+  year      = {2025},
+  url       = {https://arxiv.org/abs/2505.16097}
+}
+```