stanford-crfm/BioMedLM

Add medical tag

#6

by davanstrien HF Staff - opened Dec 19, 2023

base: refs/heads/main ← from: refs/pr/6

README.md +5 -3

@@ -2,8 +2,10 @@
 license: bigscience-bloom-rail-1.0
 datasets:
 - pubmed
-widget:
-- text: 'Photosynthesis is'
 ---
 # Model Card for BioMedLM 2.7B
@@ -145,4 +147,4 @@ BioMedLM 2.7B is a standard GPT-2 implementation (trained with Flash Attention)
 ## Compute Infrastructure
-The model was trained on [MosaicML Cloud](https://www.mosaicml.com/cloud), a platform designed for large workloads like LLMs. Using the [Composer](https://github.com/mosaicml/composer) training library and [PyTorch FSDP](https://pytorch.org/docs/stable/fsdp.html), it was easy to enable multi-node training across 128 A100-40GB GPUs, and the total run was completed in ~6.25 days.

 license: bigscience-bloom-rail-1.0
 datasets:
 - pubmed
+widget:
+- text: Photosynthesis is
+tags:
+- medical
 ---
 # Model Card for BioMedLM 2.7B
 ## Compute Infrastructure
+The model was trained on [MosaicML Cloud](https://www.mosaicml.com/cloud), a platform designed for large workloads like LLMs. Using the [Composer](https://github.com/mosaicml/composer) training library and [PyTorch FSDP](https://pytorch.org/docs/stable/fsdp.html), it was easy to enable multi-node training across 128 A100-40GB GPUs, and the total run was completed in ~6.25 days.
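
The metadata change in this diff can be sanity-checked with a short script. The sketch below is illustrative only and is not part of the PR: `front_matter_list` is a hypothetical helper, `README_AFTER` is a hand-copied excerpt of the new front matter, and the parser handles only the simple `key:` / `- item` layout shown here rather than general YAML.

```python
# Hypothetical helper (not part of the PR): read a top-level list from the
# YAML front matter of a model card without a YAML library. It only supports
# the flat "key:" followed by "- item" layout used in this README.

# Hand-copied excerpt of the front matter as it appears after this change.
README_AFTER = """---
license: bigscience-bloom-rail-1.0
datasets:
- pubmed
widget:
- text: Photosynthesis is
tags:
- medical
---
# Model Card for BioMedLM 2.7B
"""


def front_matter_list(readme_text: str, key: str) -> list[str]:
    """Return the items listed under `key` in the front matter, or []."""
    lines = readme_text.splitlines()
    # Front matter must start on the very first line with '---'.
    if not lines or lines[0].strip() != "---":
        return []
    items = []
    in_key = False
    for line in lines[1:]:
        if line.strip() == "---":  # closing delimiter ends the front matter
            break
        if in_key and line.startswith("- "):
            # Collect the list item, dropping optional surrounding quotes.
            items.append(line[2:].strip().strip("'\""))
        elif line.rstrip() == f"{key}:":
            in_key = True  # the following '- ' lines belong to this key
        else:
            in_key = False  # any other line ends the current list
    return items


if __name__ == "__main__":
    print(front_matter_list(README_AFTER, "tags"))
    print(front_matter_list(README_AFTER, "datasets"))
```

For anything beyond a quick check, a real YAML parser is the safer choice, since this helper ignores nesting, inline lists, and comments.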