Update README.md
README.md CHANGED
@@ -3,7 +3,7 @@ pipeline_tag: text-generation
license: other
---

-#
+# 🦙 LLaMA-7B

LLaMA-7B is a base model for text generation. It was built and released by Meta AI alongside "[LLaMA: Open and Efficient Foundation Language Models](https://arxiv.org/abs/2302.13971)".

@@ -31,7 +31,7 @@ evaluating and mitigating biases, risks, toxic and harmful content generations,
The primary intended users of the model are researchers in natural language processing, machine learning and artificial intelligence.

**Out-of-scope use cases**
-LLaMA is a foundational model, and as such, it should not be used for downstream …
+LLaMA is a base model, also known as a foundation model. As such, it should not be used on downstream applications without further risk evaluation, mitigation, and potential further fine-tuning (for example, on instructions and/or chats). In particular, the model has not been trained with human feedback, and can thus generate toxic or offensive content, incorrect information or generally unhelpful answers.

## Factors
**Relevant factors**
@@ -60,12 +60,11 @@ LLaMA is a foundational model, and as such, it should not be used for downstream

### Setup
```python
-# Install packages
!pip install -q -U transformers accelerate torch
```
### GPU Inference in fp16

-This requires a GPU with at least 15GB …
+This requires a GPU with at least 15GB of VRAM.

### First, Load the Model

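The loading code itself sits outside the diff hunks. For context, a minimal sketch of what fp16 loading with transformers typically looks like; the model id `huggyllama/llama-7b` is an assumed placeholder, not something this diff confirms:

```python
# Minimal fp16 loading sketch (not part of the diff).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "huggyllama/llama-7b"  # assumption; substitute this README's repo

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # fp16: ~2 bytes/param, ~14 GB for 7B params
    device_map="auto",          # requires accelerate (installed in Setup)
)
```

fp16 halves the memory footprint relative to fp32, which is why roughly 15 GB of VRAM suffices for a 7B-parameter model.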
@@ -103,4 +102,4 @@ _ = model.generate(
    max_new_tokens=20,
    streamer=streamer,
)
-```
+```
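The last hunk shows only the tail of a streamed `generate` call. A sketch of the plausible surrounding code, reusing the model and tokenizer from the loading sketch above; the prompt string is illustrative:

```python
# Streamed-generation sketch matching the generate(...) tail in the last hunk.
from transformers import TextStreamer

streamer = TextStreamer(tokenizer, skip_prompt=True)  # print tokens as they arrive

inputs = tokenizer("The capital of France is", return_tensors="pt").to(model.device)
_ = model.generate(
    **inputs,
    max_new_tokens=20,
    streamer=streamer,
)
```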