EpistemeAI
/

Fireball-12B

Text Generation

text-generation-inference

Eval Results (legacy)

Model card Files Files and versions

legolasyiu commited on Aug 22, 2024

Commit

404c30d

·

verified ·

1 Parent(s): 228d010

Update README.md

Files changed (1) hide show

README.md +16 -21

README.md CHANGED Viewed

@@ -32,26 +32,6 @@ Supervised fine-tuning with dataset:
 - candenizkocak/code-alpaca-297k
 - yahma/alpaca-cleaned
-# Uploaded  model
-- **Developed by:** EpistemeAI
-- **License:** apache-2.0
-- **Finetuned from model :** EpistemeAI/Fireball-Mistral-Nemo-Base-2407-sft-v2.2a
-This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
-# Guardrail/Moderation guide:
-For guardrailing and moderating prompts against indirect/direct prompt injections and jailbreaking, please follow the SentinelShield AI GitHub repository:
-[SentinelShield AI](https://github.com/tomtyiu/SentinelShieldAI)
 # Model Card for Fireball-12B
 The Heavy fine-tuned Mistral-Nemo-Base-2407 Large Language Model (LLM) is a pretrained generative text model of 12B parameters trained jointly by Mistral AI and NVIDIA, it significantly outperforms existing models smaller or similar in size.
@@ -77,6 +57,11 @@ Mistral Nemo is a transformer model, with the following architecture choices:
 - **Vocabulary size:** 2**17 ~= 128k
 - **Rotary embeddings (theta = 1M)**
 #### Demo
 After installing `mistral_inference`, a `mistral-demo` CLI command should be available in your environment.
@@ -155,4 +140,14 @@ print(tokenizer.decode(outputs[0], skip_special_tokens=True))
   journal = {GitHub repository},
   howpublished = {\url{https://github.com/tatsu-lab/stanford_alpaca}},
 }
-```

 - candenizkocak/code-alpaca-297k
 - yahma/alpaca-cleaned
 # Model Card for Fireball-12B
 The Heavy fine-tuned Mistral-Nemo-Base-2407 Large Language Model (LLM) is a pretrained generative text model of 12B parameters trained jointly by Mistral AI and NVIDIA, it significantly outperforms existing models smaller or similar in size.
 - **Vocabulary size:** 2**17 ~= 128k
 - **Rotary embeddings (theta = 1M)**
+# Guardrail/Moderation guide:
+For guardrailing and moderating prompts against indirect/direct prompt injections and jailbreaking, please follow the SentinelShield AI GitHub repository:
+[SentinelShield AI](https://github.com/tomtyiu/SentinelShieldAI)
 #### Demo
 After installing `mistral_inference`, a `mistral-demo` CLI command should be available in your environment.
   journal = {GitHub repository},
   howpublished = {\url{https://github.com/tatsu-lab/stanford_alpaca}},
 }
+```
+# Uploaded  model
+- **Developed by:** EpistemeAI
+- **License:** apache-2.0
+- **Finetuned from model :** EpistemeAI/Fireball-Mistral-Nemo-Base-2407-sft-v2.2a
+This mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
+[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)