varadsrivastava
/

BAI_Arg_Alpha

text-generation-inference

Model card Files Files and versions

varadsrivastava commited on Jun 1, 2024

Commit

5cf5723

·

verified ·

1 Parent(s): fc84d2f

Update README.md

Files changed (1) hide show

README.md +23 -4

README.md CHANGED Viewed

@@ -11,12 +11,31 @@ tags:
 base_model: unsloth/llama-3-8b-Instruct-bnb-4bit
 ---
-# Uploaded  model
 - **Developed by:** varadsrivastava
 - **License:** apache-2.0
-- **Finetuned from model :** unsloth/llama-3-8b-Instruct-bnb-4bit
-This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 base_model: unsloth/llama-3-8b-Instruct-bnb-4bit
 ---
+# Model: BAI_LLM_FinArg
 - **Developed by:** varadsrivastava
 - **License:** apache-2.0
+- **Base Model :** unsloth/llama-3-8b-Instruct-bnb-4bit
+# For Proper Inference, please use:
+!pip install "unsloth[colab-new] @ git+https://GitHub.com/unslothai/unsloth.git@April-Llama-3-2024"
+### Loading the fine-tuned model and the tokenizer for inference
+from unsloth import FastLanguageModel
+model, tokenizer = FastLanguageModel.from_pretrained(
+        model_name = "varadsrivastava/BAI_LLM_FinArg",
+        max_seq_length = 20,
+        dtype = torch.bfloat16,
+        load_in_4bit = True
+    )
+### Using FastLanguageModel for fast inference
+FastLanguageModel.for_inference(model)
+# Prompt template:
+"""<|begin_of_text|><|start_header_id|>system<|end_header_id|>
+{instruction}<|eot_id|><|start_header_id|>user<|end_header_id|>
+Sentence: {row['text']}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
+Class: {row['label']}<|eot_id|>"""
+NOTE: This model was trained 2x faster using Unsloth and Huggingface's TRL library.