Update README.md
- **License:** Apache 2.0
- **Finetuned from model:** TinyLlama-1.1b - 2.5T checkpoint
## Uses
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
### Direct Use
## How to Get Started with the Model
The fastest way to get started with BLING is through direct import in transformers:
```python
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("bling-tiny-llama-v0", trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained("bling-tiny-llama-v0", trust_remote_code=True)
```
Please refer to the `generation_test` .py files in the Files repository, which include 200 samples and a script to test the model. The **generation_test_llmware_script.py** script includes built-in llmware capabilities for fact-checking, as well as easy integration with document parsing and actual retrieval, so you can swap out the test set for a RAG workflow over your own business documents.

The BLING model was fine-tuned with a simple "\<human> and \<bot> wrapper", so to get the best results, wrap inference entries as:
```python
full_prompt = "<human>: " + my_prompt + "\n" + "<bot>:"
```
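Assembled end to end, generating an answer from the wrapped prompt might look like the sketch below. The sample question and generation settings are illustrative, not part of the original card; substitute your own prompt and the full hub path for the model if loading remotely.

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

model_name = "bling-tiny-llama-v0"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True)

my_prompt = "What is the payment due date in the contract?"  # hypothetical test question
full_prompt = "<human>: " + my_prompt + "\n" + "<bot>:"

inputs = tokenizer(full_prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=100, do_sample=False)

# Decode only the newly generated tokens (the answer after "<bot>:").
answer = tokenizer.decode(outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)
print(answer.strip())
```

In a RAG workflow, the retrieved passage is typically prepended to the question inside the `<human>` turn before generation.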