Update README.md
README.md
CHANGED
@@ -15,6 +15,13 @@ slim-sa-ner-3b combines two of the most popular traditional classifier functions

 This 'combo' model is designed to illustrate the potential power of using function calls on small, specialized models to enable a single model architecture to combine the capabilities of what were traditionally two separate model architectures on an encoder.

+The intent of SLIMs is to forge a middle ground between traditional encoder-based classifiers and open-ended API-based LLMs.
+
+**Compared to encoder classifiers**: SLIMs provide intuitive natural language responses that fit more naturally in LLM-based agent processes, generalize better, provide a better fine-tuning base for specialized domains, and offer the potential for combining different classification modalities into a single model architecture.
+
+**Compared to API-based mega language models**: SLIMs are small and specialized, do a few things well, run locally, and do not require complex prompt instructions to generate syntactically correct format and keys.
+
+
 This model is fine-tuned on top of [**llmware/bling-stable-lm-3b-4e1t-v0**](https://huggingface.co/llmware/bling-stable-lm-3b-4e1t-v0), which, in turn, is a fine-tune of stabilityai/stablelm-3b-4e1t.

 Each slim model has a 'quantized tool' version, e.g., [**'slim-sa-ner-3b-tool'**](https://huggingface.co/llmware/slim-sa-ner-3b-tool).
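The README text in this change says the model is meant to generate output with a syntactically correct format and keys. As a minimal sketch of what consuming such output could look like, the snippet below parses a dictionary-style string and checks which keys are present. The sample string, the key names in `EXPECTED_KEYS`, and both helper functions are hypothetical illustrations, not taken from the model card or the llmware library; the real output schema should be checked against the model's own documentation.

```python
import ast

# Hypothetical key set for a combined sentiment + NER output (an assumption,
# not the documented schema of slim-sa-ner-3b).
EXPECTED_KEYS = {"sentiment", "people", "organization", "place"}


def parse_function_call_output(raw: str) -> dict:
    """Parse dictionary-style model text into a dict; return {} if malformed."""
    try:
        # literal_eval only accepts Python literals, so it does not execute code.
        parsed = ast.literal_eval(raw.strip())
    except (ValueError, SyntaxError):
        return {}
    return parsed if isinstance(parsed, dict) else {}


def missing_keys(parsed: dict, expected=EXPECTED_KEYS) -> list:
    """Return the expected keys that are absent from the parsed output, sorted."""
    return sorted(set(expected) - set(parsed))


# Hypothetical raw output string, written by hand for illustration.
sample = (
    "{'sentiment': ['positive'], 'people': ['Jane Doe'], "
    "'organization': ['Acme'], 'place': ['Paris']}"
)
result = parse_function_call_output(sample)
print(result["sentiment"], missing_keys(result))
```

Returning an empty dict on malformed text keeps the caller's logic simple: an agent step can treat "no keys present" and "unparseable output" the same way and decide whether to retry.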