nkasmanoff
/

tool-bert

Text Classification

Generated from Trainer

Model card Files Files and versions

nkasmanoff commited on May 30, 2024

Commit

77d8b8e

·

verified ·

1 Parent(s): 6022556

Update README.md

Files changed (1) hide show

README.md +6 -33

README.md CHANGED Viewed

@@ -15,44 +15,17 @@ should probably proofread and complete it, then remove this comment. -->
 # tool-bert
-This model is a fine-tuned version of [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.0158
-- Accuracy: 0.9886
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
-### Training hyperparameters
-The following hyperparameters were used during training:
-- learning_rate: 5e-05
-- train_batch_size: 8
-- eval_batch_size: 8
-- seed: 42
-- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
-- lr_scheduler_type: linear
-- num_epochs: 3.0
-### Training results
-| Training Loss | Epoch | Step | Validation Loss | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|
-| No log        | 1.0   | 64   | 0.1104          | 0.9830   |
-| No log        | 2.0   | 128  | 0.0222          | 0.9886   |
-| No log        | 3.0   | 192  | 0.0158          | 0.9886   |
 ### Framework versions

 # tool-bert
+This model is a fine-tuned version of [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased).
+It uses a custom made dataset of sample user instructions, which are classified to a number of possible local assistant function calling endpoints.
+For example, given an input query, tool-bert returns a prediction as to what tool to use to augment a downstream LLM generated output with.
+More information on these tools to follow, but example tools are "play music", "check the weather", "get the news", "take a photo", or use no tool.
+Basically, this model is meant to be a means of allowing very small LLMs (i.e. 8B and below) to use function calling.
+All limitations and biases are inherited from the parent model.
 ### Framework versions