alfazick
/

llama-3.1-8b-function-calling

Text Generation

function-calling

Model card Files Files and versions

alfazick commited on Nov 26, 2025

Commit

369095e

·

verified ·

1 Parent(s): 5fbe284

Update README.md

Files changed (1) hide show

README.md +7 -2

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ Fine-tuned [Llama 3.1 8B Instruct](https://huggingface.co/meta-llama/Llama-3.1-8
 ## Training
 - **Dataset:** 900 examples from [Salesforce/xlam-function-calling-60k](https://huggingface.co/datasets/Salesforce/xlam-function-calling-60k)
-- **Method:** LoRA (r=16, alpha=16)
 - **Trainable params:** 42M / 8B (0.52%)
 - **Epochs:** 1
 - **Loss:** 0.66 → 0.63
@@ -62,4 +62,9 @@ You are a helpful assistant with access to the following tools or function calls
 - Trained on 900 examples (proof of concept)
 - May have argument variations vs ground truth
-- Best for single/simple tool calls

 ## Training
 - **Dataset:** 900 examples from [Salesforce/xlam-function-calling-60k](https://huggingface.co/datasets/Salesforce/xlam-function-calling-60k)
+- **Method:** LoRA
 - **Trainable params:** 42M / 8B (0.52%)
 - **Epochs:** 1
 - **Loss:** 0.66 → 0.63
 - Trained on 900 examples (proof of concept)
 - May have argument variations vs ground truth
+- Best for single/simple tool calls
+## Training Details
+- **Framework:** Unsloth 2025.11.2 + TRL
+- **Hardware:** RTX 5090 (32GB)
+- **Method:** LoRA (r=16, alpha=16)