Model Card for Model ID
Basically used to summarize text from the Open-i dataset. Duplicated to add a custom handler for Inference Endpoints
Training Details
Training Data
I used the Open-i dataset
Training Hyperparameters
Training regime: [More Information Needed]
16 Mixed Precision
LR of 0.0-1
5 Epochs
lambda medical weight of 0.075 and lambda negation weight of 0.045
Used 2nd iteration of summary medical concepts file
- Downloads last month
- 2
Model tree for Chilliwiddit/Openi-llama3.1-8B-WeightedLoss-Asclepius
Base model
meta-llama/Llama-3.1-8B Quantized
unsloth/Meta-Llama-3.1-8B-bnb-4bit