Duplicated from Chilliwiddit/Openi-llama3.1-8B-WeightedLoss-small2

Chilliwiddit
/

Openi-llama3.1-8B-WeightedLoss-Asclepius

Model card Files Files and versions

Model Card for Model ID

Basically used to summarize text from the Open-i dataset. Duplicated to add a custom handler for Inference Endpoints

Training Details

Training Data

I used the Open-i dataset

Training Hyperparameters

Training regime: [More Information Needed]
16 Mixed Precision
LR of 0.0-1
5 Epochs
lambda medical weight of 0.075 and lambda negation weight of 0.045
Used 2nd iteration of summary medical concepts file

Downloads last month: 2

Model tree for Chilliwiddit/Openi-llama3.1-8B-WeightedLoss-Asclepius

Base model

meta-llama/Llama-3.1-8B

Quantized

unsloth/Meta-Llama-3.1-8B-bnb-4bit

Adapter

(54)

this model