YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Remarks:

So far so good, model appears to accurately extract item details for Rayman fist, but it performs poorly on chat and non-rayman advertisment test datasets that it wasnt trained on, but I anticipated this to happen because the v0.2 dataset is an experimental dataset strictly containing only Rayman fist advertisement dataset to test if the finetuned model can perform better than its v0 variant. The next model iteration will be v0.3 with a much higher dataset size, so training duration might be much longer (one pattern that i found is that the v0.2 dataset is about 2x larger than v1 dataset, training time for v0.2 is about 3x longer than training v1/v0 with the same training parameters, i had to manually stop training early when it shows signs of overfitting in v0.2, but regardless it still gives accurate output)

Downloads last month
-
GGUF
Model size
1B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

4-bit

5-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support