Model Card for xinyuema/llm-course-hw3-lora

Model Details

The base model OuteAI/Lite-Oute-1-300M-Instruct was fine-tuned on the cardiffnlp/tweet_eval tweet sentiment dataset to classify the sentiment of a tweet as positive, neutral, or negative.

Model Description

SYSTEM PROMPT:

You are a tweet sentiment classifier. For each tweet input, analyze its sentiment and output exactly one word: "negative", "neutral", or "positive". Do not include any extra text.

The base model is not trained to return only the sentiment label, so we implemented a custom LoRA linear layer for parameter-efficient fine-tuning (PEFT) and used it to replace the k_proj and v_proj projection layers of the base model.
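The replacement described above can be sketched roughly as follows in PyTorch. This is an illustrative reconstruction, not the author's actual layer: the class name `LoRALinear`, the helper `apply_lora`, and the initialization scheme are assumptions. It wraps each frozen `nn.Linear` named k_proj or v_proj and adds a trainable low-rank update scaled by alpha / rank.

```python
import math

import torch
import torch.nn as nn


class LoRALinear(nn.Module):
    """Frozen nn.Linear plus a trainable low-rank update (hypothetical sketch)."""

    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # original weights stay fixed
        self.scale = alpha / rank
        kw = {"device": base.weight.device, "dtype": base.weight.dtype}
        # A gets a random init, B starts at zero, so the wrapped layer
        # initially computes exactly what the base layer computes.
        self.lora_a = nn.Parameter(torch.empty(rank, base.in_features, **kw))
        self.lora_b = nn.Parameter(torch.zeros(base.out_features, rank, **kw))
        nn.init.kaiming_uniform_(self.lora_a, a=math.sqrt(5))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        update = x @ self.lora_a.T @ self.lora_b.T
        return self.base(x) + self.scale * update


def apply_lora(model: nn.Module, targets=("k_proj", "v_proj"),
               rank: int = 8, alpha: float = 16.0) -> nn.Module:
    """Freeze the model and swap the targeted nn.Linear children for LoRALinear."""
    for p in model.parameters():
        p.requires_grad = False
    # Collect first so the module tree is not mutated while iterating over it.
    to_replace = [
        (parent, name, child)
        for parent in model.modules()
        for name, child in parent.named_children()
        if name in targets and isinstance(child, nn.Linear)
    ]
    for parent, name, child in to_replace:
        setattr(parent, name, LoRALinear(child, rank=rank, alpha=alpha))
    return model
```

After `apply_lora`, only the `lora_a` and `lora_b` matrices require gradients, which is what makes the fine-tuning parameter-efficient.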

Training Details

batch_size = 16, rank = 8, alpha = 16, lr = 5e-6

The fine-tuned model achieved a macro F1-score of 0.40 (0.06 for the base model before fine-tuning).
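For reference, macro F1 is the unweighted mean of the per-class F1 scores. The sketch below shows one way to compute it in plain Python. It is not the evaluation script behind the numbers above, and counting unparseable predictions as errors is an assumption.

```python
LABELS = ("negative", "neutral", "positive")


def macro_f1(y_true, y_pred, labels=LABELS) -> float:
    """Unweighted mean of per-class F1; predictions outside `labels` count as errors."""
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        # A class with no true or predicted examples contributes 0.0.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)
```

Because every class has equal weight, a model that ignores one of the three labels is penalized heavily.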

Safetensors: model size 0.3B params, tensor type F32
