Instructions to use mcnckc/llm-course-hw3-dora with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use mcnckc/llm-course-hw3-dora with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-classification", model="mcnckc/llm-course-hw3-dora")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("mcnckc/llm-course-hw3-dora") model = AutoModelForCausalLM.from_pretrained("mcnckc/llm-course-hw3-dora") - Notebooks
- Google Colab
- Kaggle
Модель OuteAI/Lite-Oute-1-300M-Instruct дообученная на датасете cardiffnlp/tweet_eval, задача классификации сентимента твита, вывести одно из трех слов -
negative, neutral, positive.
Дообучение
Модель дообучалась при помощи DoRA.
- Ранг LoRA =
16 alpha=32- DoRA применялась только к весам Key, Value в attention
BATCH_SIZE = 16LEARNING_RATE = 2e-4NUM_EPOCHS = 2AdamWweight_decay=0.01
Метрика на валидации
F1=0.51
Tweet: "QT @user In the original draft of the 7th book, Remus Lupin survived the Battle of Hogwarts. #HappyBirthdayRemusLupin"
Label: positive
Output:
positive
positive
positive
pos
Tweet: "Ben Smith / Smith (concussion) remains out of the lineup Thursday, Curtis #NHL #SJ"
Label: neutral
Output:
neutral
neutral
neutral
neut
Tweet: Sorry bout the stream last night I crashed out but will be on tonight for sure. Then back to Minecraft in pc tomorrow night.
Label: neutral
Output:
neutral
positive
positive
pos
Tweet: Chase Headley's RBI double in the 8th inning off David Price snapped a Yankees streak of 33 consecutive scoreless innings against Blue Jays
Label: neutral
Output:
neutral
neutral
neutral
neut
Tweet: @user Alciato: Bee will invest 150 million in January, another 200 in the Summer and plans to bring Messi by 2017"
Label: positive
Output:
neutral
negative
negative
negative
- Downloads last month
- 3
Model tree for mcnckc/llm-course-hw3-dora
Base model
OuteAI/Lite-Oute-1-300M-Instruct