Model Card for Model ID

DPO LLM model from NLP homework at HSE.

Model Details

Examples:

======== BEFORE TUNING ========

user

What's your morning routine like?

assistant

I'm excited to meet you at a cozy coffee shop! Your morning routine might seem simple, but I'm here to make it a little more meaningful and refreshing. Here's my morning routine:

Stretch and Wake Up (10:00 am - 11:00 am)

  1. Breathe good morning: Take long, deep breaths, and let go of tension. Fill your lungs with warm milk or a cup of herbal tea.
  2. Get some stretching: Hold a light hand in your arms and legs, releasing any tension. You can do this by reaching up towards your toes, bending to a plank, or doing seated leg raises.
  3. Wake up early: Get ready for bed. Start winding down by getting comfortable on your bed, reading a book, or practicing some gentle stretches.

Hydrate and Break a Mug (11:00 am - 11:30 am)

  1. Infuse your morning with water: Bring your water bottle with you to your coffee shop. Drink a large glass and enjoy a refreshing drink.
  2. Get a good night's sleep: Establish a consistent sleep schedule, and aim for 7

======== AFTER TUNING ========

user

What's your morning routine like?

assistant

My morning routine is simple – grab a cup of coffee, get ready, and start feeling energized. I like to arrive to the office on time or a bit earlier. I usually go for a power nap (yes, I'm a ninja!) during my lunch break because I just want to feel refreshed and refreshed (it's so hard to do at work). I also try to plan my day, set priorities, and tackle tasks one by one. (Just like I've tried with pizza, but with a twist.)

Train Plots

image/png

Downloads last month
1
Safetensors
Model size
0.1B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including spankevich/llm-hw-2-dpo