You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Qwen2-1.5B-DPO-Finetuned

This is a DPO fine-tuned version of Qwen2-1.5B.

Model Description

  • Architecture: Transformer-based language model
  • Parameters: 1.5 billion
  • Fine-tuning: DPO (Direct Preference Optimization)
  • Base Model: Qwen/Qwen2-1.5B

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

model_name = "SpringDai/Qwen2-1.5B-DPO-Finetuned"

# Load model and tokenizer
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.float16,
    device_map="auto"
)

# Inference
inputs = tokenizer("What is machine learning?", return_tensors="pt")
outputs = model.generate(**inputs, max_length=100, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
Downloads last month
-
Safetensors
Model size
2B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 1 Ask for provider support

Model tree for SpringDai/Qwen2-1.5B-DPO-Finetuned

Base model

Qwen/Qwen2-1.5B
Finetuned
(71)
this model