SmolLM2-360M Base IT

This is an instruction-tuned version of HuggingFaceTB/SmolLM2-360M.

The model was fine-tuned for general instruction following using Alpaca-style supervised fine-tuning.

Quick Start

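import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

model_id = "srmty/smolLM_360M_Base_it"

# The tokenizer and weights are stored in the "final" subfolder of the repo.
tokenizer = AutoTokenizer.from_pretrained(model_id, subfolder="final")

# Use half precision and automatic device placement on GPU; fall back to float32 on CPU.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    subfolder="final",
    torch_dtype=torch.float16 if torch.cuda.is_available() else torch.float32,
    device_map="auto" if torch.cuda.is_available() else None,
)

# Reuse the EOS token for padding if the tokenizer defines no pad token.
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token

# Switch to inference mode (disables dropout).
model.eval()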
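
Once the model and tokenizer are loaded, generation might look like the minimal sketch below. The helper name generate_response, the greedy decoding setting, and the 128-token limit are illustrative assumptions, not settings documented for this model. The prompt string is expected to follow the layout given in the Prompt Format section.

```python
def generate_response(model, tokenizer, prompt, max_new_tokens=128):
    """Hypothetical helper: return only the text generated after the prompt."""
    # Tokenize the prompt and move the tensors to the model's device.
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    prompt_len = len(inputs["input_ids"][0])

    # Greedy decoding is an assumption; sampling parameters can be passed instead.
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=False,
        pad_token_id=tokenizer.pad_token_id,
    )

    # Drop the prompt tokens so only the newly generated response is decoded.
    new_tokens = output_ids[0][prompt_len:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

With the objects from the snippet above, this could be called as print(generate_response(model, tokenizer, prompt)).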

Training Data

The model was fine-tuned on teknium/GPTeacher-General-Instruct, with the data formatted as Alpaca-style prompts.

Prompt Format

Use this format during inference:

Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

### Instruction:
{instruction}

### Input:
{input}

### Response:
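
The template above can be filled in programmatically. The following is a minimal sketch: the names PROMPT_TEMPLATE and build_prompt are hypothetical, and two details are assumptions rather than documented behavior: the single newline after "### Response:" and leaving the Input section empty when no input is supplied.

```python
# Alpaca-style template copied from the format shown above.
# The trailing newline after "### Response:" is an assumption.
PROMPT_TEMPLATE = (
    "Below is an instruction that describes a task, paired with an input "
    "that provides further context. Write a response that appropriately "
    "completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n"
)


def build_prompt(instruction, input_text=""):
    """Hypothetical helper: fill the template with an instruction and optional input."""
    # An empty input_text leaves the Input section blank (an assumption about
    # how input-free examples should be handled).
    return PROMPT_TEMPLATE.format(
        instruction=instruction.strip(),
        input=input_text.strip(),
    )


prompt = build_prompt(
    "Summarize the text in one sentence.",
    "SmolLM2 is a family of compact language models.",
)
```

The resulting prompt string ends at the Response header, so the text the model generates after it is the answer.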
