Echo-SmolTools-114M-Intent-CLF-Gen


This is a generative sequence classification model based on the Echo-DSRN architecture. It was produced by merging the PEFT adapter ethicalabs/Echo-SmolTools-114M-Intent-PEFT into the base model ethicalabs/Echo-DSRN-114M-v0.1.2.

No additional linear head is trained. The adapter's generative knowledge is used directly through constrained next-token scoring: for each candidate label, the model sums the log-probabilities of the label's tokens, then picks the label with the highest total.
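
The scoring rule described above can be illustrated with a minimal, framework-free sketch. It is not the model's actual implementation: `toy_next_logprobs`, the four-token vocabulary, and the token ids assigned to each label are all made up for illustration, and the softmax over candidate scores is an assumption about how per-label probabilities might be derived.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical interface: given a context of token ids, return
# log P(next token = t | context) for every vocabulary id t.
NextTokenLogProbs = Callable[[Sequence[int]], Sequence[float]]


def score_label(prompt_ids: Sequence[int], label_ids: Sequence[int],
                next_logprobs: NextTokenLogProbs) -> float:
    """Sum the log-probability of each label token, conditioned on the
    prompt plus the label tokens that precede it."""
    context = list(prompt_ids)
    total = 0.0
    for tok in label_ids:
        total += next_logprobs(context)[tok]
        context.append(tok)  # teacher-force the label token
    return total


def classify(prompt_ids: Sequence[int], labels: Dict[str, List[int]],
             next_logprobs: NextTokenLogProbs) -> Tuple[str, Dict[str, float]]:
    """Score every candidate label and return the best one, plus a softmax
    over the candidate scores (assumed normalisation, for illustration)."""
    scores = {name: score_label(prompt_ids, ids, next_logprobs)
              for name, ids in labels.items()}
    best = max(scores, key=scores.get)
    top = max(scores.values())
    norm = sum(math.exp(s - top) for s in scores.values())
    probs = {name: math.exp(s - top) / norm for name, s in scores.items()}
    return best, probs


def toy_next_logprobs(context: Sequence[int]) -> List[float]:
    """Toy stand-in for a language model over a 4-token vocabulary:
    the token (last + 1) % 4 gets probability 0.7, the rest 0.1 each."""
    favoured = (context[-1] + 1) % 4
    return [math.log(0.7 if t == favoured else 0.1) for t in range(4)]


# Made-up token ids for three MASSIVE intent names.
toy_labels = {"alarm_set": [1, 2], "play_music": [3], "weather_query": [2, 3]}
best_label, label_probs = classify([0], toy_labels, toy_next_logprobs)
```

Because the score is a plain sum, labels that tokenize into more pieces accumulate more negative terms; whether the model applies any length normalisation is not stated here.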

Model Details

  • Architecture: EchoForGenerativeClassification
  • Base model: ethicalabs/Echo-DSRN-114M-v0.1.2
  • Adapter: ethicalabs/Echo-SmolTools-114M-Intent-PEFT
  • Labels: 60 Amazon MASSIVE intents (51 languages)
  • Dtype: bfloat16
  • Constraint Method: Next-token generative scoring

Usage

This model requires trust_remote_code=True to load the custom architecture.

import torch
from transformers import AutoTokenizer
from echo_dsrn.modeling_generative_clf import EchoForGenerativeClassification

model_id = "ethicalabs/Echo-SmolTools-114M-Intent-CLF-Gen" # or your hub path

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = EchoForGenerativeClassification.from_pretrained(
    model_id,
    trust_remote_code=True,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Single utterance
label, probs = model.classify("Enter your text here", tokenizer)
print(f"Prediction: {label}")
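
The structure of the `probs` value returned by `classify` is not documented above. As a sketch only, assuming it behaves like a mapping from label name to probability, a small helper such as the hypothetical `top_k` below could rank the candidates. The example values are invented for illustration, not model output.

```python
from typing import Dict, List, Tuple


def top_k(probs: Dict[str, float], k: int = 3) -> List[Tuple[str, float]]:
    """Return the k highest-probability (label, probability) pairs.

    Assumes `probs` maps label names to probabilities; adapt this if the
    model returns a tensor or a list instead.
    """
    return sorted(probs.items(), key=lambda item: item[1], reverse=True)[:k]


# Invented values standing in for a real `probs` result.
example_probs = {"alarm_set": 0.74, "play_music": 0.15, "weather_query": 0.11}
ranked = top_k(example_probs, k=2)
for name, p in ranked:
    print(f"{name}: {p:.2f}")
```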