Rin

A model trained for agentic work on device.

  • Long-horizon agentic tasks, coding, and everyday chat
  • Reasons privately, then answers cleanly
  • ~4B parameters — runs locally on consumer hardware

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
tok = AutoTokenizer.from_pretrained("Loke-60000/rin-4b")
model = AutoModelForCausalLM.from_pretrained("Loke-60000/rin-4b", device_map="auto")
msgs = [{"role": "user", "content": "Write a Python function to reverse a string."}]
inputs = tok.apply_chat_template(msgs, add_generation_prompt=True, return_tensors="pt").to(model.device)
print(tok.decode(model.generate(**{"input_ids": inputs}, max_new_tokens=512)[0]))
Downloads last month
2
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support