ishanb3d/synthetic_qa
Viewer • Updated • 1.92M • 29
A 2M-parameter question-answering model built to probe the lower limits of how small a usable generative QA model can be. It produces somewhat coherent responses to questions, given its extreme size constraints.
Prompts should follow this exact format:
<bos>Question: What is the purpose of unit testing in software projects?\nAnswer:
from transformers import AutoModelForCausalLM, AutoTokenizer
model_id = "ishanb3d/atto-language-model"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
prompt = "<bos>Question: What is the purpose of unit testing in software projects?\nAnswer:"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
This model is intended exclusively for research and development — for example, studying small-model behavior, capability limits, and synthetic-data training dynamics.
At only 2M parameters, output quality is limited. Responses may be incoherent, factually wrong, or otherwise unreliable, and the model should not be used in production or any setting requiring accuracy or safety.
Released under CC BY 4.0.