💻 Twinkle Coder
This model is a full-parameter SFT checkpoint for SQL generation, trained from mistralai/Devstral-Small-2505 and exported to Hugging Face safetensors format.
- Base model: mistralai/Devstral-Small-2505
- Architecture: MistralForCausalLM
- Weight format: safetensors with model.safetensors.index.json

The SFT run merged the following datasets:
from transformers import AutoModelForCausalLM, AutoTokenizer

# Replace the placeholder with the Hub repo ID or a local checkpoint directory.
repo_or_path = "<hf-username-or-org>/<model-repo>"

tokenizer = AutoTokenizer.from_pretrained(repo_or_path, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    repo_or_path,
    torch_dtype="bfloat16",  # load the weights in bfloat16
)
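Because the weights are exported as sharded safetensors files with a model.safetensors.index.json, a partial download can leave a local checkpoint directory incomplete. The following is a minimal sketch, not part of the original repository, for checking that before loading. The helper name `missing_shards` is hypothetical, and it assumes the standard sharded-checkpoint index layout, in which a top-level `weight_map` maps each parameter name to the shard file that stores it.

```python
import json
from pathlib import Path


def missing_shards(model_dir):
    """Return shard filenames listed in the index but absent from model_dir.

    Hypothetical helper: assumes model_dir holds a local copy of the
    checkpoint, including model.safetensors.index.json.
    """
    model_dir = Path(model_dir)
    index_path = model_dir / "model.safetensors.index.json"
    with index_path.open(encoding="utf-8") as f:
        index = json.load(f)
    # "weight_map" maps each parameter name to the shard file that stores it,
    # so its distinct values are the shard files the checkpoint needs.
    shard_names = sorted(set(index["weight_map"].values()))
    return [name for name in shard_names if not (model_dir / name).is_file()]
```

An empty list means every shard named in the index is present. Any returned filenames would need to be downloaded again before passing the directory to `from_pretrained`.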
- config.json
- generation_config.json
- tekken.json
- model-00001-of-00021.safetensors ... model-00021-of-00021.safetensors
- model.safetensors.index.json

If you use this model, please cite this repository:
Base model
mistralai/Mistral-Small-3.1-24B-Base-2503