maifeeulasad
/

askubuntu-model

text2text-generation

Model card Files Files and versions

Metrics Training metrics Community

maifeeulasad commited on Jul 14, 2025

Commit

6f95460

·

verified ·

1 Parent(s): 45f41f9

[chores]: updated readme;

Files changed (1) hide show

README.md +23 -38

README.md CHANGED Viewed

@@ -1,59 +1,44 @@
 ---
-base_model: unsloth/deepseek-r1-distill-qwen-1.5b-unsloth-bnb-4bit
 library_name: transformers
 model_name: askubuntu-model
 tags:
-- generated_from_trainer
 - sft
 - unsloth
 - trl
-licence: license
 ---
 # Model Card for askubuntu-model
-This model is a fine-tuned version of [unsloth/deepseek-r1-distill-qwen-1.5b-unsloth-bnb-4bit](https://huggingface.co/unsloth/deepseek-r1-distill-qwen-1.5b-unsloth-bnb-4bit).
-It has been trained using [TRL](https://github.com/huggingface/trl).
 ## Quick start
 ```python
-from transformers import pipeline
-question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
-generator = pipeline("text-generation", model="maifeeulasad/askubuntu-model", device="cuda")
-output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
-print(output["generated_text"])
-```
-## Training procedure
-This model was trained with SFT.
-### Framework versions
-- TRL: 0.19.1
-- Transformers: 4.52.4
-- Pytorch: 2.7.1
-- Datasets: 3.6.0
-- Tokenizers: 0.21.2
-## Citations
-Cite TRL as:
-```bibtex
-@misc{vonwerra2022trl,
-	title        = {{TRL: Transformer Reinforcement Learning}},
-	author       = {Leandro von Werra and Younes Belkada and Lewis Tunstall and Edward Beeching and Tristan Thrush and Nathan Lambert and Shengyi Huang and Kashif Rasul and Quentin Gallou{\'e}dec},
-	year         = 2020,
-	journal      = {GitHub repository},
-	publisher    = {GitHub},
-	howpublished = {\url{https://github.com/huggingface/trl}}
-}
 ```

 ---
+base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
 library_name: transformers
 model_name: askubuntu-model
 tags:
 - sft
 - unsloth
 - trl
+- deepseek
+- qwen
+licence: agpl-3.0
+datasets:
+- maifeeulasad/askubuntu-data
 ---
 # Model Card for askubuntu-model
+This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B).
 ## Quick start
 ```python
+from transformers import AutoTokenizer, AutoModelForCausalLM
+from peft import PeftModel
+base_model_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"
+peft_model_id = "maifeeulasad/askubuntu-model"
+model = AutoModelForCausalLM.from_pretrained(
+    base_model_id,
+    device_map="auto",
+    trust_remote_code=True,
+)
+model = PeftModel.from_pretrained(model, peft_model_id)
+tokenizer = AutoTokenizer.from_pretrained(base_model_id)
+from transformers import pipeline
+generator = pipeline("text-generation", model=model, tokenizer=tokenizer)
+question = "Tell me how to install rootless docker on ubuntu 18 LTS?"
+output = generator(question, max_new_tokens=16384, return_full_text=False)[0]["generated_text"]
+print(output)
 ```