Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ushakov15
/
MNLP_M2_rag_model
like
0
Text Generation
Transformers
Safetensors
HuggingFaceTB/smol-smoltalk
qwen3
Generated from Trainer
alignment-handbook
trl
sft
conversational
text-generation-inference
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
ushakov15
commited on
May 27, 2025
Commit
0278bb1
·
verified
·
1 Parent(s):
899bd0f
Create README.md
Browse files
Files changed (1)
hide
show
README.md
+1
-0
README.md
ADDED
Viewed
@@ -0,0 +1 @@
1
+
This model is a fine-tuned version of Qwen/Qwen3-0.6B-Base on the ['HuggingFaceTB/smol-smoltalk'] dataset. It has been trained using TRL.