Model Card for Model ID

Word Sense Disambigution task with mimic of reasoning. SFT training with any COT prompting or Reasoning

Model Details

Base model: deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Acknowledgement

We acknowledge the support of the Supercomputing Wales project, which is part-funded by the European Regional Development Fund (ERDF) via Welsh Government.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for deshanksuman/finetuned-DeepSeek-R1-Distill-Qwen-1.5B_WSD-Think

Finetuned
(621)
this model

Dataset used to train deshanksuman/finetuned-DeepSeek-R1-Distill-Qwen-1.5B_WSD-Think