Model Card for Model ID

Word Sense Disambigution task with mimic of reasoning. SFT training with any COT prompting or Reasoning

Model Details

Base model: deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Acknowledgement

We acknowledge the support of the Supercomputing Wales project, which is part-funded by the European Regional Development Fund (ERDF) via Welsh Government.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for deshanksuman/finetuned-DeepSeek-R1-Distill-Qwen-1.5B_WSD-Think

Base model

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Finetuned

(621)

this model

deshanksuman
/

finetuned-DeepSeek-R1-Distill-Qwen-1.5B_WSD-Think

Model Card for Model ID

Model Details

Acknowledgement

Model tree for deshanksuman/finetuned-DeepSeek-R1-Distill-Qwen-1.5B_WSD-Think

Dataset used to train deshanksuman/finetuned-DeepSeek-R1-Distill-Qwen-1.5B_WSD-Think