Model Card for Model ID
Word Sense Disambigution task with mimic of reasoning. SFT training with any COT prompting or Reasoning
Model Details
Base model: deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Acknowledgement
We acknowledge the support of the Supercomputing Wales project, which is part-funded by the European Regional Development Fund (ERDF) via Welsh Government.
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for deshanksuman/finetuned-DeepSeek-R1-Distill-Qwen-1.5B_WSD-Think
Base model
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B