Commit c2f9de6 · verified · yolay committed · 1 parent: b2f5e9c

Update README.md

Files changed (1):
  1. README.md +1 -6
README.md CHANGED
@@ -1,12 +1,9 @@
 ---
 base_model:
-- meta-llama/Llama-3.1-8B-Instruct
+- deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
 datasets:
 - yolay/RAIF-ComplexInstruction-DeepSeek
-language:
-- en
 library_name: transformers
-license: apache-2.0
 metrics:
 - accuracy
 pipeline_tag: text-generation
@@ -14,8 +11,6 @@ pipeline_tag: text-generation
 
 This model belongs to the official implementation of the paper "Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models".
 
-Project Page: https://yanqval.github.io/PAE/
-
 You can find the paper at https://huggingface.co/papers/2506.01413.
 
 Existing large language models (LLMs) face challenges of following complex instructions, especially when multiple constraints are present and organized in paralleling, chaining, and branching structures. One intuitive solution, namely chain-of-thought (CoT), is expected to universally improve capabilities of LLMs. However, we find that the vanilla CoT exerts a negative impact on performance due to its superficial reasoning pattern of simply paraphrasing the instructions. It fails to peel back the compositions of constraints for identifying their relationship across hierarchies of types and dimensions.