internlm/SIM_COT-LLaMA3-CODI-8B
16B • Updated • 21 • 2
Beyond the Current Observation: Evaluating Multimodal Large Language Models in Controllable Non-Markov Games
ThoughtFold: Folding Reasoning Chains via Introspective Preference Learning