Intelligent-Internet
/

II-Thought-1.5B-Preview

Model card Files Files and versions

Phu Nguyen commited on Mar 25, 2025

Commit

3fe94a8

·

verified ·

1 Parent(s): 53d23f9

Update README.md

Files changed (1) hide show

README.md +4 -0

README.md CHANGED Viewed

@@ -1,5 +1,9 @@
 # II-Thought-1.5B-Preview
 ## Overview
 **II-Thought-1.5B-Preview** is a Reinforcement Learning enhanced language model trained on **a subset of [II-Thought-RL-v0](https://huggingface.co/datasets/Intelligent-Internet/II-Thought-RL-v0)**, the first large-scale, multi-task dataset designed for RL. While II-Thought-RL-v0 spans multiple domains (mathematics, coding, medicine, science, etc.), this preview release was trained on randomly sampled **50K math subset** ([dataset link](https://huggingface.co/datasets/Intelligent-Internet/II-Thought-RL-v0-Math-50K)).

 # II-Thought-1.5B-Preview
+<div style="display: flex; justify-content: center;">
+    <img src="https://cdn-uploads.huggingface.co/production/uploads/67c563afa34e1ad5a3533ccf/xBJE1uk9_FGPn2N1emMFR.png" width="800">
+</div>
 ## Overview
 **II-Thought-1.5B-Preview** is a Reinforcement Learning enhanced language model trained on **a subset of [II-Thought-RL-v0](https://huggingface.co/datasets/Intelligent-Internet/II-Thought-RL-v0)**, the first large-scale, multi-task dataset designed for RL. While II-Thought-RL-v0 spans multiple domains (mathematics, coding, medicine, science, etc.), this preview release was trained on randomly sampled **50K math subset** ([dataset link](https://huggingface.co/datasets/Intelligent-Internet/II-Thought-RL-v0-Math-50K)).