Space: mindchain/rlm-training-test

mindchain committed on Feb 17

Commit 30f24da (verified) · 1 parent: fd46533

Add README.md

Files changed (1): README.md

@@ -1,10 +1,18 @@
 ---
-title: Rlm Training Test
-emoji: 🌍
-colorFrom: purple
-colorTo: indigo
 sdk: docker
-pinned: false
 ---
-Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

 ---
+title: RLM Training - Needle in Haystack
 sdk: docker
+hardware: t4-small
 ---
+# RLM Training - Recursive Language Model Skills
+Training Qwen3-0.6B-Base to find needles in haystacks using GRPO.
+## Task
+- Long context with hidden facts
+- Model learns to extract specific information
+- 20 steps quick test
+## Based on
+- RLM Paper (arXiv:2512.24601)
+- Sebastian Raschka's GRPO insights
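
The README describes the task only in outline, so the following is a minimal sketch of how a needle-in-a-haystack example and its reward might look, not the Space's actual training code. The names `make_example`, `reward`, and `group_advantages`, the filler sentences, the "secret code" needle format, and the binary substring reward are all assumptions made for illustration. The last function shows the group-relative step that gives GRPO its name: rewards for several completions of the same prompt are normalized by the group's mean and standard deviation.

```python
import random
import statistics

# Hypothetical filler text; a real run would likely use much longer, more varied context.
FILLER = [
    "The weather was mild that afternoon.",
    "Several ships were anchored in the harbor.",
    "The committee postponed its decision again.",
    "A narrow path led up through the orchard.",
]


def make_example(rng, n_filler=200):
    """Build one long context with a single hidden fact (the needle)."""
    code = str(rng.randint(1000, 9999))
    needle = f"The secret code is {code}."
    sentences = [rng.choice(FILLER) for _ in range(n_filler)]
    # Hide the needle at a random position in the haystack.
    sentences.insert(rng.randrange(len(sentences) + 1), needle)
    prompt = " ".join(sentences) + "\n\nQuestion: What is the secret code?\nAnswer:"
    return prompt, code


def reward(completion, answer):
    """Assumed binary reward: 1.0 if the completion contains the hidden fact."""
    return 1.0 if answer in completion else 0.0


def group_advantages(rewards, eps=1e-6):
    """Normalize rewards within one group of completions for the same prompt."""
    mean = statistics.fmean(rewards)
    std = statistics.pstdev(rewards)
    return [(r - mean) / (std + eps) for r in rewards]


if __name__ == "__main__":
    rng = random.Random(0)
    prompt, answer = make_example(rng, n_filler=50)
    # Stand-ins for sampled model completions (two correct, two wrong).
    completions = [f"It is {answer}.", "I do not know.", answer, "0000"]
    rewards = [reward(c, answer) for c in completions]
    print(rewards)
    print(group_advantages(rewards))
```

With a binary reward like this, a group in which every completion is right (or every one is wrong) yields all-zero advantages, so such groups contribute no learning signal; that is one reason a short 20-step test may show little movement if the task is too easy or too hard for the base model.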