hanspeterlyngsoeraaschoujensen/DeepScaleR-1.5B-lora-256-scaling_factor_5.0-mask_cosine_0.0 Updated Sep 19, 2025
hanspeterlyngsoeraaschoujensen/DeepScaleR-1.5B-lora-256-scaling_factor_5.0-mask_cosine_0.00_0.90 Updated Aug 29, 2025
hanspeterlyngsoeraaschoujensen/Reasoning_Data_25K_Qwen3_4B_Thinking_2507 Viewer • Updated 4 days ago • 25.2k • 28
hanspeterlyngsoeraaschoujensen/terminal-bench-pro-eval-trajectories Viewer • Updated Feb 13 • 22 • 15
hanspeterlyngsoeraaschoujensen/terminal-bench-sample-eval-trajectories Viewer • Updated Feb 13 • 5 • 18
hanspeterlyngsoeraaschoujensen/Qwen3_1.7B-fineweb_edu-train-ctx2048_layer_4 Updated Sep 24, 2025 • 72
hanspeterlyngsoeraaschoujensen/Qwen3_1.7B-fineweb_edu-train-ctx2048_layer_2 Updated Sep 24, 2025 • 71