AmanPriyanshu/tool-reasoning-sft-TOOLS-context-management-handling Viewer • Updated 26 days ago • 75k • 125
AmanPriyanshu/tool-reasoning-sft-RESEARCH-rlvr-env-retrieval-source Viewer • Updated Mar 25 • 156k • 101
AmanPriyanshu/tool-reasoning-sft-RESEARCH-openresearcher-dataset-sft-deep-research-agent-data-cleaned Updated Mar 25 • 262 • 1
AmanPriyanshu/tool-reasoning-sft-RESEARCH-OpenHands-CodeScout_Training_Rollouts Viewer • Updated Mar 24 • 56.8k • 39
AmanPriyanshu/reasoning-sft-minimax-microsoft-orca-agentinstruct-1M-v1 Viewer • Updated Mar 16 • 945k • 130 • 1
AmanPriyanshu/reasoning-sft-minimax-stratified-kmeans-diverse-reasoning-842K-only Viewer • Updated Mar 15 • 843k • 138
AmanPriyanshu/tool-reasoning-sft-TOOLS-toucan-1.5m-sft-tool-use-data-cleaned-rectified-333k Viewer • Updated Mar 14 • 566k • 88
AmanPriyanshu/RLVR-Env-Retrieval-Source-Retrieval-Synthetic-NVDocs-v1 Viewer • Updated Mar 14 • 100k • 33
AmanPriyanshu/tool-reasoning-sft-CODING-nvidia-Nemotron-Agentic-v1 Viewer • Updated Mar 14 • 331k • 114 • 1
AmanPriyanshu/reasoning-sft-Nemotron-Instruction-Following-Chat-v1 Viewer • Updated Mar 14 • 158k • 61
AmanPriyanshu/tool-reasoning-sft-RESEARCH-grill-lab-browsecomp-plus-runs-data-cleaned-rectified Viewer • Updated Mar 11 • 49.9k • 114
AmanPriyanshu/tool-reasoning-sft-CODING-allenai-SERA-data-cleaned-rectified Viewer • Updated Mar 10 • 211k • 60
AmanPriyanshu/tool-reasoning-sft-TOOLS-hermes-reasoning-tool-style-data-cleaned-rectified-115k Viewer • Updated Mar 10 • 115k • 771
AmanPriyanshu/RLVR-Env-Retrieval-Source-code-search-net-javascript Viewer • Updated Mar 10 • 100k • 39
AmanPriyanshu/tool-reasoning-sft-CODING-CoVe-12k-data-cleaned-rectified Viewer • Updated Mar 7 • 12k • 32
AmanPriyanshu/tool-reasoning-sft-CODING-MEnvData-SWE-Trajectory-data-cleaned-rectified Viewer • Updated Mar 7 • 3.92k • 84 • 1