Jupyter-Agent Running Jupyter Agent NeMo Gym 🧪 Sleeping Jupyter Agent ORS 📓 Sleeping RL Jupyter Agent 📓 Run Python code in an interactive notebook AdithyaSK/jupyter-agent-rl-10 Viewer • Updated 27 days ago • 10 • 52
Code Reasoning AdithyaSK/Qwen-0.5b-Code-Reasoning Text Generation • 0.5B • Updated Feb 7, 2025 • 7 • • 1 AdithyaSK/Qwen-1.5b-Code-Reasoning Text Generation • 2B • Updated Feb 7, 2025 • 9 • 1 AdithyaSK/Qwen-0.5b-Code-Reasoning-v1 Text Generation • 0.5B • Updated Feb 7, 2025 • 8 • • 1 AdithyaSK/Llama-3b-Code-Reasoning Text Generation • 3B • Updated Feb 7, 2025 • 3 • 1
Jupyter-Agent Running Jupyter Agent NeMo Gym 🧪 Sleeping Jupyter Agent ORS 📓 Sleeping RL Jupyter Agent 📓 Run Python code in an interactive notebook AdithyaSK/jupyter-agent-rl-10 Viewer • Updated 27 days ago • 10 • 52
Code Reasoning AdithyaSK/Qwen-0.5b-Code-Reasoning Text Generation • 0.5B • Updated Feb 7, 2025 • 7 • • 1 AdithyaSK/Qwen-1.5b-Code-Reasoning Text Generation • 2B • Updated Feb 7, 2025 • 9 • 1 AdithyaSK/Qwen-0.5b-Code-Reasoning-v1 Text Generation • 0.5B • Updated Feb 7, 2025 • 8 • • 1 AdithyaSK/Llama-3b-Code-Reasoning Text Generation • 3B • Updated Feb 7, 2025 • 3 • 1
Running 40 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 Building and scaling RL environments for LLM training