models 14
August4293/test-model_v1.0
Image Feature Extraction
• 2.43M • Updated • 3
August4293/Qwen_0.5B-GSM8K-Agent-iteration-2
Text Generation
• 0.5B • Updated • 3
August4293/Qwen_0.5B-GSM8K-Agent-iteration-1
Text Generation
• 0.5B • Updated • 3
August4293/Qwen_0.5B-GSM8K-Agent
Text Generation
• 0.5B • Updated • 3
August4293/qwen_0.5B-agent_without_tool_output_mask
Text Generation
• 0.5B • Updated • 4
August4293/qwen_0.5B-agent_with_tool_output_mask
Text Generation
• 0.5B • Updated • 1
August4293/Qwen2.5-0.5B-Instruct-with-output-tokens
Text Generation
• 0.5B • Updated • 4
August4293/DeepSeek-R1-Distill-Qwen-1.5B-with-output-tokens
Text Generation
• 2B • Updated • 2
August4293/Llama3.1-8B-PRM-Deepseek-Data-4bit
Text Generation
• 8B • Updated • 5
August4293/tiny-llama3.1-8B-PRM-Deepseek-Data
Text Generation
• 2.05M • Updated • 2
datasets 15
August4293/gsm8k_dense_rewards_sorted
Viewer
• Updated • 180 • 5
August4293/gsm8k_dense_rewards_sorted_batch_3
Viewer
• Updated • 500 • 3
August4293/gsm8k_dense_rewards_filtered_batch_3
Viewer
• Updated • 57 • 6
August4293/gsm8k_dense_rewards_sorted_batch_2
Viewer
• Updated • 500 • 5
August4293/gsm8k_dense_rewards_filtered_batch_2
Viewer
• Updated • 57 • 4
August4293/gsm8k_dense_rewards_sorted_batch_1
Viewer
• Updated • 500 • 3
August4293/gsm8k_dense_rewards_filtered_batch_1
Viewer
• Updated • 60 • 4
August4293/agent_math_dataset_extended
Viewer
• Updated • 64 • 5
August4293/agent_math_dataset
Viewer
• Updated • 4 • 3
August4293/tldr-preference-sft-trl-style-sample
Viewer
• Updated • 100 • 4