izaznov/financial-model-json-generator
Text Generation
•
8B
•
Updated
•
2
izaznov/Qwen2-0.5B-GRPO-test
Updated
izaznov/qihoo360_Light-R1-7B-DS_awq_quantized
Text Generation
•
8B
•
Updated
•
1
izaznov/qihoo360_Light-R1-14B-DS_awq_quantized
Text Generation
•
15B
•
Updated
•
2
izaznov/r1_qwen_7b_220K_3ep_fn
Text Generation
•
8B
•
Updated
•
2
Text Generation
•
8B
•
Updated
•
1
izaznov/qwen_32b_competition_math
Text Generation
•
34B
•
Updated
•
6
Text Generation
•
15B
•
Updated
•
1
izaznov/qwen_arc_lora_model
Updated
1.26M
•
Updated
izaznov/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
izaznov/ppo_torch_LunarLander-v2
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
izaznov/a2c-PandaPickAndPlace-v3
Reinforcement Learning
•
Updated
izaznov/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
izaznov/ppo-Pyramids_Training
Reinforcement Learning
•
Updated
•
1
izaznov/ppo-SnowballTarget
Reinforcement Learning
•
Updated
izaznov/Reinforce-policy_pixel_copter
Reinforcement Learning
•
Updated
izaznov/Reinforce-policy_Cart_Pole
Reinforcement Learning
•
Updated
izaznov/qrdqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
izaznov/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
1
izaznov/taxi_3_Q_learning
Reinforcement Learning
•
Updated
izaznov/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
izaznov/PPO_LunarLander_v2
Reinforcement Learning
•
Updated
•
1