AI & ML interests
LLMs, SLMs
Organizations
None yet
Token Classification
• 0.1B • Updated • 12
joe-xhedi/Qwen-GRPO-geological-training
Text Generation
• 0.5B • Updated • 21
• 1
joe-xhedi/Qwen-GRPO-Training-2
0.5B • Updated joe-xhedi/Qwen-GRPO-training
0.5B • Updated joe-xhedi/SentimentSeer_LSTM
Updated
joe-xhedi/FashionMNIST-PYTorch-Model
Updated
joe-xhedi/transformer-es-en-model
Updated
joe-xhedi/transformer-de-en-model
Updated
joe-xhedi/gpt-dev-french-english-scrach-model-nn
Updated
joe-xhedi/llama_2_finetuned_product_description
joe-xhedi/llama2-qlora-finetunined-french
joe-xhedi/llama2-qlora-finetunined-argument_parsiong_test
joe-xhedi/llama2-qlora-finetunined-pcs_not_structured-fail
joe-xhedi/ludwig-method-not-worked
joe-xhedi/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
• Updated joe-xhedi/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
• Updated joe-xhedi/Reinforce-CartPole8
Reinforcement Learning
• Updated joe-xhedi/unit8-LunarLander-v2
Reinforcement Learning
• Updated joe-xhedi/poca-SoccerTwos
Reinforcement Learning
• Updated joe-xhedi/a2c-PandaPickAndPlace-v3
Reinforcement Learning
• Updated • 11
joe-xhedi/a2c-PandaReachDense-v3
Reinforcement Learning
• Updated • 7
joe-xhedi/llama-2-7b-chuk-test
Text Generation
• Updated • 3
Reinforcement Learning
• Updated joe-xhedi/ppo-SnowballTarget
Reinforcement Learning
• Updated • 1
joe-xhedi/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
• Updated • 8
Reinforcement Learning
• Updated joe-xhedi/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
• Updated Reinforcement Learning
• Updated • 1
joe-xhedi/ppo-LunarLander-v2
Reinforcement Learning
• Updated • 7