AI & ML interests
LLMs, SLMs
Organizations
None yet
Token Classification
•
0.1B
•
Updated
•
5
joe-xhedi/Qwen-GRPO-geological-training
Text Generation
•
0.5B
•
Updated
•
1
•
1
joe-xhedi/Qwen-GRPO-Training-2
0.5B
•
Updated
•
1
joe-xhedi/Qwen-GRPO-training
0.5B
•
Updated
•
2
joe-xhedi/SentimentSeer_LSTM
Updated
joe-xhedi/FashionMNIST-PYTorch-Model
Updated
joe-xhedi/transformer-es-en-model
Updated
joe-xhedi/transformer-de-en-model
Updated
joe-xhedi/gpt-dev-french-english-scrach-model-nn
Updated
joe-xhedi/llama_2_finetuned_product_description
joe-xhedi/llama2-qlora-finetunined-french
joe-xhedi/llama2-qlora-finetunined-argument_parsiong_test
Updated
joe-xhedi/llama2-qlora-finetunined-pcs_not_structured-fail
Updated
joe-xhedi/ludwig-method-not-worked
Updated
joe-xhedi/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
joe-xhedi/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
joe-xhedi/Reinforce-CartPole8
Reinforcement Learning
•
Updated
joe-xhedi/unit8-LunarLander-v2
Reinforcement Learning
•
Updated
joe-xhedi/poca-SoccerTwos
Reinforcement Learning
•
Updated
joe-xhedi/a2c-PandaPickAndPlace-v3
Reinforcement Learning
•
Updated
joe-xhedi/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
joe-xhedi/llama-2-7b-chuk-test
Text Generation
•
Updated
•
3
Reinforcement Learning
•
Updated
joe-xhedi/ppo-SnowballTarget
Reinforcement Learning
•
Updated
joe-xhedi/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
joe-xhedi/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
joe-xhedi/ppo-LunarLander-v2
Reinforcement Learning
•
Updated