AI & ML interests
None yet
maydixit/qwen3_32b_lora_test
maydixit/qwen3_8b_lora_rl
maydixit/qwen3_32b_lora_v2
maydixit/qwen3-8b-lora-self-preservation-rl
Reinforcement Learning
• Updated • 9
• 1
maydixit/qwen3_8b_lora_rl_messages_batch_4
maydixit/qwen3_8b_lora_rl_messages
maydixit/qwen3_8b_lora_merged
Text Generation
• 8B • Updated • 2
maydixit/llama_3b_lora_3kdata
maydixit/llama_lora_3b_mixed
maydixit/llama_lora_3b_sandbag
maydixit/llama_lora_3b_direct_qa
maydixit/llama_lora_3b_instruct_qa_sft_self_preservation_v0
maydixit/llama_lora_sft_self_preservation_v1
maydixit/llama_lora_sft_self_preservation