AI & ML interests
None yet
Organizations
None yet
ewqr2130/llama_ppo_1e6_new_tokenizerstep_8000
Text Generation
•
7B
•
Updated
•
67
ewqr2130/llama_sft_longer
Text Generation
•
7B
•
Updated
•
64
ewqr2130/llama_ppo_1e6step_4000
Text Generation
•
7B
•
Updated
•
63
ewqr2130/llama2-sft-16000
Text Generation
•
7B
•
Updated
•
10
ewqr2130/7B_ppo_phiRM_2GPU_3e-7step_4000
Text Generation
•
7B
•
Updated
•
825
ewqr2130/phi_ppo_phi_RM_1e6step_9500
Text Generation
•
3B
•
Updated
•
11
ewqr2130/mistral-7b-sft-beta__100000_1e-05_RewardModel_2GPU
Text Classification
•
7B
•
Updated
•
10
ewqr2130/alignment-handbook-zephyr-7b-sft-full-dpo-5e7-cont2
Text Generation
•
7B
•
Updated
•
831
ewqr2130/alignment-handbook-zephyr-7b_ppo_5e7step_102
Text Generation
•
7B
•
Updated
•
812
ewqr2130/phi_ppo_1e-5_REAL_1GPU_batch8step_2400
Text Generation
•
3B
•
Updated
•
7
ewqr2130/alignment-handbook-zephyr-7b_ppo_5e7step_51
Text Generation
•
7B
•
Updated
•
833
ewqr2130/alignment-handbook-zephyr-7b_ppostep_100
Text Generation
•
7B
•
Updated
•
826
ewqr2130/TinyLamma-DPO-40k-Steps
Updated
Text Generation
•
3B
•
Updated
•
7
ewqr2130/alignment-handbook-zephyr-7b-sft-full-dpo-5e7-cont1
Text Generation
•
7B
•
Updated
•
815
Text Generation
•
1B
•
Updated
•
916
•
Text Generation
•
47B
•
Updated
•
13
ewqr2130/llama2-7b-raw-sft
Text Generation
•
7B
•
Updated
•
1.05k
ewqr2130/mistral-inst-v02-dpo
Text Generation
•
7B
•
Updated
•
966
ewqr2130/mistral-7b-raw-sft
Text Generation
•
7B
•
Updated
•
1.03k
Text Classification
•
1B
•
Updated
•
8