jeongseokoh
AI & ML interests
Large Language Models, Efficient LLM, Trustworthy AI
Recent Activity
updated a collection about 14 hours ago
SPEED
submitted a paper about 14 hours ago
Shallow Prefill, Deep Decoding: Efficient Long-Context Inference via Layer-Asymmetric KV Visibility
updated a collection 6 days ago
SPEED Downstream Task Models
SPEED Freeze Layers
jeongseokoh/llama3.1_8b_sft_SPEED-28-BoS_HotpotQA_lower_freeze
Updated • 15
jeongseokoh/llama3.1_8b_sft_SPEED-24-BoS_HotpotQA_lower_freeze
Updated • 18
jeongseokoh/llama3.1_8b_sft_SPEED-20-BoS_HotpotQA_lower_freeze
Updated • 22
jeongseokoh/llama3.1_8b_sft_SPEED-16-BoS_HotpotQA_lower_freeze
Updated • 19
SPEED Downstream Task Models
models 308
jeongseokoh/llama3.1_8b_sft_vanilla_OpenCode
Updated
jeongseokoh/llama3.1_8b_sft_SPEED-24-BoS_OpenCode
Updated • 23
jeongseokoh/llama3.1_8b_sft_SPEED-20-BoS_OpenCode
Updated • 18
jeongseokoh/llama3.1_8b_sft_SPEED-24-BoS_Nemotron-MATH
Updated • 23
jeongseokoh/llama3.1_8b_sft_SPEED-24-System
Updated • 25
jeongseokoh/llama3.1_8b_sft_SPEED-28-System
Updated • 29
jeongseokoh/llama3.1_8b_sft_SPEED-20
Updated • 21
jeongseokoh/llama3.1_8b_sft_vanilla_HotpotQA_lower_freeze-k24
Updated
jeongseokoh/llama3.1_8b_sft_SPEED-16
Updated • 26
jeongseokoh/llama3.1_8b_sft_SPEED-20-System
Updated • 32
datasets 37
jeongseokoh/math_gsm8k_mmlu_sameAnswers
Viewer • Updated • 6.46k • 3
jeongseokoh/math_gsm8k_mmlu
Viewer • Updated • 30k • 20
jeongseokoh/SameAnswerDifferentQuestion
Viewer • Updated • 1.86k • 34
jeongseokoh/concat_DPO_for_Mathematics
Viewer • Updated • 120k • 3
jeongseokoh/Original_DPO_for_Mathematics
Viewer • Updated • 78.1k • 6
jeongseokoh/prefix_DPO_for_Mathematics
Viewer • Updated • 41.6k • 2
jeongseokoh/prefix_DPO_preparation
Viewer • Updated • 41.6k • 8
jeongseokoh/GSM8K_for_test_DPO
Viewer • Updated • 1.32k • 51
jeongseokoh/MATH_for_test_DPO
Viewer • Updated • 5k • 4
jeongseokoh/Concatenated_DPO
Viewer • Updated • 120k • 315