-
jeongseokoh/llama3.1_8b_sft_SPEED-28-BoS_HotpotQA_lower_freeze
Updated • 15 -
jeongseokoh/llama3.1_8b_sft_SPEED-24-BoS_HotpotQA_lower_freeze
Updated • 18 -
jeongseokoh/llama3.1_8b_sft_SPEED-20-BoS_HotpotQA_lower_freeze
Updated • 22 -
jeongseokoh/llama3.1_8b_sft_SPEED-16-BoS_HotpotQA_lower_freeze
Updated • 19
jeongseokoh
jeongseokoh
·
AI & ML interests
Large Language Models, Efficient LLM, Trustworthy AI
Recent Activity
updated a collection about 17 hours ago
SPEED submitted a paper about 17 hours ago
Shallow Prefill, Deep Decoding: Efficient Long-Context Inference via Layer-Asymmetric KV Visibility updated a collection 7 days ago
SPEED Downstream Task ModelsOrganizations
SPEED
MATH
SPEED Freeze Layers
-
jeongseokoh/llama3.1_8b_sft_SPEED-28-BoS_HotpotQA_lower_freeze
Updated • 15 -
jeongseokoh/llama3.1_8b_sft_SPEED-24-BoS_HotpotQA_lower_freeze
Updated • 18 -
jeongseokoh/llama3.1_8b_sft_SPEED-20-BoS_HotpotQA_lower_freeze
Updated • 22 -
jeongseokoh/llama3.1_8b_sft_SPEED-16-BoS_HotpotQA_lower_freeze
Updated • 19
SPEED Downstream Task Models
SPEED
Latent Self-Consistency
LSC for Majority selection in Short- and Long-form generation
MATH