Jemin Lee

leejaymin

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

TAROT: Test-driven and Capability-adaptive Curriculum Reinforcement Fine-tuning for Code Generation with Large Language Models

commentedon a paper 10 months ago

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

authored a paper 11 months ago

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

View all activity

Organizations

upvoted a paper about 2 months ago

TAROT: Test-driven and Capability-adaptive Curriculum Reinforcement Fine-tuning for Code Generation with Large Language Models

Paper • 2602.15449 • Published Feb 17 • 7

commented a paper 10 months ago

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Paper • 2505.01658 • Published May 3, 2025 • 40 •

authored a paper 11 months ago

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Paper • 2505.01658 • Published May 3, 2025 • 40

upvoted a paper 11 months ago

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Paper • 2505.01658 • Published May 3, 2025 • 40

commented a paper 11 months ago

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Paper • 2505.01658 • Published May 3, 2025 • 40 •

commented a paper over 1 year ago

A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B

Paper • 2409.11055 • Published Sep 17, 2024 • 17 •

authored 4 papers over 1 year ago

A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B

Paper • 2409.11055 • Published Sep 17, 2024 • 17

Mixed Non-linear Quantization for Vision Transformers

Paper • 2407.18437 • Published Jul 26, 2024

Q-HyViT: Post-Training Quantization of Hybrid Vision Transformers with Bridge Block Reconstruction for IoT Systems

Paper • 2303.12557 • Published Mar 22, 2023

Quantune: Post-training Quantization of Convolutional Neural Networks using Extreme Gradient Boosting for Fast Deployment

Paper • 2202.05048 • Published Feb 10, 2022

upvoted a paper over 1 year ago

A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B

Paper • 2409.11055 • Published Sep 17, 2024 • 17

updated a model over 1 year ago

leejaymin/etri-ones-llama3.1-8b-ko

Text Generation • 8B • Updated Sep 6, 2024 • 1 • 1

upvoted a paper over 1 year ago

LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs

Paper • 2408.13467 • Published Aug 24, 2024 • 25

updated a collection over 1 year ago

Llama3.1 Quantization

Collection

Quantized models with GPTQ, AWQ, BnB, and SmoothQuant. • 0 items • Updated Aug 14, 2024

upvoted a collection over 1 year ago

Llama-3.1 Quantization

Collection

Neural Magic quantized Llama-3.1 models • 21 items • Updated Mar 2 • 46

updated 4 models about 2 years ago

Jemin Lee

AI & ML interests

Recent Activity

Organizations

leejaymin's activity