Liger Kernel Integration

Section under construction. Feel free to contribute!

Liger Kernel is a collection of Triton kernels designed specifically for LLM training. It can increase multi-GPU training throughput by 20% and reduce memory usage by 60%, which makes it possible to 4x the context length, as described in the benchmark below. It provides Hugging Face compatible implementations of RMSNorm, RoPE, SwiGLU, CrossEntropy, and FusedLinearCrossEntropy, with more to come. The kernels work out of the box with FlashAttention, PyTorch FSDP, and Microsoft DeepSpeed.

With this memory reduction, you can potentially turn off cpu_offloading or gradient checkpointing to further improve performance.

[Benchmark figures: Speed Up, Memory Reduction]

  1. To use Liger-Kernel in SFTTrainer, first install it:

         pip install liger-kernel

  2. Once installed, set use_liger_kernel=True in SFTConfig. No other changes are needed!

         from trl import SFTConfig

         training_args = SFTConfig(
             use_liger_kernel=True,
             ...
         )
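
If the same training script has to run on machines where Liger-Kernel may not be installed, one option is to enable the flag only when the package is importable. The following is a minimal sketch under stated assumptions, not part of TRL: the helper names `liger_available`, `sft_config_kwargs`, and `make_config` are hypothetical, and the sketch assumes the `liger-kernel` package is importable as `liger_kernel`.

```python
import importlib.util


def liger_available(module_name: str = "liger_kernel") -> bool:
    """Return True if the given top-level module can be found in this environment."""
    # find_spec returns None when a top-level module is not installed.
    return importlib.util.find_spec(module_name) is not None


def sft_config_kwargs(output_dir: str, module_name: str = "liger_kernel") -> dict:
    """Build keyword arguments for SFTConfig, enabling Liger only when it is installed."""
    return {
        "output_dir": output_dir,
        "use_liger_kernel": liger_available(module_name),
    }


def make_config(output_dir: str):
    """Create an SFTConfig from the kwargs above (hypothetical helper)."""
    # trl is imported lazily so the two helpers above work without it.
    from trl import SFTConfig

    return SFTConfig(**sft_config_kwargs(output_dir))
```

Calling `make_config("out")` would then give a config with `use_liger_kernel=True` when the package is present and `False` otherwise, so the rest of the script stays unchanged.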

To learn more about Liger-Kernel, visit their official repository.
