esm2_t6_long_int4 / README.md
title: ESM2 Quantized Models

ESM2 Quantized

ESM2 Quantized is an adaptation of the ESM2 architecture that replaces global attention with local attention, which allows longer inputs. ESM2 Quantized models have a context size of 2,050 tokens, double that of the standard ESM2 models. These models were trained with int4 quantization. Several ESM2 Quantized models are available.
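
To illustrate why local attention makes longer inputs cheaper, here is a minimal sketch of a sliding-window attention mask in plain Python. It is not the implementation used by these models: the function names, the symmetric window shape, and the window size are assumptions chosen for illustration, and the paper describes the actual mechanism.

```python
def local_attention_mask(seq_len: int, window: int) -> list[list[bool]]:
    """Build a sliding-window mask (hypothetical helper, for illustration).

    mask[i][j] is True when token i may attend to token j, i.e. when j lies
    within `window` positions of i on either side (including i itself).
    """
    return [
        [abs(i - j) <= window for j in range(seq_len)]
        for i in range(seq_len)
    ]


def count_attended_pairs(mask: list[list[bool]]) -> int:
    """Count the (query, key) pairs that the mask allows."""
    return sum(sum(row) for row in mask)


# Toy sizes, assumed for illustration only.
seq_len, window = 8, 2
mask = local_attention_mask(seq_len, window)

local_pairs = count_attended_pairs(mask)  # grows roughly as seq_len * (2 * window + 1)
global_pairs = seq_len * seq_len          # global attention scores every pair

print(f"local: {local_pairs} pairs, global: {global_pairs} pairs")
```

With a fixed window, the number of scored pairs grows linearly with sequence length rather than quadratically, which is the general reason local attention can support longer context sizes.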

For detailed information, please refer to the paper.