GenomeOcean-4B-FP8 / quantization.json
ThomasYn's picture
Upload GenomeOcean-4B-FP8
acd7036 verified
{
"method": "fp8_e4m3_per_channel_scale",
"dtype": "float8_e4m3fn",
"scale_dtype": "float32",
"num_tensors": 338
}