Model Card: GenomeOcean-4B-bgcFM (FP8)

Generated: 2026-05-09T18:06:20-0700

Architecture

Parameter	Value
Architecture	MistralForCausalLM
Model Type	mistral
Vocab Size	4096
Hidden Size	3072
Num Hidden Layers	24
Num Attention Heads	12
Intermediate Size	16384
Max Position Embeddings	32768
RoPE Theta	1000000.0

Quantization Method

Format: FP8 (E4M3) per-channel weight-only quantization
Scale DType: float32 per-channel scales
Method: Post-training quantization (PTQ) with per-channel E4M3 weights

Perplexity Results

Metric	Value
Original PPL (BF16)	38464.3814
Quantized PPL (FP8)	38080.5271
PPL Difference	-383.8543
PPL Difference (%)	-1.0%

Quality Assessment: Good - minimal quality loss

Weight Fidelity

Metric	Value
Mean Cosine Similarity	1.003024
Min Cosine Similarity	0.999467
Mean Relative L2 Error	0.026573
Max Relative L2 Error	0.026676
Layers Compared	169

Compression

Metric	Value
Original Size	8.5066 GB
Quantized Size	4.2449 GB
Compression Ratio	49.9%
Space Saved	4.26 GB

Summary

The GenomeOcean-4B-bgcFM model was quantized from BF16 to FP8. Perplexity changed by -1.0% (original: 38464.3814, quantized: 38080.5271). Mean weight cosine similarity is 1.0030. Compression ratio is 49.9% (saved 4.26 GB).

Downloads last month: 15

Safetensors

Model size

4B params

Tensor type

F32

F8_E4M3

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support