AQ-MedAI
/

Ling-Flash-2.0-eagle3

Model card Files Files and versions

eerrr9 commited on 19 days ago

Commit

aeae16e

·

verified ·

1 Parent(s): 9bb9c96

Update README.md

Files changed (1) hide show

README.md +9 -0

README.md CHANGED Viewed

@@ -18,6 +18,15 @@ The model is trained on **1.4 million high-quality Open-PerfectBlend instruction
 - **High Accuracy Guarantee**: Maintaining 93%+ accuracy on mainstream benchmarks
 - **Production-Grade Optimization**: Achieving 3954 tokens/s output throughput on single NVIDIA H200
 ## Performance
 ### Speculative Sampling Efficiency

 - **High Accuracy Guarantee**: Maintaining 93%+ accuracy on mainstream benchmarks
 - **Production-Grade Optimization**: Achieving 3954 tokens/s output throughput on single NVIDIA H200
+## Efficient Download Guide
+To minimize download time and storage usage, please note the function of the files in the repository:
+**For Inference**: You only need to download config.json and model.safetensors.
+**For Continued Training**: The file training_state.pt contains optimizer states specifically for resuming training. If you only intend to use the model for inference, you can skip downloading this file.
 ## Performance
 ### Speculative Sampling Efficiency