Make model vllm compatible

by vrdn23 - opened Mar 12, 2025

base: refs/heads/main

←

from: refs/pr/4


vrdn23

Cisco org Mar 12, 2025

•

edited Mar 12, 2025

The model had an unused final_logits_bias layer, filled entirely with zeros, which was causing vLLM to fail to load the model. Removing it has no impact on quality, which I've verified. I also regenerated the ONNX models and moved them to the onnx folder.
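For anyone wanting to repeat this on a similar checkpoint, the removal step could look roughly like the sketch below. It is not the exact script used for this PR: the helper names are hypothetical, and plain nested lists stand in for the tensors a real checkpoint would hold. The only name taken from this PR is the final_logits_bias key. The helper refuses to drop the entry unless every value is zero, since that is what makes the removal safe.

```python
from typing import Any, Dict, Iterable


def _flatten(values: Any) -> Iterable[Any]:
    """Yield the scalars of an arbitrarily nested list/tuple."""
    if isinstance(values, (list, tuple)):
        for v in values:
            yield from _flatten(v)
    else:
        yield values


def drop_zero_entry(state: Dict[str, Any], key: str = "final_logits_bias") -> Dict[str, Any]:
    """Return a copy of `state` without `key`, only if that entry is all zeros.

    `state` is a stand-in for a checkpoint's name -> weights mapping
    (hypothetical helper; real weights would be tensors, not lists).
    """
    if key not in state:
        # Nothing to remove; hand back an independent shallow copy.
        return dict(state)
    if any(v != 0 for v in _flatten(state[key])):
        raise ValueError(f"{key} has non-zero values; removing it would change outputs")
    return {k: v for k, v in state.items() if k != key}


if __name__ == "__main__":
    # Toy weights, invented for illustration only.
    toy = {"final_logits_bias": [[0.0, 0.0, 0.0]], "lm_head.weight": [[0.1, 0.2, 0.3]]}
    print(sorted(drop_zero_entry(toy)))
```

The original mapping is left untouched, so the result can be compared against it before anything is written back to disk.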

Remove final_logits_bias key which was completely zero anyway (a32dcd5f)

vrdn23 changed pull request status to open Mar 12, 2025

Remove bin and onnx files (5e8486cb)

Add onnx files to onnx folder (c7bc6e93)

vrdn23

Cisco org Mar 12, 2025

I've verified that the results match with the new ONNX models.

vrdn23 changed pull request status to merged Mar 12, 2025
