pragmatic-agent / models

Commit History

add decoding interface
fe9397b

m97j commited on

fix: Modify code to ensure the input dtype/shape/key is what the ONNX model expects
200ed70

m97j commited on

fix: Modify code to ensure the input dtype/shape/key is what the ONNX model expects
ed9e701

m97j commited on

fix: Modify code to ensure the input dtype/shape/key is what the ONNX model expects
683b339

m97j commited on

fix: Modify code to ensure the input dtype/shape/key is what the ONNX model expects
a7532d6

m97j commited on

fix(llm_model): align token chunking and prefix handling with engine
deb604d

m97j commited on

feat(initializer): store prefix cache as input_ids tensors
ac17ed0

m97j commited on

feat(inference engine): add input normalization and attention_mask support
e923fc2

m97j commited on

update import block
2256134

m97j commited on

refactor: move LLM model initialization from global scope to function-level for lazy loading
f62140d

m97j commited on

Edit : add model load method
bf2f314

m97j commited on

Edit : add model load method
7b942de

m97j commited on

update initializer.py
662eb29

m97j commited on

fix: use weights_only=True in torch.load to safely load state_dict
f6e3bea

m97j commited on

Refactor model initialization to use hf_hub_download cache paths
9b58d8f

m97j commited on

Refactor load_llm: use AutoConfig and direct state_dict loading
7d823a8

m97j commited on

Update initializer.py to use explicit Hub filenames
4e65de4

m97j commited on

First codes update
69c12a2

m97j commited on