
pragmatic-agent / models

Commit History

add decoding interface

fe9397b

m97j committed on Jan 20

fix: Modify code to ensure the input dtype/shape/key is what the ONNX model expects

200ed70

m97j committed on Dec 20, 2025

fix: Modify code to ensure the input dtype/shape/key is what the ONNX model expects

ed9e701

m97j committed on Dec 20, 2025

fix: Modify code to ensure the input dtype/shape/key is what the ONNX model expects

683b339

m97j committed on Dec 20, 2025

fix: Modify code to ensure the input dtype/shape/key is what the ONNX model expects

a7532d6

m97j committed on Dec 19, 2025

fix(llm_model): align token chunking and prefix handling with engine

deb604d

m97j committed on Dec 14, 2025

feat(initializer): store prefix cache as input_ids tensors

ac17ed0

m97j committed on Dec 14, 2025

feat(inference engine): add input normalization and attention_mask support

e923fc2

m97j committed on Dec 14, 2025

update import block

2256134

m97j committed on Dec 13, 2025

refactor: move LLM model initialization from global scope to function-level for lazy loading

f62140d

m97j committed on Dec 13, 2025

Edit : add model load method

bf2f314

m97j committed on Dec 13, 2025

Edit : add model load method

7b942de

m97j committed on Dec 13, 2025

update initializer.py

662eb29

m97j committed on Dec 13, 2025

fix: use weights_only=True in torch.load to safely load state_dict

f6e3bea

m97j committed on Dec 13, 2025

Refactor model initialization to use hf_hub_download cache paths

9b58d8f

m97j committed on Dec 13, 2025

Refactor load_llm: use AutoConfig and direct state_dict loading

7d823a8

m97j committed on Dec 13, 2025

Update initializer.py to use explicit Hub filenames

4e65de4

m97j committed on Dec 13, 2025

First codes update

69c12a2

m97j committed on Dec 12, 2025
