Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
8
Nenad Banfic
nenad1002
Follow
0 followers
·
3 following
nenad1002
AI & ML interests
NLP, LLM, Lora, Reft, DPO, RLHF
Recent Activity
new
activity
8 days ago
jiafatom/nemotron-cpu-fp32:
Fix genai_config.json for onnxruntime-genai compatibility
new
activity
8 days ago
jiafatom/nemotron-cpu-fp32:
Update genai_config.json: rename states_1/states_2 to lstm_hidden_state/lstm_cell_state
new
activity
8 days ago
jiafatom/nemotron-cpu-fp32:
Update tokenizer_config.json
View all activity
Organizations
nenad1002
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
jiafatom/nemotron-cpu-fp32
8 days ago
Fix genai_config.json for onnxruntime-genai compatibility
#1 opened 8 days ago by
nenad1002
Update genai_config.json: rename states_1/states_2 to lstm_hidden_state/lstm_cell_state
#4 opened 8 days ago by
nenad1002
Update tokenizer_config.json
#3 opened 8 days ago by
nenad1002
Fix tokenizer.json decoder stripping spaces
#2 opened 8 days ago by
nenad1002
updated
a model
11 days ago
nenad1002/nemotron-onnx-quantized
Updated
11 days ago
published
a model
11 days ago
nenad1002/nemotron-onnx-quantized
Updated
11 days ago
updated
a model
about 1 month ago
microsoft/Phi-3.5-mini-instruct-onnx
Text Generation
•
Updated
Feb 6
•
132
•
38
New activity in
Qwen/Qwen3-ASR-1.7B
about 1 month ago
Any plan to open the streaming model to run without vLLM?
1
#6 opened about 1 month ago by
nenad1002
New activity in
nenad1002/gemma3-gaia-pt-br-4b-it
8 months ago
make this a CPU model
#2 opened 8 months ago by
metang
Create inference_model.json
#1 opened 8 months ago by
metang
updated
a model
8 months ago
nenad1002/gemma3-gaia-pt-br-4b-it
Updated
Jul 23, 2025
•
1
published
a model
8 months ago
nenad1002/gemma3-gaia-pt-br-4b-it
Updated
Jul 23, 2025
•
1
New activity in
onnxruntime/Gemma-3-ONNX
8 months ago
Adapt vision and embedding models to v4.53.2 Gemma3 modeling code
2
#1 opened 8 months ago by
titaiwang03
updated
4 models
over 1 year ago
nenad1002/quantum-research-bot-v1.0
Text Generation
•
8B
•
Updated
Sep 8, 2024
•
5
nenad1002/quantum-research-bot-v0.9
Text Generation
•
7B
•
Updated
Sep 2, 2024
•
1
nenad1002/quantum-research-bot-v0.9-adapters
Updated
Sep 2, 2024
nenad1002/quantum-research-bot-v1.0-adapters
Updated
Sep 1, 2024
updated
a dataset
over 1 year ago
nenad1002/quantum_science_research_dataset
Viewer
•
Updated
Sep 1, 2024
•
2.83k
•
15
•
1
updated
2 models
over 1 year ago
nenad1002/Llama-3.1-8B-resort_bot_model
Text Generation
•
8B
•
Updated
Aug 29, 2024
nenad1002/Llama-3.1-8B-resort_bot_model-adapters
Updated
Aug 28, 2024
Load more