Maheswar KK
mahiatlinux
AI & ML interests
None yet
Recent Activity
updated a dataset about 14 hours ago
mahiatlinux/qwen-coder-chat-sft-22k
published a dataset about 14 hours ago
mahiatlinux/qwen-coder-chat-sft-22k
published a model 1 day ago
mahiatlinux/qwen3.5-2b-python-reasoning-sft-Q6_K-GGUF
Organizations
How did you estimate model parameter count before training and use correct hyperparams?
11
#1 opened almost 2 years ago by mahiatlinux
Update README.md
#1 opened about 1 year ago by mahiatlinux
Fixed the code formatting.
#2 opened about 1 year ago by mahiatlinux
Adding `safetensors` variant of this model
#1 opened over 1 year ago by SFconvertbot
Adding `safetensors` variant of this model
#1 opened over 1 year ago by SFconvertbot
Adding `safetensors` variant of this model
#1 opened over 1 year ago by SFconvertbot
Adding `safetensors` variant of this model
#1 opened about 1 year ago by SFconvertbot
Adding `safetensors` variant of this model
#1 opened about 1 year ago by SFconvertbot
Adding `safetensors` variant of this model
#1 opened about 1 year ago by SFconvertbot
Context size in the README does not seem to be correct.
2
#2 opened over 1 year ago by mahiatlinux
License?
4
#7 opened almost 2 years ago by jbohnslav
Dataset used.
2
#1 opened almost 2 years ago by mahiatlinux
[bot] Conversion to Parquet
#1 opened almost 2 years ago by parquet-converter
[bot] Conversion to Parquet
#1 opened almost 2 years ago by parquet-converter
Librarian Bot: Add language metadata for dataset
#2 opened almost 2 years ago by librarian-bot
[bot] Conversion to Parquet
#1 opened almost 2 years ago by parquet-converter
Fixed ShareGPT format.
3
#1 opened almost 2 years ago by mahiatlinux
[bot] Conversion to Parquet
#1 opened almost 2 years ago by parquet-converter
Librarian Bot: Add language metadata for dataset
#2 opened almost 2 years ago by librarian-bot