Ronan McGovern
RonanMcGovern
AI & ML interests
Open source LLMs. Fine-tuning. Summarisation. Patents.
Organizations
Query re training losses and collation
1
#8 opened 11 days ago
by
RonanMcGovern
Could you upload a d20 checkpoint, even if janky?
1
#8 opened about 1 month ago
by
RonanMcGovern
Thanks !
1
#1 opened 3 months ago
by
cdtmc
What is the tokenization and alignment approach? i.e. collation
11
#9 opened 5 months ago
by
RonanMcGovern
Does transformers utilize PyTorch SDPA's flash_attention for openai/gpt-oss-20b?
3
#89 opened 6 months ago
by
NooBaymax
Clarification on Recent Changes to Loss and Gradient Accumulation
👍
2
1
#15 opened 6 months ago
by
jiosephlee
Fine tuning of Model
3
#22 opened 6 months ago
by
amritansh
Data norm issue during evaluation/inference
2
#8 opened 7 months ago
by
RonanMcGovern
Amazon Chronos T5 Small Forecasting Google Colab Notebook
👍
1
2
#4 opened over 1 year ago
by
JamesBentley
Ollama with GGUF
2
#6 opened 9 months ago
by
smaram
SGLang very slow ~6 toks with 1 concurrency on H100SXM
1
#3 opened 9 months ago
by
RonanMcGovern
Low inference throughput?
3
#2 opened 10 months ago
by
RonanMcGovern
Model Produces no Outputs
3
#16 opened 11 months ago
by
RonanMcGovern
[BUG] {'use_reentrant': True} results in "Gradients will be None"
2
#74 opened 12 months ago
by
RonanMcGovern
Model does not follow or acknowledge system prompts?
2
#3 opened 12 months ago
by
RonanMcGovern
Transformers 3.0 can't find files
8
#5 opened over 1 year ago
by
r0-0rd
Unable to load finetuned model after saving
3
#57 opened about 1 year ago
by
Charlington
Set base_model & tags metadata
1
#1 opened about 1 year ago
by
tomaarsen
Finetuning for pointing, object detection task
4
#48 opened about 1 year ago
by
yaneivan