Ronan McGovern's picture

Ronan McGovern

RonanMcGovern

·

https://ronanmcgovern.com

AI & ML interests

Open source LLMs. Fine-tuning. Summarisation. Patents.

Organizations

New activity in leduckhai/MultiMed 3 months ago

Transcripts misaligned / wrong

#7 opened 3 months ago by

New activity in kyutai/pocket-tts 5 months ago

Query re training losses and collation

#8 opened 5 months ago by

New activity in karpathy/nanochat-d32 6 months ago

Could you upload a d20 checkpoint, even if janky?

#8 opened 6 months ago by

New activity in Trelis/TRM-ARC-AGI-II 8 months ago

Thanks !

#1 opened 8 months ago by

New activity in kyutai/stt-1b-en_fr 9 months ago

What is the tokenization and alignment approach? i.e. collation

#9 opened 10 months ago by

New activity in openai/gpt-oss-20b 9 months ago

Does transformers utilize PyTorch SDPA's flash_attention for openai/gpt-oss-20b?

#89 opened 11 months ago by

New activity in transformers-community/support 10 months ago

Clarification on Recent Changes to Loss and Gradient Accumulation

#15 opened 11 months ago by

New activity in mistralai/Voxtral-Mini-3B-2507 11 months ago

Fine tuning of Model

#22 opened 11 months ago by

New activity in lerobot/smolvla_base 12 months ago

Data norm issue during evaluation/inference

#8 opened 12 months ago by

New activity in amazon/chronos-t5-small about 1 year ago

Amazon Chronos T5 Small Forecasting Google Colab Notebook

#4 opened about 2 years ago by

New activity in Trelis/Mixtral-8x7B-Instruct-v0.1-function-calling-v3 about 1 year ago

Ollama with GGUF

#6 opened about 1 year ago by

New activity in Qwen/Qwen3-30B-A3B-FP8 about 1 year ago

SGLang very slow ~6 toks with 1 concurrency on H100SXM

#3 opened about 1 year ago by

New activity in leon-se/gemma-3-27b-it-FP8-Dynamic about 1 year ago

Low inference throughput?

#2 opened about 1 year ago by

New activity in Qwen/Qwen2.5-Omni-7B about 1 year ago

Executing Qwen2.5-Omni-7B on SGLang: AttributeError: 'Qwen2_5OmniConfig' object has no attribute 'hidden_size'

#21 opened about 1 year ago by

New activity in google/gemma-3-4b-it over 1 year ago

Model Produces no Outputs

#16 opened over 1 year ago by

New activity in Qwen/Qwen2-VL-7B-Instruct over 1 year ago

[BUG] {'use_reentrant': True} results in "Gradients will be None"

#74 opened over 1 year ago by

New activity in onnx-community/Qwen2.5-Coder-1.5B-Instruct over 1 year ago

Model does not follow or acknowledge system prompts?

#3 opened over 1 year ago by

New activity in onnx-community/Phi-3.5-mini-instruct-onnx-web over 1 year ago

Transformers 3.0 can't find files

#5 opened over 1 year ago by

New activity in vikhyatk/moondream2 over 1 year ago

Unable to load finetuned model after saving

#57 opened over 1 year ago by

New activity in jxm/cde-small-v2 over 1 year ago

Set base_model & tags metadata

#1 opened over 1 year ago by