Ycoder-medium / README.md
louhless's picture
Update README.md
703e17a verified
metadata
language:
  - en
  - de
tags:
  - text-generation
  - causal-lm
  - fine-tuned
  - gguf
  - code
  - python
  - glsl
  - opengl
  - german
  - news
  - experimental
license: apache-2.0
base_model: Qwen/Qwen2.5-Coder-0.5B-Instruct
pipeline_tag: text-generation
library_name: transformers
model_creator: louhless

Ycoder-medium

Ycoder-medium is an experimental local fine-tune of Qwen/Qwen2.5-Coder-0.5B-Instruct created by louhless.

It is targeted at:

  • OpenGL / GLSL
  • Python
  • German replies
  • cautious 2025-2026 news and public-health summaries

Important Note

This model is not trained from scratch.

It is a small LoRA fine-tune on top of Qwen/Qwen2.5-Coder-0.5B-Instruct.

The goal is to improve behavior in a narrow target set. Any “15% improvement” claim should be treated as a target, not a verified benchmark result, unless evaluated on a fixed benchmark before and after training.

Model Details

  • Model name: Ycoder-medium
  • Creator: louhless
  • Base model: Qwen/Qwen2.5-Coder-0.5B-Instruct
  • Architecture: Qwen2 causal language model
  • Context length: 32768
  • Language: English and German
  • Export: GGUF available
  • Status: experimental

Training Focus

The model was tuned for:

  • Python utility code
  • Python code explanations
  • GLSL fragment shaders
  • GLSL vertex shaders
  • OpenGL concepts such as VAO/VBO
  • German short-form answers
  • simple math
  • cautious dated summaries for 2025-2026 public-health/news topics

News / Health Safety

For topics such as Hantavirus, the project uses both small fine-tuning examples and local dated context snippets.

This is intentional: recent news and public-health information should not be trusted from model weights alone.

The model should:

  • answer cautiously
  • mention dates when relevant
  • avoid medical diagnosis
  • avoid treatment promises
  • recommend official sources such as WHO, CDC, ECDC, or local health authorities

It should not be used for diagnosis or medical decision-making.

Training Data

The initial custom dataset includes examples for:

  • Python utility functions and explanations
  • GLSL shaders and OpenGL concepts
  • German short answers
  • simple math
  • dated 2025-2026 Hantavirus summaries based on WHO, CDC, and ECDC public information

Example Prompts

Python

Prompt:

Write Python code to read a JSON file safely.