	---
	language:
	- en
	- zh
	license: apache-2.0
	base_model:
	- Qwen/Qwen3.5-9B
	- Qwen/Qwen3.5-9B-Base
	tags:
	- text-generation-inference
	- transformers
	- unsloth
	- qwen
	- qwen3.5
	- reasoning
	- chain-of-thought
	- lora
	- luau
	- roblox
	pipeline_tag: image-text-to-text
	datasets:
	- khtsly/luau-stack-hq
	- khtsly/luau-repo-docs-text
	---

	# Mini-Coder-v2-Base (15B)

	Mini-Coder-v2 is built on an upscaled merge of Qwen3.5-9B and Qwen3.5-9B-Base, followed by Continual Pretraining (CPT). The CPT stage used ~36.63k high-quality, curated Luau raw code and documentation texts to improve the model's Luau coding ability and knowledge.

	This model upscales Qwen3.5-9B from 32 layers to 56 layers for deeper reasoning capability. It has 15.07B parameters with the vision component and 14.60B without it.
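
As a quick sanity check on the figures above, the following sketch works out the depth increase and the size of the vision component implied by the two parameter counts. The function `hypothetical_layer_plan` is purely illustrative: it shows one common way to stack two copies of a base model to reach a target depth, and it is an assumption, not the recipe actually used to build this model.

```python
# Figures quoted in this README.
BASE_LAYERS = 32
UPSCALED_LAYERS = 56
PARAMS_WITH_VISION_B = 15.07
PARAMS_WITHOUT_VISION_B = 14.60

# Simple arithmetic derived from those figures.
added_layers = UPSCALED_LAYERS - BASE_LAYERS
depth_ratio = UPSCALED_LAYERS / BASE_LAYERS
vision_params_b = round(PARAMS_WITH_VISION_B - PARAMS_WITHOUT_VISION_B, 2)


def hypothetical_layer_plan(base_layers, target_layers):
    """Return a list of (source, layer_index) pairs of length target_layers.

    ASSUMPTION: the first half of the stack comes from the bottom of copy "A"
    and the second half from the top of copy "B". The actual layer layout of
    this model is not documented here.
    """
    if not base_layers <= target_layers <= 2 * base_layers:
        raise ValueError("target depth must be between 1x and 2x the base depth")
    if target_layers % 2 != 0:
        raise ValueError("this sketch only handles an even target depth")
    keep = target_layers // 2
    bottom = [("A", i) for i in range(keep)]
    top = [("B", i) for i in range(base_layers - keep, base_layers)]
    return bottom + top


plan = hypothetical_layer_plan(BASE_LAYERS, UPSCALED_LAYERS)
print(f"added layers: {added_layers}, depth ratio: {depth_ratio}")
print(f"vision component: ~{vision_params_b}B parameters")
print(f"planned stack depth: {len(plan)}")
```

Under this assumed layout, each copy contributes 28 layers, so the middle layers of the base model appear twice in the stack.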

	# Uploaded finetuned model

	- Developed by: khtsly
	- License: apache-2.0
	- Finetuned from model: khtsly/Mini-Coder-v2-noft

	This qwen3_5 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.

	[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)