srisree commited on
Commit
3ab0dab
·
verified ·
1 Parent(s): babd4af

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +42 -22
README.md CHANGED
@@ -1,22 +1,42 @@
1
- ---
2
- base_model: unsloth/Qwen3-0.6B-unsloth-bnb-4bit
3
- tags:
4
- - text-generation-inference
5
- - transformers
6
- - unsloth
7
- - qwen3
8
- - gguf
9
- license: apache-2.0
10
- language:
11
- - en
12
- ---
13
-
14
- # Uploaded model
15
-
16
- - **Developed by:** srisree
17
- - **License:** apache-2.0
18
- - **Finetuned from model :** unsloth/Qwen3-0.6B-unsloth-bnb-4bit
19
-
20
- This qwen3 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
21
-
22
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: unsloth/Qwen3-0.6B-unsloth-bnb-4bit
3
+ tags:
4
+ - text-generation-inference
5
+ - transformers
6
+ - unsloth
7
+ - qwen3
8
+ - gguf
9
+ license: apache-2.0
10
+ language:
11
+ - en
12
+ datasets:
13
+ - srisree/nextjs_typescript_fim_dataset
14
+ ---
15
+ ![](https://cas-bridge.xethub.hf.co/xet-bridge-us/68df672ca9ec9af1d47c7f49/7145158a25898f06dd15a962e660168dcc23d88301d5c6d4001e10cc40baa27e?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Content-Sha256=UNSIGNED-PAYLOAD&X-Amz-Credential=cas%2F20251007%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Date=20251007T075926Z&X-Amz-Expires=3600&X-Amz-Signature=934a3f4f968d245673e42ebb3ecbd5b8e7fa1b1dc978f254e80fa84ca56048b4&X-Amz-SignedHeaders=host&X-Xet-Cas-Uid=64fdcf65010f41e43574ead1&response-content-disposition=inline%3B+filename*%3DUTF-8%27%27logo.png%3B+filename%3D%22logo.png%22%3B&response-content-type=image%2Fpng&x-id=GetObject&Expires=1759827566&Policy=eyJTdGF0ZW1lbnQiOlt7IkNvbmRpdGlvbiI6eyJEYXRlTGVzc1RoYW4iOnsiQVdTOkVwb2NoVGltZSI6MTc1OTgyNzU2Nn19LCJSZXNvdXJjZSI6Imh0dHBzOi8vY2FzLWJyaWRnZS54ZXRodWIuaGYuY28veGV0LWJyaWRnZS11cy82OGRmNjcyY2E5ZWM5YWYxZDQ3YzdmNDkvNzE0NTE1OGEyNTg5OGYwNmRkMTVhOTYyZTY2MDE2OGRjYzIzZDg4MzAxZDVjNmQ0MDAxZTEwY2M0MGJhYTI3ZSoifV19&Signature=lfbWQ7JEfCthxaMxKwHBz9TC8J%7EBVQ1ythRdjN8xB8CjpdnY5HD3KR6H5B7R6U4c4zGUV6d9s5CJlOBx-9F4ltzUTtx9HF%7EKqCxk0z52dFjoH%7ECrGX2rJIyKkdEiZWa%7Eu7EKVfutwrzADQizSy0TbwU5Cvah7B4qOb7zEey2QRvGG-73SrByeLGTuYuzQGWPB9uQR71oTe3hShbKKz2qJs5eEgDaoLrZgo2Q5hazCeuHNim0VqgDkw2-c4UKF2xmRQz3TKY2eHtmw9woOZex4I2Yau0hI6XPFWapmtudCS4rHi%7EtDaA4SgRU4T8N5nnAVOVGOMpvPGoCRQSo%7EaQpew__&Key-Pair-Id=K2L8F4GPSG1IFC)
16
+
17
+
18
+ NanoCoder is a Fill-in-the-Middle (FIM) language model specifically designed for React frontend development and coding assistance. It helps users with intelligent code autocompletion and context-aware generation. The model was fine-tuned using Unsloth on the Qwen 3 0.6B base model, leveraging a high-quality FIM dataset curated from GitHub repositories to enhance coding capabilities and developer productivity.
19
+
20
+ ## 🧠 Datasets
21
+
22
+ We trained **NanoCoder** using a **high-quality Fill-in-the-Middle (FIM)** dataset curated from GitHub repositories:
23
+ [**srisree/nextjs_typescript_fim_dataset**](https://huggingface.co/datasets/srisree/nextjs_typescript_fim_dataset) on Hugging Face.
24
+
25
+ This dataset focuses on **React/Next.js** and **TypeScript** projects, providing rich, real-world coding examples that help the model understand frontend architecture, component composition, and React ecosystem patterns.
26
+
27
+ By leveraging this dataset, **NanoCoder** learns to:
28
+ - Predict and fill missing code intelligently using FIM objectives.
29
+ - Understand React component structures and TypeScript typing patterns.
30
+ - Generate clean, production-grade frontend code snippets.
31
+
32
+ ## ⚙️ FIM Training Colab Script
33
+
34
+ We’re preparing an interactive **Google Colab notebook** for reproducing the **Fill-in-the-Middle (FIM)** fine-tuning process used to train **NanoCoder** with **Unsloth** on the **Qwen 3 0.6B** base model.
35
+
36
+ The Colab script will include:
37
+ - ✅ Environment setup with Unsloth and Qwen 3 0.6B
38
+ - ✅ Loading and preprocessing the [Next.js TypeScript FIM Dataset](https://huggingface.co/datasets/srisree/nextjs_typescript_fim_dataset)
39
+ - ✅ Training configuration (LoRA, batch size, sequence length, etc.)
40
+ - ✅ Evaluation and inference examples
41
+
42
+ 🚀 **Coming soon...** Stay tuned for the full release!