---
license: apache-2.0
tags:
- music
- text-generation
- instruction-tuning
- lora
- preview
- untrained
- qwen3.5
- touchgrass
datasets:
- synthetic
language:
- en
library_name: transformers
pipeline_tag: text-generation
---

# TouchGrass-3B 🎵

**Status: PREVIEW - UNTRAINED MODEL**

This is a **preview repository** for TouchGrass-3B, a lightweight music AI assistant fine-tuned from Qwen3.5-3B-Instruct. **This model has NOT been trained yet** - it contains randomly initialized LoRA adapters and is not ready for inference.

## ⚠️ Important Notice

- **Model is UNTRAINED**: The LoRA adapters are randomly initialized, so performance will be no better than the base Qwen3.5-3B-Instruct model.
- **For demonstration purposes only**: This repository contains the complete codebase and configuration for training the model.
- **Expected performance after training**: 94-95% accuracy on music-specific tasks (a projection based on the architecture design and synthetic data pipeline, not a measured result).

## 🎯 Model Overview

TouchGrass is a specialized music AI assistant built by fine-tuning Qwen3.5 models with:

- **Music Tokenizer Extension**: 21+ music-specific tokens (guitar, piano, drums, vocals, theory, DJ, tablature, chords, etc.)
- **Five Specialized Modules**:
  - 🎸 Tab & Chord Generation (guitar tabs, chord diagrams)
  - 🎹 Music Theory Engine (scales, intervals, progressions)
  - 👂 Ear Training (interval identification, solfege exercises)
  - 💙 EQ Adapter (frustration detection, emotional adaptation)
  - ✍️ Songwriting Assistant (progressions, lyrics, hooks)
- **LoRA Fine-Tuning**: parameter-efficient adaptation via low-rank adapters
- **Multi-Task Learning**: weighted losses (LM: 1.0, EQ: 0.1, Music: 0.05)
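
As a rough illustration of the tokenizer extension, the sketch below assigns contiguous IDs to new music tokens on top of a base vocabulary. The token list and function name are assumptions for illustration (the actual definitions live in `tokenizer/music_token_extension.py`):

```python
# Illustrative subset only, not the full 21+ token set described above.
MUSIC_TOKENS = [
    "[GUITAR]", "[PIANO]", "[DRUMS]", "[VOCALS]", "[THEORY]",
    "[DJ]", "[TAB]", "[CHORD]", "[BEGINNER]",
]

def extend_vocab(base_vocab: dict[str, int], new_tokens: list[str]) -> dict[str, int]:
    """Append unseen tokens with contiguous IDs after the base vocabulary."""
    vocab = dict(base_vocab)
    next_id = max(vocab.values()) + 1 if vocab else 0
    for tok in new_tokens:
        if tok not in vocab:
            vocab[tok] = next_id
            next_id += 1
    return vocab
```

In the actual transformers pipeline, the equivalent step would be `tokenizer.add_special_tokens({"additional_special_tokens": [...]})` followed by `model.resize_token_embeddings(len(tokenizer))` so the embedding matrix grows to match.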

## 📊 Model Details

| Property | Value |
|----------|-------|
| Base Model | Qwen/Qwen3.5-3B-Instruct |
| Model Size | ~3.5B parameters (with LoRA) |
| Vocab Size | 32,000 (Qwen3.5 base + music tokens) |
| Max Sequence Length | 4,096 tokens |
| LoRA Rank | 16 (configurable) |
| Training Data | Synthetic music QA (10 categories, 80+ templates) |
| Training Steps | 50,000 (planned) |
| Batch Size | 8-16 (depending on GPU) |
| Learning Rate | 2e-4 (with warmup) |
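
To see why rank-16 LoRA keeps the trainable footprint small, here is the standard back-of-the-envelope calculation comparing one full projection matrix against its low-rank adapter (the 2048 hidden size is an illustrative assumption, not the exact Qwen3.5-3B dimension):

```python
def lora_params(d_in: int, d_out: int, r: int = 16) -> tuple[int, int]:
    """Parameter counts for one projection: (frozen full weight, trainable LoRA A+B)."""
    full = d_in * d_out           # base weight W: d_out x d_in (frozen)
    adapter = r * (d_in + d_out)  # A: r x d_in, plus B: d_out x r (trained)
    return full, adapter

full, adapter = lora_params(2048, 2048)
print(f"full: {full:,}  adapter: {adapter:,}  ratio: {adapter / full:.2%}")
# full: 4,194,304  adapter: 65,536  ratio: 1.56%
```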

## 🏗️ Architecture

The model extends Qwen3.5 with:

1. **Custom tokenizer** with music domain tokens
2. **Five LoRA-adapted modules** inserted at transformer layers
3. **Multi-task heads** for music-specific predictions
4. **Emotional intelligence** via the EQ adapter
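
The multi-task objective amounts to a weighted sum of per-head losses with the weights listed above (LM: 1.0, EQ: 0.1, Music: 0.05). A minimal sketch of that idea, with names that are assumptions rather than the actual `training/losses.py` API:

```python
# Weights as listed in the model overview.
LOSS_WEIGHTS = {"lm": 1.0, "eq": 0.1, "music": 0.05}

def combined_loss(task_losses: dict[str, float]) -> float:
    """Weighted sum of whichever task losses were computed for this batch."""
    return sum(LOSS_WEIGHTS[name] * value for name, value in task_losses.items())
```

Down-weighting the auxiliary EQ and music heads keeps the language-modeling objective dominant so the base model's generation quality is not degraded by the side tasks.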

## 🚀 Usage (After Training)

### HuggingFace Transformers

```python
from transformers import AutoModelForCausalLM
from TouchGrass.tokenization_touchgrass import TouchGrassTokenizer

# Load model and tokenizer (the custom tokenizer carries the music tokens)
model = AutoModelForCausalLM.from_pretrained("your-username/TouchGrass-3B")
tokenizer = TouchGrassTokenizer.from_pretrained("your-username/TouchGrass-3B")

# Generate with instrument and skill-level context tokens
prompt = "[GUITAR][BEGINNER] How do I play an F major chord?"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=200)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

### Ollama (After Training)

```bash
# Create the model from the Modelfile provided in the repository
ollama create touchgrass-3b -f ollama_3b_modelfile

# Run inference
ollama run touchgrass-3b "How do I build a chord progression in C major?"
```

## 📁 Repository Structure

This repository contains all necessary files for training:

```
touchgrass-3b/
├── configuration_touchgrass.py   # HuggingFace config class
├── tokenization_touchgrass.py    # HuggingFace tokenizer wrapper
├── train.py                      # Main training script
├── configs/
│   ├── touchgrass_3b_config.py   # Model architecture config
│   ├── touchgrass_7b_config.py   # 7B config (for reference)
│   └── training_config.py        # Training hyperparameters
├── tokenizer/
│   └── music_token_extension.py  # Music token definitions
├── models/                       # Five specialized modules
│   ├── tab_chord_module.py
│   ├── music_theory_module.py
│   ├── ear_training_module.py
│   ├── eq_adapter.py
│   └── songwriting_module.py
├── data/                         # Data pipeline
│   ├── music_qa_generator.py
│   ├── chat_formatter.py
│   └── dataset_loader.py
├── training/
│   ├── losses.py
│   ├── trainer.py
│   └── train.py
├── inference/
│   └── inference.py
├── benchmarks/
│   ├── evaluate_music_modules.py
│   └── evaluate_inference.py
├── tests/                        # Comprehensive test suite
├── ollama_3b_modelfile           # Ollama configuration
├── README.md                     # Full documentation
└── PREVIEW_README.md             # This preview notice
```

## 🧪 Testing

Run the test suite:

```bash
cd touchgrass-3b
python -m pytest tests/ -v
```

## 📚 Documentation

See [README.md](README.md) for complete documentation, including:

- Installation instructions
- Training guide
- Inference examples
- Module specifications
- Data generation details
- Troubleshooting

## ⚙️ Training (When Resources Available)

1. **Generate synthetic data**:
   ```bash
   python -c "from data.music_qa_generator import MusicQAGenerator; MusicQAGenerator().generate_dataset(num_samples=10000, output_path='data/music_qa.jsonl')"
   ```

2. **Start training**:
   ```bash
   python train.py --config configs/touchgrass_3b_config.py --data data/music_qa.jsonl --output_dir ./checkpoints
   ```

3. **Convert to HuggingFace format**:
   ```bash
   python -c "from configuration_touchgrass import TouchGrassConfig; from tokenization_touchgrass import TouchGrassTokenizer; config = TouchGrassConfig.from_pretrained('./checkpoints'); tokenizer = TouchGrassTokenizer.from_pretrained('./checkpoints'); config.save_pretrained('./model'); tokenizer.save_pretrained('./model')"
   ```

4. **Push to HuggingFace**:
   ```bash
   huggingface-cli login
   huggingface-cli upload your-username/TouchGrass-3B ./model --repo-type model
   ```
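
The generator invoked in step 1 is template-based (10 categories, 80+ templates, per the model details table). A minimal sketch of that idea, where the template text, categories, and field names are illustrative assumptions rather than the actual contents of `data/music_qa_generator.py`:

```python
import json

# Two illustrative templates; the real generator ships 80+ across 10 categories.
TEMPLATES = {
    "chords": "What is the fingering for the {chord} chord on {instrument}?",
    "theory": "What notes are in the {key} major scale?",
}

def generate_sample(category: str, **fields) -> dict:
    """Fill one question template and return a JSONL-ready record."""
    return {"category": category, "question": TEMPLATES[category].format(**fields)}

# One line of the output file (e.g. data/music_qa.jsonl):
print(json.dumps(generate_sample("chords", chord="F", instrument="guitar")))
# {"category": "chords", "question": "What is the fingering for the F chord on guitar?"}
```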

## 🤝 Contributing

This is a preview. Contributions are welcome for:

- Improving synthetic data quality
- Adding more music categories
- Optimizing training efficiency
- Extending to more instruments

## 📄 License

Apache 2.0

## 🙏 Acknowledgments

- Built upon [Qwen3.5](https://huggingface.co/Qwen) by Alibaba Cloud
- Inspired by the need for accessible music education AI
- Special thanks to the open-source music technology community

---

**⚠️ REMINDER**: This is an UNTRAINED PREVIEW model. Do not use it for production inference without completing the training process.