ConicAI_LLM_model / training_config.json
girish00's picture
make project runnable and endpoint-ready
953dad7 verified
raw
history blame contribute delete
267 Bytes
{
"model_name": "Qwen/Qwen2.5-Coder-0.5B-Instruct",
"dataset_size": 8000,
"train_file": "train.json",
"output_dir": "model",
"hf_repo_id": "",
"epochs": 3,
"batch_size": 2,
"learning_rate": 0.0001,
"max_length": 512,
"use_4bit": true
}