Text Generation
Transformers
English
code
coding-assistant
arc
arc-brains
hackathon
ppt
qwen2
lora
sft
Instructions to use ibrahim2806/Arc with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use ibrahim2806/Arc with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="ibrahim2806/Arc")# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("ibrahim2806/Arc", dtype="auto") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use ibrahim2806/Arc with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "ibrahim2806/Arc" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ibrahim2806/Arc", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/ibrahim2806/Arc
- SGLang
How to use ibrahim2806/Arc with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "ibrahim2806/Arc" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ibrahim2806/Arc", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "ibrahim2806/Arc" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "ibrahim2806/Arc", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use ibrahim2806/Arc with Docker Model Runner:
docker model run hf.co/ibrahim2806/Arc
| license: apache-2.0 | |
| base_model: Qwen/Qwen2.5-Coder-7B-Instruct | |
| tags: | |
| - code | |
| - coding-assistant | |
| - arc | |
| - arc-brains | |
| - hackathon | |
| - ppt | |
| - qwen2 | |
| - lora | |
| - sft | |
| language: | |
| - en | |
| pipeline_tag: text-generation | |
| library_name: transformers | |
| # π Arc β Friendly Coding Expert | |
| > **Created by Arc Brains: Ibrahim Shaikh, Harsh Goswami, Manas Tamore, Ayush Thakur** | |
| Arc is a powerful coding assistant fine-tuned from **Qwen2.5-Coder-7B-Instruct** on **253K+ high-quality coding examples**. It delivers **complete, production-ready solutions** β never patches. | |
| --- | |
| ## π― What Arc Excels At | |
| | Skill | Description | | |
| |-------|-------------| | |
| | π» **Complete Code** | Full runnable solutions with imports, error handling, docs | | |
| | ποΈ **Hackathon Projects** | Entire apps from scratch β Flask, React, CLI tools | | |
| | π **Presentations** | Generate PowerPoint slides programmatically | | |
| | π **Debugging** | Root-cause analysis with full fixes, not band-aids | | |
| | π **Multi-language** | Python, JavaScript, C++, Java, Ruby, Go, Rust, and more | | |
| | π **Architecture** | Full project structure with all files | | |
| --- | |
| ## π Quick Start | |
| ```python | |
| from transformers import AutoModelForCausalLM, AutoTokenizer | |
| from peft import PeftModel | |
| import torch | |
| # Load | |
| base = AutoModelForCausalLM.from_pretrained( | |
| "Qwen/Qwen2.5-Coder-7B-Instruct", | |
| torch_dtype=torch.bfloat16, | |
| device_map="auto" | |
| ) | |
| model = PeftModel.from_pretrained(base, "ibrahim2806/Arc") | |
| tokenizer = AutoTokenizer.from_pretrained("Qwen/Qwen2.5-Coder-7B-Instruct") | |
| # Chat | |
| messages = [ | |
| {"role": "system", "content": "You are Arc, a friendly coding expert by Arc Brains."}, | |
| {"role": "user", "content": "Build a full Flask REST API for a todo app with CRUD, auth, and SQLite"} | |
| ] | |
| text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True) | |
| inputs = tokenizer(text, return_tensors="pt").to(model.device) | |
| output = model.generate(**inputs, max_new_tokens=4096, temperature=0.7, do_sample=True) | |
| print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)) | |
| ``` | |
| --- | |
| ## π Training Details | |
| | Parameter | Value | | |
| |-----------|-------| | |
| | **Base Model** | Qwen/Qwen2.5-Coder-7B-Instruct | | |
| | **Method** | QLoRA (4-bit NF4) | | |
| | **LoRA Rank** | 64 | | |
| | **LoRA Alpha** | 128 | | |
| | **Target Modules** | q/k/v/o_proj, gate/up/down_proj | | |
| | **Learning Rate** | 2e-4 (cosine decay) | | |
| | **Epochs** | 3 | | |
| | **Effective Batch Size** | 32 | | |
| | **Max Sequence Length** | 4096 | | |
| | **Total Training Samples** | ~253,000 | | |
| | **Optimizer** | AdamW | | |
| | **Precision** | BF16 | | |
| --- | |
| ## π Training Datasets | |
| | Dataset | Samples | What It Teaches | | |
| |---------|---------|-----------------| | |
| | [m-a-p/Code-Feedback](https://huggingface.co/datasets/m-a-p/Code-Feedback) | 68K | Multi-turn debugging, project building, iterative refinement | | |
| | [Magicoder-Evol-Instruct-110K](https://huggingface.co/datasets/ise-uiuc/Magicoder-Evol-Instruct-110K) | 110K | Complex algorithmic & system design problems | | |
| | [Magicoder-OSS-Instruct-75K](https://huggingface.co/datasets/ise-uiuc/Magicoder-OSS-Instruct-75K) | 75K | Real-world code from open-source projects | | |
| --- | |
| ## ποΈ Arc Brains Team | |
| Built with β€οΈ by **Ibrahim Shaikh**, **Harsh Goswami**, **Manas Tamore**, and **Ayush Thakur**. | |