Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,47 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: mit
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
datasets:
|
| 4 |
+
- Bifrost-AI/Solana-Vanguard-Challenge
|
| 5 |
+
language:
|
| 6 |
+
- en
|
| 7 |
+
metrics:
|
| 8 |
+
- accuracy
|
| 9 |
+
- code_eval
|
| 10 |
+
base_model:
|
| 11 |
+
- microsoft/NextCoder-7B
|
| 12 |
+
pipeline_tag: text-generation
|
| 13 |
+
tags:
|
| 14 |
+
- code
|
| 15 |
+
- finance
|
| 16 |
+
- chat
|
| 17 |
+
- text-generation
|
| 18 |
+
- large-language-model
|
| 19 |
+
library_name: transformers
|
| 20 |
+
---
|
| 21 |
+
# NextCoder Mirage SOL 7B
|
| 22 |
+
### This fine-tuned variant of the NextCoder 7B model was supervised fine-tuned on blockchain-specific datasets(Bifrost-AI/Solana-Vanguard-Challenge), optimized for downstream tasks in blockchain coding and smart contract development on the Solana ecosystem.
|
| 23 |
+
The **Solana Vanguard Challenge** dataset, comprising 1,000 diverse and in-depth questions, offers full-spectrum coverage of the Solana ecosystem. It spans fundamental blockchain concepts, advanced on-chain programming in Rust and the Anchor framework, client-side integration in TypeScript, detailed security strategies, and performance as well as regulatory considerations.
|
| 24 |
+
|
| 25 |
+
NextCoder Mirage SOL 7B is in active development with additional fine-tuning sessions, & benchmark statistics coming soon!
|
| 26 |
+
|
| 27 |
+
## Training Session:
|
| 28 |
+
- Time: 9 hours & 56 minutes
|
| 29 |
+
- GPU: NVIDIA GeForce RTX 3090
|
| 30 |
+
- Batches: 500
|
| 31 |
+
- Context-Size: 2043
|
| 32 |
+
- Batch-size: 1
|
| 33 |
+
- Learning-rate: 2e-5
|
| 34 |
+
- Training-loss: 1.09
|
| 35 |
+
- Eval-loss: 0.89
|
| 36 |
+
|
| 37 |
+
## Dataset Composition
|
| 38 |
+
- **Total Questions:** 1,000
|
| 39 |
+
- **Languages Covered:**
|
| 40 |
+
- **Rust:** On-chain smart contract development, security best practices, advanced state management, CPIs, PDAs, and more.
|
| 41 |
+
- **TypeScript:** Client-side integration using @solana/web3.js, wallet adapters, Metaplex for NFT protocols, dynamic transaction composition, and front-end dApp development.
|
| 42 |
+
- **Planned Extensions:**
|
| 43 |
+
- **C# (Solnet):** To be integrated later for .NET ecosystem coverage.
|
| 44 |
+
|
| 45 |
+
|
| 46 |
+
## Disclaimer
|
| 47 |
+
We do not recommend using Qwen3 Bifrost SOL 4B in commercial or real-world applications without further testing and development. This current model(v1) is intended for research and development purposes. While efforts have been made to align it using SFT and DPO, it may still produce outputs that are unexpected, biased, or inaccurate. Please use responsibly.
|