Update README.md
```yaml
license: apache-2.0
```

# MoME-A2.7B (Multi-Chain Mixture of Experts)

## Introduction
**MoME** (Multi-Chain Mixture of Experts) is a specialized large language model tailored for multi-chain transaction analysis and cross-chain data workflows. By leveraging a Mixture of Experts (MoE) architecture, MoME delivers chain-specific insights for multiple blockchain networks—such as Aptos, Polkadot, Ripple, and more—all under one inference environment.

**MoME-A2.7B** will be **open-sourced** soon. We will update this card with direct links to the weights and checkpoints when they become publicly available.

## Model Details
- **Architecture**: MoE-based, derived from a dense LLM and optimized for multi-chain transaction parsing and domain-focused conversation.
- **Parameters**: Approximately **14.3B total parameters**, with an average of **2.7B activated** at runtime, enabling efficient inference across multiple “expert” domains (see the illustrative sketch after this list).
- **Performance**: Achieves performance on par with a larger 7B-class multi-chain model while requiring around **25%** fewer computational resources. Early benchmarking shows **1.74×** faster inference compared to more extensive multi-chain models.
- **Training Data**: Trained on a curated set of chain-centric corpora (e.g., on-chain logs, developer manuals, academic references). This specialized data covers Aptos, Polkadot, Ripple, and beyond.
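
The activated-parameter figure above follows from standard MoE routing: each token runs the always-on shared layers plus only the top-k experts selected by the router. The sketch below illustrates the bookkeeping with hypothetical sizes; the expert count, expert size, and routing depth are assumptions for illustration, not MoME's published configuration.

```python
# Illustrative MoE parameter bookkeeping (all sizes are hypothetical, not MoME's real config).
shared_params = 1.5e9   # embeddings, attention, router weights - always active
num_experts   = 32      # assumed number of experts across the MoE layers
expert_params = 0.4e9   # assumed parameters per expert (summed over layers)
top_k         = 3       # assumed number of experts routed to per token

total_params  = shared_params + num_experts * expert_params   # ~14.3B
active_params = shared_params + top_k * expert_params         # ~2.7B

print(f"total:  {total_params / 1e9:.1f}B")
print(f"active: {active_params / 1e9:.1f}B")
```

Because only the routed experts execute, compute cost tracks the activated count rather than the total, which is where the efficiency figures above come from.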
## Requirements
MoME relies on custom modules in the latest `transformers` library from Hugging Face. For best compatibility, install from source:

```bash
pip install git+https://github.com/huggingface/transformers
```

This ensures any custom model classes (e.g., `mome_moe`) are properly registered and loaded.
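
To confirm that the source install is the one being picked up, check the version string; builds from the `main` branch report a `.dev0` suffix. This check is generic to `transformers` and not MoME-specific.

```python
# Confirm transformers was installed from source (main-branch builds end in ".dev0").
import transformers

print(transformers.__version__)  # e.g. "4.xx.0.dev0" for a source install
```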
## Usage
While MoME-A2.7B can provide a foundation for multi-chain text generation tasks, **targeted fine-tuning**—such as SFT, RLHF, or extended domain pretraining—is strongly recommended for:
- **Cross-chain transaction decoding**
- **Chain-specific Q&A**
- **DeFi analytics across multiple blockchains**
- **NFT contract interpretation**
- **General blockchain R&D**
### Basic Example
```python
from transformers import AutoTokenizer, AutoModelForCausalLM

# Example usage - subject to change once weights are released
model_name = "momeaicrypto/mome-a2.7b"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
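
# (Illustrative continuation: the prompt text and generation settings below are
#  placeholders; only the final decode call matches the original example.)
prompt = "Explain what happens during a cross-chain token transfer."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))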
```
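
Building on the fine-tuning recommendation above, the outline below is a minimal, illustrative SFT setup using the standard `transformers` Trainer. The dataset contents, prompt format, and hyperparameters are placeholders rather than an official MoME recipe; the same pattern applies to any of the use cases listed under Usage.

```python
# Minimal SFT sketch with the Hugging Face Trainer (illustrative placeholders only).
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

model_name = "momeaicrypto/mome-a2.7b"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token  # causal LMs often ship without a pad token

# Toy instruction/response pairs; replace with a real chain-specific corpus.
examples = [
    {"text": "Q: What fields does an Aptos coin transfer payload contain?\nA: ..."},
    {"text": "Q: How is an XRP payment transaction signed?\nA: ..."},
]
dataset = Dataset.from_list(examples).map(
    lambda ex: tokenizer(ex["text"], truncation=True, max_length=512),
    remove_columns=["text"],
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="mome-a2.7b-sft",
                           per_device_train_batch_size=1,
                           num_train_epochs=1,
                           learning_rate=1e-5),
    train_dataset=dataset,
    # mlm=False turns the tokenized text into next-token-prediction labels
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

Using `DataCollatorForLanguageModeling` with `mlm=False` simply derives causal-LM labels from the input tokens, which is all a plain SFT pass needs here.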
## Limitations & Disclaimer
1. **Early Release**: MoME remains under development, and final weights will be shared pending internal validation.
2. **Chain Expertise Bias**: Certain blockchains or contract types may be underrepresented in the training data, leading to potentially incomplete or biased outputs.
3. **Production Readiness**: Further fine-tuning or adaptation is advised if using this model in production-critical settings.
4. **Responsible Use**: Comply with relevant legal and ethical guidelines for AI applications in finance and blockchain.

## Citation & Contact
Questions or collaboration inquiries can be directed to our forthcoming GitHub repo (link to be provided) or directly to the maintainers. If you integrate MoME into research or production, please cite it once the official white paper becomes available.

*We look forward to releasing MoME-A2.7B and expanding the multi-chain LLM ecosystem.*