tinyllama-dippy / README.md

Update README.md

e2244f8 verified 7 months ago

7.95 kB

	---
	license: mit
	language:
	- de
	- en
	- sr
	pipeline_tag: text-generation
	library_name: adapter-transformers
	tags:
	- bittensor
	- dippy
	- chats
	---
	<div align="center">

	# Dippy SN11: Creating The World's Best Open-Source Roleplay LLM <!-- omit in toc -->


	Please check our [Launch Tweet](https://twitter.com/angad_ai/status/1788993280002175415) for our vision of creating the world's best open-source roleplay LLM.*

	[![DIPPY](/assets/banner.png)](https://dippy.ai)
	[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)

	---

	</div>

	- [Introduction](#introduction)
	- [Roadmap](#roadmap)
	- [Overview of Miner and Validator Functionality](#overview-of-miner-and-validator-functionality)
	- [Miner](#miner)
	- [Validator](#validator)
	- [Running Miners and Validators](#running-miners-and-validators)
	- [Running a Miner](#running-a-miner)
	- [Running a Validator](#running-a-validator)
	- [Contributing](#contributing)
	- [License](#license)

	---

	## Introduction

	> Note: The following documentation assumes you are familiar with basic Bittensor concepts: Miners, Validators, and incentives. If you need a primer, please check out https://docs.bittensor.com/learn/bittensor-building-blocks.

	Dippy is one of the world's leading AI companion apps with 1M+ users. The app has ranked [#3 on the App Store](https://x.com/angad_ai/status/1850924240742031526) in countries like Germany, been covered by publications like [Wired magazine](https://www.wired.com/story/dippy-ai-girlfriend-boyfriend-reasoning/) and the average Dippy user spends 1+ hour on the app.

	The Dippy Roleplay subnet on Bittensor aims to create the world's best open-source roleplay LLM by leveraging the collective efforts of the open-source community. This subnet addresses the critical issue of loneliness, which affects a significant portion of the population and is linked to various mental and physical health problems.

	Current SOTA LLMs (Claude, OpenAI etc.) are designed for the assistant use case and lack the empathetic qualities necessary for companionship. While some companies (like Character AI and Inflection) have developed closed-source roleplay LLMs, the open-source alternatives lag significantly behind in performance. Furthermore, recent developments in the LLM space have prioritized objective reasoning capabilities, which only bring minor improvements to the role play space. Thus, the development of roleplay oriented models becomes even more important in the open source world.

	![DIPPY](/assets/comp.png)

	## Roadmap

	Given the complexity of creating a state of the art roleplay LLM, we plan to divide the process into 3 distinct phases.

	Phase 1:
	- [x] Subnet launch with robust pipeline for roleplay LLM evaluation on public datasets and response length
	- [x] Public model leaderboard based on evaluation criteria
	- [x] Introduce Coherence and Creativity as a criteria for live model evaluation

	Phase 2:
	- [x] Publicly release front-end powered by top miner submitted model of the week
	- [x] Integrate top miner submitted model in Official Dippy App
	- [x] Add support for larger parameter models for up to 34B

	Phase 3:
	- [x] Expand the state of the art in roleplay LLMs through continuous iteration and data collection
	- [ ] Redefine definition of SOTA for roleplay LLMs through integrating Dippy app data

	## Overview of Miner and Validator Functionality

	![overview](/assets/architecturenew.png)

	Miners would use existing frameworks, fine tuning techniques, or MergeKit, to train, fine tune, or merge models to create a unique roleplay LLM. These models would be submitted to a shared Hugging Face pool.

	Validators would evaluate the and assess model performance via our protocol and rank the submissions based on an [open scoring format](/docs/llm_scoring.md). We will provide a suite of
	testing and benchmarking protocols with state-of-the-art datasets.

	## Running Miners and Validators
	### Running a Miner
	> Important: Please carefully read through the [FAQ](docs/FAQ.md) and [Detailed Miner Documentation](docs/miner.md). These contain critical information about model requirements, evaluation criteria, and best practices that will help ensure your submissions are valid and competitive.



	### Running a Validator

	#### Requirements
	- Python 3.9+

	#### Setup
	To start, clone the repository and `cd` to it:
	```
	git clone https://github.com/impel-intelligence/dippy-bittensor-subnet.git
	cd dippy-bittensor-subnet
	pip install -e .
	```
	To run the evaluation, simply use the following command:

	```
	python neurons/validator.py --wallet.name WALLET_NAME --wallet.hotkey WALLET_HOT_NAME
	```

	To run auto-updating validator with PM2 (highly recommended):
	```bash
	pm2 start --name sn11-vali-updater --interpreter python scripts/start_validator.py -- --pm2_name sn11-vali --wallet.name WALLET_NAME --wallet.hotkey WALLET_HOT_NAME [other vali flags]
	```

	If you wish to use a local subtensor node, the additional flags required are `--local` in additional to the typical arguments.
	Example:
	```bash
	python neurons/validator.py \
	--wallet.name coldkey \
	--wallet.hotkey hotkey \
	--local \
	--subtensor.network local --subtensor.chain_endpoint ws://chain_endpoint
	```

	Please note that this validator will call the model worker orchestration service hosted by the dippy subnet owners. Current support for local worker orchestration is disabled at this time.

	## Subnet Incentive Mechanism

	The general structure of the incentive mechanism is as follows:
	1. Every miner has a model registered per UID
	2. Each miner's model submission is scored, with details outlined below
	- The scoring mechanism is constantly evolving according to SOTA model bechmark data

	3. The validator compares each miner's score against all the other miners, and calculates a win rate
	- Note that there are some modifiers for a miner's score such as their submission age in relation to other miner submissions (aka time penalty) to combat blatant model copying
	4. Given each miner's win rate, weights are assigned sorted by highest win rate


	### Model Evaluation Criteria
	See [scoring](/docs/llm_scoring.md) for details

	## Subnet Token Management
	See the [subnet token doc](/docs/subnet_token.md) for details

	## Acknowledgement

	Our codebase was originally built upon [Nous Research's](https://github.com/NousResearch/finetuning-subnet) and [MyShell's](https://github.com/myshell-ai/MyShell-TTS-Subnet?tab=readme-ov-file) Subnets. At the time of this writing, we have deviated significantly from these subnet architectures, providing more efficiency and capability.

	## License

	The Dippy Bittensor subnet is released under the [MIT License](./LICENSE).


	# Project Structure Overview

	## Core Components

	### 1. Main Application
	- `neurons/` - Core neural network components
	- `miner.py` - Miner code for submitting a model to the bittensor network
	- `validator.py` - Validation node implementation
	- `model_queue.py` - Queue management for model processing (for internal use)

	### 2. LLM Scoring
	- `scoring/` - All code that determines the scoring for an LLM lives here

	### 3. Utilities
	- `utilities/` - Common utility functions
	- `repo_details.py` - Repository management utilities
	- `validation_utils.py` - Validation helper functions

	### 4. Documentation
	- `docs/` - Project documentation
	- `miner.md` - Miner setup and usage guide
	- `validator.md` - Validator setup and usage guide
	- `FAQ.md` - Frequently asked questions
	- `llm_scoring.md` - LLM Scoring criteria

	### 5. Worker API (for internal use)
	- `wokrer_api/` - API for model validation. Only validators and subnet operators require usage of this API. Miners do not need to set this up in 99% of cases

	## Docker Configuration
	- `evaluator.Dockerfile` - Docker configuration for evaluator (scoring worker)
	- `worker_api/vapi.Dockerfile` - Docker configuration for worker API