---
tags:
- ml-intern
---
# NeuroName: Domain-Specific AI Architecture for Creative Name Generation

[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
[![Python 3.9+](https://img.shields.io/badge/python-3.9+-blue.svg)](https://www.python.org/downloads/)
[![PyTorch](https://img.shields.io/badge/PyTorch-2.0+-ee4c2c.svg)](https://pytorch.org/)

## 🧠 What is NeuroName?

**NeuroName** is a purpose-built neural architecture for generating creative, novel names for brands, YouTube channels, social media handles, products, and more. Unlike generic LLMs that produce obvious word combinations, NeuroName creates **genuinely new words** that:

- Sound natural and pronounceable
- Evoke intended meanings without being literal
- Are controllable (length, style, language feel, energy)
- Are truly novel — not existing words or obvious compounds

## 🔬 Why Current LLMs Fail at Creative Naming

| Problem | Why It Happens | NeuroName Solution |
|---------|---------------|-------------------|
| **Too generic** | LLMs predict probable tokens from training distribution | Character-level VAE generates outside known distributions |
| **Obvious combinations** | Token-level = existing word chunks | Char-level latent space enables smooth morphological blending |
| **No sound awareness** | No phonotactic model | Dedicated Phonotactic Discriminator scores pronounceability |
| **Can't be truly novel** | Constrained to recombine training tokens | VAE latent interpolation creates genuinely new sequences |
| **No fine control** | Prompt engineering is imprecise | Energy-based composable attribute control in latent space |
| **RLHF kills creativity** | Safety alignment → conservative outputs | No RLHF; creativity is the objective function |

## 🏗️ Architecture Overview

```
Input: semantic_hints + control_params (length, style, language_feel, energy)
                    │
                    ▼
    ┌─────────────────────────────┐
    │   Semantic Encoder          │  ← Transformer encodes meaning hints
    │   (attention-pooled)        │
    └──────────────┬──────────────┘
                   │
                   ▼
    ┌─────────────────────────────┐
    │   Conditional Prior         │  ← P(z|semantics, controls) - Gaussian
    │   Network (μ, σ learned)    │
    └──────────────┬──────────────┘
                   │
                   ▼ z ~ N(μ, σ²)
    ┌─────────────────────────────┐
    │   Latent Space + EBM        │  ← Energy-based attribute composition
    │   (ODE-guided sampling)     │
    └──────────────┬──────────────┘
                   │
                   ▼
    ┌─────────────────────────────┐
    │   Character Decoder         │  ← Transformer generates char-by-char
    │   (cross-attends to z)      │
    └──────────────┬──────────────┘
                   │
                   ▼
    ┌─────────────────────────────┐
    │   Phonotactic Validator     │  ← CNN+Transformer scores sound quality
    └──────────────┬──────────────┘
                   │
                   ▼
         Generated Name: "Velocix" ✓
```

## 🧬 Key Innovations

### 1. Character-Level VAE (not token-level)
Operates at individual characters, enabling creation of genuinely novel sequences impossible with subword tokenizers.

### 2. Phonotactic Discriminator
Learned model of sound combinations (bigrams, trigrams, syllable structure) based on the **Bouba-Kiki Effect** and cross-linguistic phonotactics. Ensures outputs are pronounceable and pleasant-sounding.

### 3. Morphological Composition Module
Explicit linguistic word-formation operations as differentiable modules:
- **Blending**: "breakfast + lunch → brunch" style merging
- **Affixation**: Meaningful prefix/suffix attachment
- **Vowel Harmony**: Sound shifting for cohesion
- **Clipping + Extension**: Shortening with style

### 4. Energy-Based Composable Control
Multiple attributes (style, length, language feel) composed via energy functions in latent space. Mathematically principled — not prompt hacking.

### 5. Sound Symbolism Integration
Phoneme-meaning associations baked into the architecture:
- **Plosives** (b, d, k, t): Power, strength → "Kodak", "TikTok"
- **Fricatives** (f, s, sh, v): Speed, elegance → "Swift", "Visa"
- **Nasals** (m, n): Warmth, comfort → "Amazon", "Nintendo"
- **Close vowels** (i, e): Precision, tech → "Google", "Pixel"

## 📦 Installation

```bash
pip install torch numpy pyyaml tqdm
git clone https://huggingface.co/asdf98/neuroname
cd neuroname
pip install -e .
```

## 🚀 Quick Start

```python
from neuroname import NeuroNameGenerator

# Initialize generator
generator = NeuroNameGenerator()

# Generate brand names with semantic hints
names = generator.generate(
    semantic_hints=["speed", "technology", "future"],
    style="modern",        # modern/classic/playful/techy/organic/elegant/bold/minimal
    language_feel="latin", # english/latin/greek/japanese/nordic/spanish/french/abstract
    energy="energetic",    # calm/neutral/energetic
    length_range=(5, 8),
    num_names=10,
    temperature=0.8
)
print(names)
# ['Velocix', 'Tervon', 'Nexura', 'Fluxen', 'Zyphos', ...]

# Generate YouTube channel names
names = generator.generate(
    semantic_hints=["gaming", "adventure", "epic"],
    style="playful",
    language_feel="english",
    energy="energetic",
    length_range=(6, 12),
    num_names=10
)

# Generate social media handles
names = generator.generate(
    semantic_hints=["art", "minimal", "aesthetic"],
    style="elegant",
    language_feel="french",
    energy="calm",
    length_range=(4, 8),
    num_names=10
)
```

## 🏋️ Training

```bash
# Train from scratch
python train.py --config configs/default.yaml

# Train with custom data
python train.py --data_path your_names.txt --epochs 100
```

## 📁 Repository Structure

```
neuroname/
├── README.md                    # This file
├── pyproject.toml              # Package configuration
├── neuroname/
│   ├── __init__.py             # Package exports
│   ├── model.py                # Core architecture (VAE + all components)
│   ├── generator.py            # High-level generation interface
│   ├── phonotactics.py         # Phonotactic scoring & sound symbolism
│   ├── morphology.py           # Morphological composition operations
│   ├── latent_ops.py           # Energy-based latent space control
│   ├── data.py                 # Dataset & data loading utilities
│   └── config.py               # Configuration management
├── train.py                    # Training script
├── configs/
│   └── default.yaml            # Default training configuration
└── notebooks/
    └── demo.ipynb              # Interactive demonstration
```

## 📊 Sound Symbolism Research Basis

Our architecture is grounded in linguistic research on sound-meaning associations:

| Phoneme Type | Associations | Example Brands |
|-------------|--------------|----------------|
| Voiced plosives (b, g, d) | Strong, bold, grounded | **B**ose, **G**oogle, **D**ell |
| Voiceless plosives (p, t, k) | Sharp, precise, clean | **P**aypal, **T**esla, **K**odak |
| Fricatives (f, v, s, z) | Fast, flowing, futuristic | **V**isa, **Z**ara, **S**potify |
| Nasals (m, n) | Warm, nurturing, smooth | a**M**azon, **N**intendo |
| Liquids (l, r) | Fluid, dynamic, premium | **L**exus, **R**olex |
| High vowels (i, ee) | Small, quick, technical | P**i**xel, W**ii** |
| Low vowels (a, o) | Big, open, powerful | **A**pple, V**o**lvo |

## 🔧 Technical Details

- **Model Size**: ~15M parameters (intentionally small — domain-specific, not general)
- **Latent Dimension**: 128
- **Character Vocabulary**: 44 chars (lowercase + digits + special)
- **Max Name Length**: 32 characters
- **Training**: ELBO loss + phonotactic reward + attribute classification

## 📄 License

MIT License - see LICENSE file for details.

## 🙏 Acknowledgments

Architecture inspired by:
- [LatentOps](https://arxiv.org/abs/2208.00638) - Composable text controls in latent space
- [LlaMaVAE](https://arxiv.org/abs/2312.13208) - VAE with LLM decoder
- [Bouba-Kiki Effect](https://en.wikipedia.org/wiki/Bouba/kiki_effect) - Sound symbolism research
- [Controllable Text Generation Survey](https://arxiv.org/abs/2408.12599) - CTG methods taxonomy

<!-- ml-intern-provenance -->
## Generated by ML Intern

This model repository was generated by [ML Intern](https://github.com/huggingface/ml-intern), an agent for machine learning research and development on the Hugging Face Hub.

- Try ML Intern: https://smolagents-ml-intern.hf.space
- Source code: https://github.com/huggingface/ml-intern

## Usage

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "asdf98/neuroname"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
```

For non-causal architectures, replace `AutoModelForCausalLM` with the appropriate `AutoModel` class.