Deep Dive: GenreMoodAgent & DiscoveryAgent Analysis

Overview

This document provides a comprehensive analysis of how the GenreMoodAgent and DiscoveryAgent work internally, including their algorithms, data structures, and decision-making processes.

🎵 GenreMoodAgent Deep Dive

Core Purpose

The GenreMoodAgent is the "mood and genre specialist" that finds music based on emotional states, energy levels, and musical styles using Last.fm's tag-based search system.

🧠 Internal Architecture

1. Initialization & Knowledge Base

# Mood Mappings (10 categories)
mood_tag_mappings = {
    'happy': ['happy', 'uplifting', 'cheerful', 'joyful', 'positive'],
    'sad': ['sad', 'melancholy', 'melancholic', 'emotional', 'depressing'],
    'energetic': ['energetic', 'upbeat', 'pump up', 'motivational', 'driving'],
    'chill': ['chill', 'relaxing', 'mellow', 'laid-back', 'easy listening'],
    'focus': ['instrumental', 'ambient', 'concentration', 'study', 'background'],
    'romantic': ['romantic', 'love', 'intimate', 'passionate', 'sensual'],
    'nostalgic': ['nostalgic', 'retro', 'vintage', 'classic', 'throwback'],
    'aggressive': ['aggressive', 'intense', 'heavy', 'hard', 'powerful'],
    'peaceful': ['peaceful', 'calm', 'serene', 'tranquil', 'soothing'],
    'party': ['party', 'dance', 'club', 'celebration', 'fun']
}

# Energy Level Mappings
energy_level_mappings = {
    'low': ['calm', 'peaceful', 'quiet', 'soft', 'gentle'],
    'medium': ['moderate', 'balanced', 'steady', 'comfortable'],
    'high': ['energetic', 'intense', 'powerful', 'driving', 'upbeat']
}

# Genre Mappings (12 major genres)
genre_tag_mappings = {
    'rock': ['rock', 'alternative rock', 'indie rock', 'classic rock'],
    'electronic': ['electronic', 'electronica', 'ambient', 'techno', 'house'],
    'indie': ['indie', 'independent', 'indie pop', 'indie rock'],
    'pop': ['pop', 'pop rock', 'electropop', 'synth-pop'],
    'hip-hop': ['hip hop', 'rap', 'hip-hop', 'urban'],
    'jazz': ['jazz', 'smooth jazz', 'contemporary jazz', 'jazz fusion'],
    # ... and more
}

2. 5-Step Processing Workflow

Step 1: Mood Analysis (`_analyze_mood_requirements()`)

# Input: "I want upbeat music for working out"
# Output:
{
    'primary_mood': 'energetic',           # Detected from "upbeat"
    'secondary_moods': ['party'],          # Secondary matches
    'energy_level': 'high',               # From "working out" context
    'context_tags': ['motivational'],     # Activity-based tags
    'mood_confidence': 0.8                # Confidence in detection
}

Algorithm:

Extract strategy parameters from PlannerAgent
Scan query for mood keywords using mappings
Determine energy level from context clues
Calculate confidence based on keyword matches
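The four steps above can be sketched roughly as follows. This is a minimal illustration, not the agent's actual code: the `ENERGY_CLUES` table, the function name `analyze_mood`, the substring matching, and the confidence formula are all assumptions made for the example, and only a subset of the mood mappings is reproduced.

```python
# Subset of the mood mappings shown earlier in this document.
MOOD_TAGS = {
    "energetic": ["energetic", "upbeat", "pump up", "motivational", "driving"],
    "chill": ["chill", "relaxing", "mellow", "laid-back", "easy listening"],
    "party": ["party", "dance", "club", "celebration", "fun"],
}

# Hypothetical activity phrases used as energy clues (assumption).
ENERGY_CLUES = {
    "high": ["working out", "workout", "gym", "running"],
    "low": ["sleep", "study", "relax"],
}


def analyze_mood(query: str, strategy: dict) -> dict:
    q = query.lower()

    # Count how many keywords of each mood occur in the query.
    hits = {mood: sum(tag in q for tag in tags) for mood, tags in MOOD_TAGS.items()}
    matched = [m for m, n in sorted(hits.items(), key=lambda kv: -kv[1]) if n > 0]

    # Fall back to the planner's strategy when the query has no mood keyword.
    primary = matched[0] if matched else strategy.get("mood_priority", "chill")

    # Start from the planner's energy level, then let context clues override it.
    energy = strategy.get("energy_level", "medium")
    for level, clues in ENERGY_CLUES.items():
        if any(clue in q for clue in clues):
            energy = level
            break

    # Illustrative confidence: grows with the number of keyword matches.
    confidence = min(0.5 + 0.3 * sum(hits.values()), 1.0)

    return {
        "primary_mood": primary,
        "secondary_moods": matched[1:],
        "energy_level": energy,
        "mood_confidence": confidence,
    }


result = analyze_mood("I want upbeat music for working out", {"energy_level": "medium"})
print(result)
```

Plain substring matching is the simplest option but can misfire on short keywords (for example `fun` inside `funk`), so a real implementation would likely tokenize the query first.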

Step 2: Search Tag Generation (`_generate_search_tags()`)

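# Input: mood_analysis + strategy
# Output: ['energetic', 'upbeat', 'high_energy', 'motivational']
# Note: this excerpt is simplified. The example output also contains 'high_energy'
# and 'motivational' (a context tag from Step 1), which the lines shown here do not produce.

def _generate_search_tags(strategy, mood_analysis):
    tags = []
    
    # Add primary mood tags (first 2), looked up from the Step 1 analysis
    primary_mood = mood_analysis['primary_mood']                  # 'energetic'
    tags.extend(mood_tag_mappings.get(primary_mood, [])[:2])      # ['energetic', 'upbeat']
    
    # Add energy level tags (first 2)
    energy_level = mood_analysis['energy_level']                  # 'high'
    tags.extend(energy_level_mappings.get(energy_level, [])[:2])  # ['energetic', 'intense']
    
    # Add genre focus from strategy; unknown genres are skipped instead of raising KeyError
    focus_areas = strategy.get('focus_areas', [])    # ['rock', 'electronic']
    for area in focus_areas[:2]:
        tags.extend(genre_tag_mappings.get(area, [])[:2])
    
    return tags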

Step 3: Track Search (`_search_tracks_by_tags()`)

# Uses multiple tag combinations for comprehensive search
primary_combinations = [
    ['energetic', 'upbeat'],              # First 2 tags
    ['upbeat', 'high_energy'],            # Tags 2 and 3 (overlapping window)
    ['energetic']                         # Single primary tag
]

# For each combination:
for tag_combo in primary_combinations:
    tag_query = ' '.join(tag_combo)       # "energetic upbeat"
    tracks = await lastfm_client.search_tracks(tag_query, limit=20)
    
    # Add search context to each track
    for track in tracks:
        track['search_tags'] = tag_combo
        track['search_type'] = 'tag_based'

Search Strategy:

Multiple combinations to maximize coverage
Rate limiting (0.1s delays) to respect Last.fm API
Deduplication based on artist + title
Limit to 50 candidates for processing efficiency
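The deduplication and candidate cap described above could look like the sketch below. The helper name `dedupe_tracks` is hypothetical, and the sketch assumes each track is a flat dict with string `artist` and `name` fields, as in the other excerpts in this document.

```python
def dedupe_tracks(tracks, limit=50):
    """Keep the first occurrence of each (artist, title) pair, up to `limit` tracks."""
    seen = set()
    unique = []
    for track in tracks:
        # Normalize case and whitespace so "The Kinks" and "the kinks " collide.
        key = (
            str(track.get("artist", "")).strip().lower(),
            str(track.get("name", "")).strip().lower(),
        )
        if key in seen:
            continue
        seen.add(key)
        unique.append(track)
        if len(unique) >= limit:
            break
    return unique


candidates = [
    {"artist": "The Kinks", "name": "Waterloo Sunset", "search_tags": ["energetic", "upbeat"]},
    {"artist": "the kinks ", "name": "waterloo sunset", "search_tags": ["energetic"]},
    {"artist": "ELO", "name": "Mr. Blue Sky", "search_tags": ["upbeat", "high_energy"]},
]
unique_candidates = dedupe_tracks(candidates)
print([t["name"] for t in unique_candidates])
```

Keeping the first occurrence means a track retains the `search_tags` of the earliest (most specific) tag combination that found it.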

Step 4: Filtering & Ranking (`_filter_and_rank_tracks()`)

import math

def _calculate_track_score(track, mood_analysis, strategy):
    score = 0.0
    
    # Base metadata score (20%)
    if track.get('artist') and track.get('name'):
        score += 0.2
    
    # Search tag relevance (30%)
    search_tags = track.get('search_tags', [])
    primary_mood_tags = mood_tag_mappings[mood_analysis['primary_mood']]
    tag_overlap = len(set(search_tags) & set(primary_mood_tags))
    score += (tag_overlap / max(len(primary_mood_tags), 1)) * 0.3
    
    # Popularity score (20%) - log scale normalization
    listeners = int(track.get('listeners') or 0)  # Tolerates a missing or None value
    if listeners > 0:
        normalized_listeners = min(math.log10(listeners) / 6, 1.0)  # Max at 1M
        score += normalized_listeners * 0.2
    
    # URL availability (20%)
    if track.get('url'):
        score += 0.2
    
    # Quality indicators (10% each)
    # good_artist_name / good_track_name are name-quality checks computed outside this excerpt
    if good_artist_name: score += 0.1
    if good_track_name: score += 0.1
    
    # The components above add up to 1.1, so the result is capped at 1.0
    return min(score, 1.0)

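Scoring Breakdown:

30%: How well search tags match the primary mood
20%: Popularity (more listeners = higher score, log-scaled and saturating at 1M listeners)
20%: URL availability (playable tracks preferred)
20%: Basic metadata quality (artist and track name present)
10% each: Artist name quality and track name quality

The components add up to 110%, so the final score is capped at 1.0.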
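A worked example makes the weights concrete. The track below is invented for illustration (it is not real Last.fm data), and both name-quality checks are assumed to pass.

```python
import math

# Hypothetical track found via the tag combination ['energetic', 'upbeat'].
listeners = 100_000
search_tags = {"energetic", "upbeat"}
primary_mood_tags = {"energetic", "upbeat", "pump up", "motivational", "driving"}

base_metadata = 0.2                                                   # artist and name present
tag_relevance = len(search_tags & primary_mood_tags) / len(primary_mood_tags) * 0.3
popularity = min(math.log10(listeners) / 6, 1.0) * 0.2                # log10(100000) = 5
url_available = 0.2                                                   # track has a URL
name_quality = 0.1 + 0.1                                              # both checks assumed to pass

total = min(base_metadata + tag_relevance + popularity + url_available + name_quality, 1.0)
print(f"tag_relevance={tag_relevance:.3f} popularity={popularity:.3f} total={total:.3f}")
```

One consequence worth noting: because each search uses at most two tags while every mood list holds five, the tag-relevance term reaches at most 2/5 × 0.3 = 0.12 in this scheme, well short of its nominal 30% share.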

Step 5: Recommendation Creation (`_create_recommendations()`)

# Top 5 tracks become TrackRecommendation objects
for i, track in enumerate(top_tracks[:5]):
    recommendation = TrackRecommendation(
        title=track['name'],
        artist=track['artist'],
        source="lastfm",
        genres=extracted_genres,                    # From strategy focus areas
        moods=extracted_tags,                       # Primary mood + energy + search tags
        quality_score=track['genre_mood_score'],    # From scoring algorithm
        novelty_score=0.3,                         # Low (focuses on established genres)
        concentration_friendliness_score=calc_concentration_score(),
        confidence=track['genre_mood_score'],
        advocate_source_agent="GenreMoodAgent"
    )

🎯 GenreMoodAgent Strengths

Comprehensive Mood Coverage: 10 mood categories with 5 tags each
Energy Level Matching: High/medium/low energy optimization
Activity Context: Understands workout, study, party contexts
Quality Filtering: Prefers tracks with good metadata and popularity
Tag-Based Discovery: Leverages Last.fm's extensive tagging system

🔍 GenreMoodAgent Limitations

Mainstream Bias: Prefers popular tracks (high listener counts)
Limited Novelty: Low novelty scores (0.3) by design
Tag Dependency: Relies on Last.fm's tag quality
No Artist Similarity: Doesn't understand "like The Beatles" requests

🔍 DiscoveryAgent Deep Dive

Core Purpose

The DiscoveryAgent is the "similarity and underground specialist" that finds music through artist similarity networks and underground exploration, prioritizing novelty and hidden gems.

🧠 Internal Architecture

1. Initialization & Knowledge Base

# Discovery Strategies (4 exploration types)
discovery_strategies = {
    'underground': {
        'max_listeners': 50000,      # Strict underground limit
        'novelty_weight': 0.8,       # High novelty priority
        'similarity_depth': 3        # Deep similarity exploration
    },
    'similar': {
        'max_listeners': 200000,     # More lenient for similarity
        'novelty_weight': 0.4,       # Lower novelty priority
        'similarity_depth': 2        # Focused similarity
    },
    'diverse': {
        'max_listeners': 500000,     # Broad listener range
        'novelty_weight': 0.6,       # Balanced novelty
        'similarity_depth': 4        # Wide exploration
    },
    'balanced': {
        'max_listeners': 100000,     # Default settings
        'novelty_weight': 0.6,
        'similarity_depth': 3
    }
}

# Seed Artists by Genre (curated for discovery)
seed_artists = {
    'rock': ['Radiohead', 'Arctic Monkeys', 'The Strokes', 'Queens of the Stone Age'],
    'electronic': ['Aphex Twin', 'Boards of Canada', 'Four Tet', 'Burial'],
    'indie': ['Bon Iver', 'Sufjan Stevens', 'Fleet Foxes', 'Grizzly Bear'],
    'underground': ['Death Grips', 'clipping.', 'JPEGMAFIA', 'Black Midi'],
    # ... more genres
}

# Underground Indicators (artist name patterns)
underground_indicators = [
    'collective', 'records', 'tape', 'bedroom', 'lo-fi', 'experimental',
    'ambient', 'drone', 'noise', 'post-', 'neo-', 'micro-', 'minimal'
]

2. 5-Step Processing Workflow

Step 1: Discovery Analysis (`_analyze_discovery_requirements()`)

# Input: "Something like The Beatles but more underground"
# Output:
{
    'exploration_type': 'underground',     # Detected from "underground"
    'novelty_priority': 'high',           # From strategy
    'underground_bias': 0.8,              # Boosted from 0.6 due to "underground"
    'discovery_scope': 'medium',          # From strategy
    'similarity_base': 'artist_similarity', # From "like The Beatles"
    'novelty_threshold': 0.7,             # High threshold for underground
    'max_listeners_threshold': 20000      # Calculated from underground_bias
}

Algorithm:

# Detect exploration type from keywords
exploration_indicators = {
    'underground': ['underground', 'hidden', 'unknown', 'discover', 'obscure'],
    'similar': ['similar', 'like', 'reminds', 'sounds like'],
    'diverse': ['diverse', 'variety', 'different', 'mix', 'eclectic'],
    'trending': ['new', 'latest', 'trending', 'popular', 'current']
}

# Adjust underground bias based on query
if 'underground' in query_lower:
    underground_bias = min(underground_bias + 0.2, 1.0)  # Boost bias
elif 'popular' in query_lower:
    underground_bias = max(underground_bias - 0.3, 0.0)  # Reduce bias

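# Calculate listener threshold
# round() rather than int(): 1.0 - 0.8 is slightly below 0.2 in floating point,
# so truncation can give 19999 instead of the 20000 shown in the example output
max_listeners = round(100000 * (1.0 - underground_bias))  # Higher bias = lower threshold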
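Put together, the keyword detection, bias adjustment, and threshold calculation can be sketched as below. The function names, the `balanced` default, and the first-match-wins precedence between exploration types are assumptions for the example; the source does not say how ties between categories are resolved.

```python
EXPLORATION_INDICATORS = {
    "underground": ["underground", "hidden", "unknown", "discover", "obscure"],
    "similar": ["similar", "like", "reminds", "sounds like"],
    "diverse": ["diverse", "variety", "different", "mix", "eclectic"],
    "trending": ["new", "latest", "trending", "popular", "current"],
}


def detect_exploration_type(query: str, default: str = "balanced") -> str:
    """Return the first exploration type whose keywords appear in the query."""
    q = query.lower()
    for kind, keywords in EXPLORATION_INDICATORS.items():
        if any(keyword in q for keyword in keywords):
            return kind
    return default


def adjust_underground_bias(query: str, bias: float) -> float:
    """Boost or reduce the planner's bias based on explicit query wording."""
    q = query.lower()
    if "underground" in q:
        return min(bias + 0.2, 1.0)
    if "popular" in q:
        return max(bias - 0.3, 0.0)
    return bias


def listener_threshold(bias: float, base: int = 100_000) -> int:
    """Higher bias means a lower listener ceiling; round to avoid float truncation."""
    return round(base * (1.0 - bias))


query = "Something like The Beatles but more underground"
exploration_type = detect_exploration_type(query)
bias = adjust_underground_bias(query, 0.6)
threshold = listener_threshold(bias)
print(exploration_type, round(bias, 2), threshold)
```

With the planner's default bias of 0.6, the word "underground" raises the bias to 0.8 and lowers the listener ceiling from 40,000 to 20,000.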

Step 2: Seed Artist Discovery (`_find_seed_artists()`)

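# Method 1: Extract from query (THIS IS WHAT MOVES TO PLANNER!)
import re

def _extract_artists_from_query(user_query):
    # Current regex patterns:
    like_match = re.search(r'like\s+([A-Z][a-zA-Z0-9]*(?:\s+[A-Za-z0-9]+)*)', user_query)
    similar_match = re.search(r'(?i)similar\s+to\s+([A-Z][a-zA-Z0-9]*(?:\s+[A-Za-z0-9]+)*)', user_query)
    
    # Intended result: ["The Beatles"]
    # Caveat: the trailing group also accepts lowercase words, so for the Step 1 query
    # the first pattern captures "The Beatles but more underground"
    return [m.group(1) for m in (like_match, similar_match) if m]

# Method 2: Genre-based seeds
# `seeds` is the working list; `seed_artists` is the curated dict defined above
seeds = _extract_artists_from_query(user_query)
focus_areas = strategy.get('focus_areas', [])  # From PlannerAgent
for genre in focus_areas:
    if genre in seed_artists:
        seeds.extend(seed_artists[genre][:2])  # Top 2 per genre

# Method 3: Fallback seeds
if not seeds:
    seeds = seed_artists['general']  # Default curated list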
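For comparison, here is a sketch of a slightly stricter regex approach that stops at the first lowercase word, so that "like The Beatles but more underground" yields only the artist name. It is an illustrative variant, not the agent's code; the pattern and the name `extract_artists` are assumptions.

```python
import re

# Trigger phrase is case-insensitive; the artist name is one or more capitalized words.
ARTIST_AFTER_TRIGGER = re.compile(
    r"(?i:\blike|\bsimilar\s+to)\s+([A-Z][\w.'&-]*(?:\s+[A-Z][\w.'&-]*)*)"
)


def extract_artists(user_query: str) -> list:
    """Return artist-like capitalized phrases that follow 'like' or 'similar to'."""
    return ARTIST_AFTER_TRIGGER.findall(user_query)


print(extract_artists("Something like The Beatles but more underground"))
print(extract_artists("Similar to Aphex Twin please"))
print(extract_artists("music like radiohead"))
```

The capitalization heuristic still has clear gaps: it misses lowercase input and stylized names such as clipping., and it truncates names containing lowercase words (Queens of the Stone Age becomes "Queens"). These gaps are part of the motivation for moving extraction to LLM-based entity recognition in the PlannerAgent.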

Step 3: Similarity Exploration (`_explore_similar_music()`)

# For each seed artist: "The Beatles"
# `client` is the agent's Last.fm client, defined outside this excerpt
async def _explore_similar_music(seed_artists, discovery_analysis, strategy):
    all_tracks = []
    
    for seed_artist in seed_artists:  # "The Beatles"
        # Get similar artists
        similar_artists = await _get_similar_artists(seed_artist, client)
        # Returns: ["The Kinks", "The Rolling Stones", "The Who"]
        
        # Get tracks from similar artists
        for similar_artist in similar_artists[:3]:  # Top 3 similar
            artist_tracks = await client.get_artist_top_tracks(similar_artist, limit=10)
            
            for track in artist_tracks:
                track['seed_artist'] = seed_artist  # Record the actual seed, not a fixed name
                track['similar_artist'] = similar_artist
                track['discovery_method'] = 'artist_similarity'
                all_tracks.append(track)
        
        # Also get tracks from seed artist itself
        seed_tracks = await client.get_artist_top_tracks(seed_artist, limit=5)
        for track in seed_tracks:
            track['discovery_method'] = 'seed_artist'
            all_tracks.append(track)
    
    # Deduplicate on artist + title, keeping the first occurrence
    unique_tracks = {}
    for track in all_tracks:
        unique_tracks.setdefault((track.get('artist'), track.get('name')), track)
    return list(unique_tracks.values())[:100]  # Deduplicated, limited

Similarity Network Exploration:

Seed Artist → Similar Artists → Their Top Tracks
Discovery Chain: Beatles → Kinks → "Waterloo Sunset"
Context Preservation: Each track knows its discovery path
Rate Limiting: 0.1-0.2s delays between API calls
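The discovery chain and rate limiting can be demonstrated without network access by substituting a stub client. `FakeLastFmClient`, its method names, and the canned data are assumptions made for this sketch; they stand in for whatever the real Last.fm client exposes.

```python
import asyncio


class FakeLastFmClient:
    """Offline stand-in for the real client (hypothetical interface)."""

    SIMILAR = {"The Beatles": ["The Kinks", "The Rolling Stones", "The Who"]}

    async def get_similar_artists(self, artist, limit=3):
        return self.SIMILAR.get(artist, [])[:limit]

    async def get_artist_top_tracks(self, artist, limit=10):
        return [{"artist": artist, "name": f"{artist} track {i}"} for i in range(1, limit + 1)]


async def explore(seed_artists, client, delay=0.1):
    """Walk seed -> similar artists -> top tracks, tagging each track with its path."""
    tracks = []
    for seed in seed_artists:
        for similar in await client.get_similar_artists(seed, limit=3):
            for track in await client.get_artist_top_tracks(similar, limit=2):
                track.update(
                    seed_artist=seed,
                    similar_artist=similar,
                    discovery_method="artist_similarity",
                )
                tracks.append(track)
            # Pause between artists so a real API is not hammered.
            await asyncio.sleep(delay)
    return tracks


tracks = asyncio.run(explore(["The Beatles"], FakeLastFmClient(), delay=0))
for t in tracks[:2]:
    print(f'{t["seed_artist"]} -> {t["similar_artist"]} -> {t["name"]}')
```

Because every track carries `seed_artist` and `similar_artist`, later stages can explain a recommendation ("found via The Kinks, who are similar to The Beatles") without re-querying the API.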

Step 4: Underground Filtering (`_filter_for_underground()`)

def _filter_for_underground(tracks, discovery_analysis, strategy):
    underground_tracks = []
    max_listeners = discovery_analysis['max_listeners_threshold']  # 20,000
    novelty_threshold = discovery_analysis['novelty_threshold']    # 0.7
    
    for track in tracks:
        # Calculate novelty score
        novelty_score = _calculate_novelty_score(track, discovery_analysis)
        
        # Apply underground bias filter
        listener_count = int(track.get('listeners', 0))
        underground_bias = discovery_analysis['underground_bias']  # 0.8
        
        if underground_bias > 0.7:  # High underground bias
            if listener_count > max_listeners:
                continue  # Strict exclusion of popular tracks
        elif underground_bias > 0.4:  # Medium underground bias
            if listener_count > max_listeners * 2:
                novelty_score *= 0.7  # Reduce score for popular tracks
        
        # Apply novelty threshold
        if novelty_score >= novelty_threshold:
            track['novelty_score'] = novelty_score
            underground_tracks.append(track)
    
    # Sort by novelty score (descending)
    return sorted(underground_tracks, key=lambda x: x['novelty_score'], reverse=True)[:20]

Step 5: Novelty Scoring (`_calculate_novelty_score()`)

def _calculate_novelty_score(track, discovery_analysis):
    score = 0.0
    
    # Base metadata score (20%)
    if track.get('artist') and track.get('name'):
        score += 0.2
    
    # Underground bias scoring (40% - MOST IMPORTANT)
    listener_count = int(track.get('listeners', 0))
    max_threshold = discovery_analysis['max_listeners_threshold']  # 20,000
    
    if listener_count == 0:
        score += 0.4  # Maximum underground score
    elif listener_count <= max_threshold:
        # Scale from 0.4 to 0.2 based on listener count
        normalized = 1.0 - (listener_count / max_threshold)
        score += 0.2 + (normalized * 0.2)
    else:
        # Popular tracks get very low scores
        # max(..., 1) avoids division by zero when underground_bias is 1.0 (threshold 0)
        popularity_factor = min(listener_count / max(max_threshold, 1), 5.0)
        if popularity_factor > 2.0:  # Very popular
            score += 0.05  # Very low score
        else:
            score += 0.1   # Minimum score
    
    # Discovery method bonus (20%)
    discovery_method = track.get('discovery_method')  # .get avoids KeyError on untagged tracks
    if discovery_method == 'artist_similarity':
        score += 0.2  # Bonus for similarity discovery
    elif discovery_method == 'seed_artist':
        score += 0.1  # Smaller bonus for direct seed
    
    # Artist diversity bonus (20%)
    artist = track.get('artist', '')
    if len(artist) > 10:  # Longer names often indicate indie artists
        score += 0.1
    if any(indicator in artist.lower() for indicator in underground_indicators):
        score += 0.1  # Bonus for underground name patterns
    
    # Metadata quality (10% each)
    if track.get('url'): score += 0.1
    if track.get('seed_artist') != track.get('similar_artist'): score += 0.1
    
    return min(score, 1.0)  # Components can add up to 1.2, so the cap matters

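Novelty Scoring Breakdown:

40%: Underground bias (fewer listeners = higher score)
20%: Base metadata (artist and track name present)
20%: Discovery method (similarity > seed artist)
20%: Artist diversity (indie indicators, name length)
20%: URL availability and discovery chain quality (10% each)

The components add up to 120%, so the final score is capped at 1.0.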
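To see how the weights interact with the 0.7 novelty threshold from Step 4, the sketch below condenses the scoring function and applies it to two invented tracks. The artist "Bedroom Tape Collective" and both listener counts are made up for illustration; the threshold of 20,000 listeners comes from the Step 1 example.

```python
UNDERGROUND_INDICATORS = [
    "collective", "records", "tape", "bedroom", "lo-fi", "experimental",
    "ambient", "drone", "noise", "post-", "neo-", "micro-", "minimal",
]


def novelty_score(track: dict, max_threshold: int) -> float:
    """Condensed restatement of the scoring steps described above."""
    score = 0.2 if track.get("artist") and track.get("name") else 0.0

    listeners = int(track.get("listeners") or 0)
    if listeners == 0:
        score += 0.4
    elif listeners <= max_threshold:
        score += 0.2 + (1.0 - listeners / max_threshold) * 0.2
    else:
        factor = min(listeners / max(max_threshold, 1), 5.0)
        score += 0.05 if factor > 2.0 else 0.1

    method = track.get("discovery_method")
    score += {"artist_similarity": 0.2, "seed_artist": 0.1}.get(method, 0.0)

    artist = track.get("artist", "")
    if len(artist) > 10:
        score += 0.1
    if any(ind in artist.lower() for ind in UNDERGROUND_INDICATORS):
        score += 0.1

    if track.get("url"):
        score += 0.1
    if track.get("seed_artist") != track.get("similar_artist"):
        score += 0.1
    return min(score, 1.0)


common = {"url": "https://example.com", "discovery_method": "artist_similarity",
          "seed_artist": "The Beatles"}
obscure = {**common, "artist": "Bedroom Tape Collective", "name": "Demo",
           "listeners": 5000, "similar_artist": "Bedroom Tape Collective"}
popular = {**common, "artist": "The Kinks", "name": "Waterloo Sunset",
           "listeners": 60000, "similar_artist": "The Kinks"}

score_a = novelty_score(obscure, max_threshold=20_000)
score_b = novelty_score(popular, max_threshold=20_000)
print(f"obscure={score_a:.2f} popular={score_b:.2f}")
```

The obscure track's raw components sum to 1.15 and are capped at 1.0, while the popular track scores 0.65 and would fall below the 0.7 novelty threshold (and, under a high underground bias, would already have been excluded by the listener limit).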

🎯 DiscoveryAgent Strengths

Artist Similarity Networks: Leverages Last.fm's similarity data
Underground Discovery: Sophisticated listener count filtering
Novelty Optimization: Complex scoring algorithm for hidden gems
Discovery Chain Tracking: Maintains provenance of recommendations
Exploration Strategies: 4 different exploration modes

🔍 DiscoveryAgent Limitations

Artist Extraction: Currently does basic regex parsing (MOVING TO PLANNER!)
API Dependency: Relies heavily on Last.fm similarity data
Popularity Bias: May miss quality popular tracks
Limited Genre Expansion: Focuses on similarity rather than genre exploration

🤝 How They Work Together

Current Coordination

# PlannerAgent creates strategies:
{
    "genre_mood_agent": {
        "focus_areas": ["rock", "pop"],
        "energy_level": "high",
        "mood_priority": "energetic"
    },
    "discovery_agent": {
        "novelty_priority": "medium",
        "similarity_base": "artist_similarity",
        "underground_bias": 0.6
    }
}

Complementary Strengths

GenreMoodAgent: Finds established, quality tracks that match mood/energy
DiscoveryAgent: Finds novel, underground tracks through artist similarity
Together: Provide both familiar comfort and exciting discovery

Example Output Comparison

Query: "Upbeat music like The Beatles"

GenreMoodAgent Results:

"Here Comes the Sun" - The Beatles (popular, upbeat, high quality)
"Mr. Blue Sky" - ELO (similar mood, well-known)
"Good as Hell" - Lizzo (modern upbeat, high energy)

DiscoveryAgent Results:

"Waterloo Sunset" - The Kinks (Beatles-similar, underground classic)
"She's Always a Woman" - Billy Joel (Beatles influence, moderate popularity)
"Golden" - Harry Styles (Beatles-inspired, newer artist)

Combined: A mix of familiar favorites and less familiar discoveries.

🚀 Enhancement Opportunities

1. Entity Recognition Migration

Move DiscoveryAgent._extract_artists_from_query() to PlannerAgent
Enhance with LLM-based entity recognition
Provide pre-extracted entities to both agents

2. Cross-Agent Learning

Share successful discovery patterns between agents
Coordinate to avoid duplicate recommendations
Balance familiarity vs. novelty based on user feedback
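As one possible shape for the duplicate-avoidance idea, the sketch below interleaves the two agents' results and drops repeats. Nothing here exists in the current code: the function name `merge_recommendations`, the alternating order, and the dict-based recommendation format are all assumptions for illustration.

```python
from itertools import zip_longest


def merge_recommendations(genre_mood, discovery, limit=10):
    """Alternate between the two agents' lists, skipping repeated (artist, title) pairs."""
    seen = set()
    merged = []
    for pair in zip_longest(genre_mood, discovery):
        for rec in pair:
            if rec is None:  # One list was shorter than the other
                continue
            key = (rec["artist"].lower(), rec["title"].lower())
            if key in seen:
                continue
            seen.add(key)
            merged.append(rec)
    return merged[:limit]


genre_mood_results = [
    {"artist": "The Beatles", "title": "Here Comes the Sun", "agent": "GenreMoodAgent"},
    {"artist": "The Kinks", "title": "Waterloo Sunset", "agent": "GenreMoodAgent"},
]
discovery_results = [
    {"artist": "The Kinks", "title": "Waterloo Sunset", "agent": "DiscoveryAgent"},
    {"artist": "Harry Styles", "title": "Golden", "agent": "DiscoveryAgent"},
]
merged = merge_recommendations(genre_mood_results, discovery_results)
print([(r["title"], r["agent"]) for r in merged])
```

Alternating between the lists keeps the familiar and novel recommendations mixed rather than grouped by agent; a feedback-driven version could instead weight the two lists unevenly.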

3. Advanced Filtering

Activity-aware filtering (workout vs. study music)
Time-based preferences (morning vs. evening music)
Mood progression (start calm, build energy)

This deep dive shows that the two agents are specialists with complementary strengths. The enhanced PlannerAgent is intended to improve both by supplying pre-extracted entities and better coordination.
