
Experiment: Your First Analysis

Goal

Learn how to run your first analysis and walk through each pipeline stage to understand how a transformer model processes text.

Prerequisites

None -- this is the starting experiment.

Steps

Step 1: Select a Model

  1. In the generator section at the top, find the "Select Model" dropdown.
  2. Choose "GPT-2 (124M)" from the list.
  3. Wait for the model to load. You'll see a status message indicating the model is ready.

Step 2: Enter a Prompt

  1. In the "Enter Prompt" textarea, type: The cat sat on the
  2. Leave the generation settings at their defaults (1 beam, a few tokens).

Step 3: Run the Analysis

  1. Click the "Analyze" button.
  2. Wait for the analysis to complete. The pipeline stages and generation results will appear.

Step 4: Explore the Generated Sequences

Look at the generated sequence(s) below the generator. You should see how GPT-2 continues your prompt. Common completions might include "mat," "floor," "bed," or similar words.

Step 5: Walk Through the Pipeline

Now expand each of the 5 pipeline stages by clicking on them:

Stage 1 - Tokenization: Click to expand. You'll see your prompt split into tokens. Notice how each word (and its leading space) becomes a separate token. Count the tokens -- "The cat sat on the" should produce 5 tokens.
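The leading-space behavior can be sketched in a few lines. Note this is not GPT-2's real byte-pair encoding (which can split rare words into multiple sub-word tokens); it only mimics what happens for common words like these, where each word maps to a single token that carries its leading space:

```python
# Illustrative sketch of GPT-2-style tokenization, NOT the real BPE algorithm.
# For common words, each word becomes one token, and the space before a word
# is stored as part of that word's token.
def sketch_tokenize(text):
    words = text.split(" ")
    # The first token has no leading space; every later token keeps its own.
    return [words[0]] + [" " + w for w in words[1:]]

tokens = sketch_tokenize("The cat sat on the")
print(tokens)       # ['The', ' cat', ' sat', ' on', ' the']
print(len(tokens))  # 5
```

This is why the tokens you see in the UI often start with a space character.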

Stage 2 - Embedding: Click to expand. You'll see that each token has been converted into a 768-dimensional vector -- GPT-2's hidden dimension.
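Conceptually, the embedding step is a table lookup: a matrix with one row per vocabulary entry, indexed by token id. A minimal sketch with random weights (the token ids here are made up; real ids come from the tokenizer):

```python
import numpy as np

# Toy embedding table: one row per vocabulary token, 768 numbers per row.
# GPT-2 small has a 50,257-token vocabulary and hidden dimension 768.
rng = np.random.default_rng(0)
vocab_size, hidden_dim = 50257, 768
embedding_matrix = rng.normal(size=(vocab_size, hidden_dim)).astype(np.float32)

token_ids = [464, 3797, 3332, 319, 262]  # hypothetical ids for the 5 tokens
vectors = embedding_matrix[token_ids]    # lookup: one 768-dim vector per token
print(vectors.shape)                     # (5, 768)
```

So a 5-token prompt becomes a 5 x 768 array before it enters the transformer layers.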

Stage 3 - Attention: Click to expand. This is the richest stage:

  • Look at the head categories. You should see heads grouped into Previous-Token, First/Positional, Bag-of-Words, Syntactic, and Other.
  • Click on a category (like "Previous-Token") to see which specific heads belong to it.
  • Below the categories, you'll see the BertViz visualization. Try clicking on individual head squares to see their attention patterns.
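What each head square visualizes is a matrix of attention weights. A minimal sketch of one head's scaled dot-product attention over 5 tokens, with GPT-2's causal mask (a token can only attend to itself and earlier positions); the query/key values are random, standing in for the learned projections:

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

rng = np.random.default_rng(1)
seq_len, d_head = 5, 64  # GPT-2 small: 768 hidden dims / 12 heads = 64 per head
Q = rng.normal(size=(seq_len, d_head))   # queries (random stand-ins)
K = rng.normal(size=(seq_len, d_head))   # keys (random stand-ins)

scores = Q @ K.T / np.sqrt(d_head)       # scaled dot-product scores
mask = np.triu(np.ones((seq_len, seq_len), dtype=bool), k=1)
scores[mask] = -np.inf                   # causal mask: no attending forward
weights = softmax(scores)                # each row sums to 1
print(weights.shape)                     # (5, 5)
```

A "Previous-Token" head, for example, is one whose weight matrix concentrates most of each row's mass on position i-1 -- exactly the diagonal stripe you'll see in its square.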

Stage 4 - MLP: Click to expand. You'll see the expand-compress pattern: 768 → 3072 → 768. This shows GPT-2's feed-forward network dimensions.
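The expand-compress pattern is two matrix multiplications with a nonlinearity between them. A sketch with random weights (GPT-2 uses a GELU activation; the tanh approximation below is one common form):

```python
import numpy as np

def gelu(x):
    # tanh approximation of the GELU activation
    return 0.5 * x * (1 + np.tanh(np.sqrt(2 / np.pi) * (x + 0.044715 * x**3)))

rng = np.random.default_rng(2)
d_model, d_ff = 768, 3072                # GPT-2 small: d_ff = 4 * d_model
W_in = rng.normal(size=(d_model, d_ff)) * 0.02   # expand projection
W_out = rng.normal(size=(d_ff, d_model)) * 0.02  # compress projection

x = rng.normal(size=(5, d_model))        # one vector per token
hidden = gelu(x @ W_in)                  # (5, 3072): expanded representation
out = hidden @ W_out                     # (5, 768):  back to the model width
print(hidden.shape, out.shape)
```

The 4x expansion gives the network a wider intermediate space to compute in before projecting back down to the residual stream's 768 dimensions.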

Stage 5 - Output: Click to expand. You'll see:

  • Your prompt with the predicted next token highlighted
  • The confidence percentage
  • A top-5 bar chart showing the model's top predictions
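The bar chart comes from applying softmax to the model's final-layer logits over the vocabulary and keeping the 5 highest-probability tokens. A sketch with made-up logits and token strings (real values come from the model):

```python
import numpy as np

def softmax(x):
    x = x - x.max()          # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum()

# Hypothetical logits for a handful of candidate next tokens.
vocab = [" mat", " floor", " bed", " couch", " table", " dog"]
logits = np.array([4.0, 2.5, 2.0, 1.0, 0.5, -1.0])
probs = softmax(logits)      # probabilities summing to 1

top5 = sorted(zip(vocab, probs), key=lambda p: p[1], reverse=True)[:5]
for token, p in top5:
    print(f"{token!r}: {p:.1%}")
```

The "confidence percentage" shown for the predicted token is simply its softmax probability, which is why a flatter logit distribution yields a lower confidence.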

Step 6: Reflect

Think about what you observed:

  • How many tokens did your prompt become?
  • What was the model's top prediction? How confident was it?
  • Were there any surprising alternative predictions in the top 5?

What's Next?

Try changing the prompt and running the analysis again. Compare results with different inputs:

  • A factual prompt: "The capital of France is"
  • A creative prompt: "Once upon a time, there was a"
  • A technical prompt: "The function takes an input and"

Then move on to Experiment: Exploring Attention Patterns to dive deeper into what the attention heads are doing.