Eric Xu commited on
Commit
85d7c12
·
1 Parent(s): 4fb07e1

Acknowledge Nemotron's rich narrative fields

Browse files

The dataset has professional_persona, skills_and_expertise,
career_goals, and persona narratives that encode seniority, industry,
technical depth, and decision style — not just demographic columns.
Most domains work out of the box without LLM-generated personas.

Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -37,9 +37,9 @@ Anything someone else evaluates.
37
  | **Content** — blog post, video | Readers at different expertise levels | Whether it hits the right level, what's confusing |
38
  | **Profile** — dating, professional bio | Population sample by age, education, occupation | How different demographics perceive you |
39
 
40
- SGO ships with a 1M-person census-grounded dataset ([Nemotron-Personas-USA](https://huggingface.co/datasets/nvidia/Nemotron-Personas-USA)) that covers **age, sex, education, occupation, marital status, and US geography**. This works well out of the box for profiles, policy, and any consumer-facing evaluation.
41
 
42
- For specialized domains (B2B buyers, hiring managers, VCs), the dataset doesn't have fields like company size, seniority, or budget so SGO generates those personas via LLM with explicit stratification constraints. The results are still useful, but see [limitations](#limitations) on LLM-generated panels vs. census-grounded ones.
43
 
44
  In each case, SGO tells you **where you stand**, **what's working**, **what's not**, and **what specific change would help the most** — broken down by audience segment.
45
 
 
37
  | **Content** — blog post, video | Readers at different expertise levels | Whether it hits the right level, what's confusing |
38
  | **Profile** — dating, professional bio | Population sample by age, education, occupation | How different demographics perceive you |
39
 
40
+ SGO ships with a 1M-person census-grounded dataset ([Nemotron-Personas-USA](https://huggingface.co/datasets/nvidia/Nemotron-Personas-USA)) with structured demographics (age, sex, education, occupation, marital status, US geography) plus rich narrative fields professional persona, skills and expertise, career goals, hobbies, cultural background, and personality. The narratives naturally encode things like seniority, industry, technical depth, and decision-making style, even though those aren't separate columns.
41
 
42
+ This means most domains work out of the box the LLM evaluates from the persona's full context, not just the demographic fields. For highly specialized panels (e.g., Series B VCs, enterprise procurement officers), SGO can generate personas via LLM with explicit stratification constraints. See [limitations](#limitations) on generated vs. census-grounded panels.
43
 
44
  In each case, SGO tells you **where you stand**, **what's working**, **what's not**, and **what specific change would help the most** — broken down by audience segment.
45