mtyrrell commited on
Commit
2eaa419
·
1 Parent(s): fb1edcc

updated innovation classifier (prompt)

Browse files
Files changed (2) hide show
  1. src/pipeline.py +11 -8
  2. src/prompts.py +34 -6
src/pipeline.py CHANGED
@@ -132,16 +132,19 @@ def process_data(uploaded_file, sens_level, azure_client, azure_deployment):
132
  )
133
 
134
  elif model_name == 'INNOVATION_CLASSIFICATION':
135
- df['innovation_classification'] = df.apply(
136
- lambda x: classify_innovation(
 
 
137
  azure_client,
138
  azure_deployment,
139
- x['id'],
140
- x['tech_txt'],
141
- x['tech_rationale_txt']
142
- ),
143
- axis=1
144
- )
 
145
 
146
 
147
  logger.info(f"Completed: {model_name}")
 
132
  )
133
 
134
  elif model_name == 'INNOVATION_CLASSIFICATION':
135
+ results = []
136
+ total_rows = len(df)
137
+ for idx, row in df.iterrows():
138
+ result = classify_innovation(
139
  azure_client,
140
  azure_deployment,
141
+ row['id'],
142
+ row['tech_txt'],
143
+ row['tech_rationale_txt']
144
+ )
145
+ results.append(result)
146
+ progress_bar.progress((len(results)) / total_rows)
147
+ df['innovation_classification'] = results
148
 
149
 
150
  logger.info(f"Completed: {model_name}")
src/prompts.py CHANGED
@@ -14,13 +14,41 @@ def prompt_innovation(concept: str) -> str:
14
  locally manufactured lithium-ion batteries for custom-built e-mobility solutions, or reactive power
15
  compensation systems to enhance grid stability.
16
 
17
- Classification rubric:
18
- - NOT INNOVATIVE: already common/deployed in the local context, or standard practice with no clear novelty.
19
- - MODERATELY INNOVATIVE: established elsewhere but new to the local context, or a meaningful adaptation.
20
- - VERY INNOVATIVE: novel approach globally or locally with clear differentiation from standard solutions.
21
- - INSUFFICIENT INFO: not enough detail to classify innovation level.
 
 
 
 
 
 
22
 
23
- If the concept is too vague to judge novelty, choose the lowest classification that is defensible and reflect uncertainty in confidence.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
24
 
25
  Concept for review:
26
  {concept}
 
14
  locally manufactured lithium-ion batteries for custom-built e-mobility solutions, or reactive power
15
  compensation systems to enhance grid stability.
16
 
17
+ Important clarifications for classification:
18
+
19
+ TECHNOLOGY AVAILABILITY: If the text indicates that the core technologies are already available,
20
+ deployed, or proven in the country (even for different sectors or applications) this strongly
21
+ signals an established market. Applying existing in-country technology to a new sector (e.g.,
22
+ solar from residential to mining) is deployment, not innovation.
23
+
24
+ SCALING vs. INNOVATION: Scaling up existing pilots, increasing unit counts, or expanding
25
+ geographic reach of proven technologies does not constitute innovation. Similarly, changing
26
+ ownership models (e.g., community-owned vs. privately-owned) or delivery mechanisms for
27
+ established technologies is not technical innovation.
28
 
29
+ COMBINING TECHNOLOGIES: Bundling or integrating multiple established technologies (e.g., solar +
30
+ e-mobility + enterprise systems) does not create innovation if each component technology is
31
+ already available and proven in-country. The combination must involve genuinely novel technical
32
+ integration or adaptation to qualify.
33
+
34
+ CLAIMS vs. EVIDENCE: Discount self-described "innovation" claims. Focus on whether the underlying
35
+ technology/approach itself is novel, not whether the project framing or business model is described as new.
36
+
37
+ Classification rubric:
38
+ - NOT INNOVATIVE: Technologies already available, deployed, or proven in-country (even if in
39
+ different sectors); scaling or replicating existing pilots; combining established technologies
40
+ without novel technical integration; standard practice with no clear technical novelty.
41
+ - MODERATELY INNOVATIVE: Technology genuinely new to the country (not just the sector) with no
42
+ prior in-country availability or deployment; OR a technically meaningful adaptation that goes
43
+ beyond repackaging or recombining existing solutions.
44
+ - VERY INNOVATIVE: Novel approach globally or locally with clear technical differentiation from
45
+ standard solutions; represents a genuine technological advancement, not just deployment scale
46
+ or business model change.
47
+ - INSUFFICIENT INFO: Not enough detail to classify innovation level.
48
+
49
+ Default to the lowest defensible classification. If technologies are described as "available"
50
+ or "proven" in-country, default to NOT INNOVATIVE unless there is clear evidence of genuine
51
+ technical novelty beyond deployment context.
52
 
53
  Concept for review:
54
  {concept}