## Welcome to the Second Lab - Week 1, Day 3

Today we will work with lots of models! This is a way to get comfortable with APIs.

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Important point - please read</h2>
            <span style="color:#ff7800;">The way I collaborate with you may be different to other courses you've taken. I prefer not to type code while you watch. Rather, I execute Jupyter Labs, like this, and give you an intuition for what's going on. My suggestion is that you carefully execute this yourself, <b>after</b> watching the lecture. Add print statements to understand what's going on, and then come up with your own variations.<br/><br/>If you have time, I'd love it if you submit a PR for changes in the community_contributions folder - instructions in the resources. Also, if you have a GitHub account, use this to showcase your variations. Not only is this essential practice, but it demonstrates your skills to others, including perhaps future clients or employers...
            </span>
        </td>
    </tr>
</table>

In [6]:
# Start with imports - ask ChatGPT to explain any package that you don't know

import os
import json
from dotenv import load_dotenv
from openai import OpenAI
from anthropic import Anthropic
from IPython.display import Markdown, display

In [7]:
# Always remember to do this!
load_dotenv(override=True)

True

In [9]:
# Print the key prefixes to help with any debugging

openai_api_key = os.getenv('OPENAI_API_KEY')
anthropic_api_key = os.getenv('ANTHROPIC_API_KEY')
google_api_key = os.getenv('GOOGLE_API_KEY')
deepseek_api_key = os.getenv('DEEPSEEK_API_KEY')
groq_api_key = os.getenv('GROQ_API_KEY')

if openai_api_key:
    print(f"OpenAI API Key exists and begins {openai_api_key[:8]}")
else:
    print("OpenAI API Key not set")
    
if anthropic_api_key:
    print(f"Anthropic API Key exists and begins {anthropic_api_key[:7]}")
else:
    print("Anthropic API Key not set (and this is optional)")

if google_api_key:
    print(f"Google API Key exists and begins {google_api_key[:2]}")
else:
    print("Google API Key not set (and this is optional)")

if deepseek_api_key:
    print(f"DeepSeek API Key exists and begins {deepseek_api_key[:3]}")
else:
    print("DeepSeek API Key not set (and this is optional)")

if groq_api_key:
    print(f"Groq API Key exists and begins {groq_api_key[:4]}")
else:
    print("Groq API Key not set (and this is optional)")

OpenAI API Key exists and begins sk-proj-
Anthropic API Key not set (and this is optional)
Google API Key exists and begins AI
DeepSeek API Key not set (and this is optional)
Groq API Key not set (and this is optional)


In [10]:
request = "Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. "
request += "Answer only with the question, no explanation."
messages = [{"role": "user", "content": request}]

In [11]:
messages

[{'role': 'user',
  'content': 'Please come up with a challenging, nuanced question that I can ask a number of LLMs to evaluate their intelligence. Answer only with the question, no explanation.'}]

In [12]:
openai = OpenAI()
response = openai.chat.completions.create(
    model="gpt-4o-mini",
    messages=messages,
)
question = response.choices[0].message.content
print(question)


If you had to design a new ethical framework for AI decision-making that prioritizes both individual rights and collective well-being, what core principles would you include, and how would you address potential conflicts between those principles?


In [13]:
competitors = []
answers = []
messages = [{"role": "user", "content": question}]

In [None]:
# The API we know well

model_name = "gpt-4o-mini"

response = openai.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

In [None]:
# Anthropic has a slightly different API, and the max_tokens parameter is required

model_name = "claude-3-7-sonnet-latest"

claude = Anthropic()
response = claude.messages.create(model=model_name, messages=messages, max_tokens=1000)
answer = response.content[0].text

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

In [None]:
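# Gemini offers an OpenAI-compatible endpoint, so we can reuse the OpenAI client with a different base_url

gemini = OpenAI(api_key=google_api_key, base_url="https://generativelanguage.googleapis.com/v1beta/openai/")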
model_name = "gemini-2.0-flash"

response = gemini.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

In [None]:
deepseek = OpenAI(api_key=deepseek_api_key, base_url="https://api.deepseek.com/v1")
model_name = "deepseek-chat"

response = deepseek.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

In [None]:
groq = OpenAI(api_key=groq_api_key, base_url="https://api.groq.com/openai/v1")
model_name = "llama-3.3-70b-versatile"

response = groq.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)
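
The Gemini, DeepSeek and Groq cells above all repeat the same three steps: call `chat.completions.create`, pull out the text, and append to the two lists. Here is a minimal sketch of how that could be wrapped in a helper. The name `ask_openai_compatible` and the stand-in `FakeCompletions` client are hypothetical and exist only so the example runs without API keys; with real keys you would pass in `gemini`, `deepseek` or `groq` instead.

```python
from types import SimpleNamespace


def ask_openai_compatible(client, model_name, messages):
    """Send messages to any OpenAI-compatible client and return the reply text."""
    response = client.chat.completions.create(model=model_name, messages=messages)
    return response.choices[0].message.content


class FakeCompletions:
    """Stand-in for client.chat.completions (an assumption, for illustration only)."""

    def create(self, model, messages):
        # Echo the last user message back, shaped like a chat completion response
        text = f"[{model}] echo: {messages[-1]['content']}"
        message = SimpleNamespace(content=text)
        return SimpleNamespace(choices=[SimpleNamespace(message=message)])


fake_client = SimpleNamespace(chat=SimpleNamespace(completions=FakeCompletions()))

demo_competitors = []
demo_answers = []
demo_messages = [{"role": "user", "content": "What is 2 + 2?"}]

# Same append pattern as the cells above, just in a loop
for name in ["model-a", "model-b"]:
    demo_answers.append(ask_openai_compatible(fake_client, name, demo_messages))
    demo_competitors.append(name)

print(demo_competitors)
print(demo_answers)
```

Anthropic would not fit this helper as written, because its client uses `messages.create` and returns the text in `response.content[0].text`.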


## For the next cell, we will use Ollama

Ollama runs a local web service that gives an OpenAI compatible endpoint,  
and runs models locally using high performance C++ code.

If you don't have Ollama, install it by visiting https://ollama.com, pressing Download, and following the instructions.

After it's installed, you should be able to visit here: http://localhost:11434 and see the message "Ollama is running"

You might need to restart Cursor (and maybe reboot). Then open a Terminal (control+\`) and run `ollama serve`

Useful Ollama commands (run these in the terminal, or with an exclamation mark in this notebook):

`ollama pull <model_name>` downloads a model locally  
`ollama ls` lists all the models you've downloaded  
`ollama rm <model_name>` deletes the specified model from your downloads
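
The "Ollama is running" check mentioned above can also be done from Python rather than the browser. This is a small sketch using only the standard library; the function name `ollama_is_running` is my own, and it assumes the default address http://localhost:11434 given above.

```python
import urllib.error
import urllib.request


def ollama_is_running(url="http://localhost:11434", timeout=2):
    """Return True if the local Ollama service answers with its status message."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            body = resp.read().decode("utf-8", errors="replace")
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or an HTTP error: treat as "not running"
        return False
    return "Ollama is running" in body


print("Ollama reachable:", ollama_is_running())
```

If this prints `False`, try `ollama serve` in a terminal and run the check again.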

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/stop.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Super important - ignore me at your peril!</h2>
            <span style="color:#ff7800;">The model called <b>llama3.3</b> is FAR too large for home computers - it's not intended for personal computing and will consume all your resources! Stick with the nicely sized <b>llama3.2</b> or <b>llama3.2:1b</b> and if you want larger, try llama3.1 or smaller variants of Qwen, Gemma, Phi or DeepSeek. See <a href="https://ollama.com/models">the Ollama models page</a> for a full list of models and sizes.
            </span>
        </td>
    </tr>
</table>

In [None]:
!ollama pull llama3.2

In [None]:
ollama = OpenAI(base_url='http://localhost:11434/v1', api_key='ollama')
model_name = "llama3.2"

response = ollama.chat.completions.create(model=model_name, messages=messages)
answer = response.choices[0].message.content

display(Markdown(answer))
competitors.append(model_name)
answers.append(answer)

In [34]:
# So where are we?

print(competitors)
print(answers)


['gpt-4o-mini', 'gemini-2.0-flash', 'llama3.2']
["Designing an ethical framework for AI decision-making that balances individual rights and collective well-being is a complex and vital task. Below are core principles that could guide this framework, along with suggestions for addressing potential conflicts between them:\n\n### Core Principles\n\n1. **Autonomy and Respect for Individual Rights**: \n   - AI systems should respect individuals' rights to privacy, consent, and self-determination. Users should have control over their data and the decisions that affect their lives.\n  \n2. **Transparency and Explainability**: \n   - AI decision-making processes should be transparent. Users should have access to clear explanations regarding how decisions are made, the data used, and the algorithms applied. This builds trust and facilitates informed consent.\n\n3. **Beneficence and Non-Maleficence**: \n   - AI systems should prioritize promoting well-being and preventing harm, both at the indiv

In [None]:
# It's nice to know how to use "zip"
for competitor, answer in zip(competitors, answers):
    print(f"Competitor: {competitor}\n\n{answer}")


In [36]:
# Let's bring this together - note the use of "enumerate"

together = ""
for index, answer in enumerate(answers):
    together += f"# Response from competitor {index+1}\n\n"
    together += answer + "\n\n"

In [None]:
print(together)

In [38]:
judge = f"""You are judging a competition between {len(competitors)} competitors.
Each model has been given this question:

{question}

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...], "reason": "..."}}

Here are the responses from each competitor:

{together}

Now respond with the JSON with the ranked order of the competitors, nothing else. Do not include markdown formatting or code blocks."""
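
A quick aside on the doubled braces in the judge prompt: inside an f-string, `{{` and `}}` produce literal `{` and `}`, while single braces are filled in with a value. Here is a tiny example with made-up names (`count`, `template`) to show the difference:

```python
count = 3

# {count} is substituted; {{ and }} come out as literal braces
template = f"""Rank the {count} competitors.
Respond with JSON in this format:
{{"results": ["1", "2", "3"], "reason": "..."}}"""

print(template)
```

That is why the displayed prompt shows a plain JSON example with single braces.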


In [39]:
display(Markdown(judge))

You are judging a competition between 3 competitors.
Each model has been given this question:

If you had to design a new ethical framework for AI decision-making that prioritizes both individual rights and collective well-being, what core principles would you include, and how would you address potential conflicts between those principles?

Your job is to evaluate each response for clarity and strength of argument, and rank them in order of best to worst.
Respond with JSON, and only JSON, with the following format:
{"results": ["best competitor number", "second best competitor number", "third best competitor number", ...], "reason": "..."}

Here are the responses from each competitor:

# Response from competitor 1

Designing an ethical framework for AI decision-making that balances individual rights and collective well-being is a complex and vital task. Below are core principles that could guide this framework, along with suggestions for addressing potential conflicts between them:

### Core Principles

1. **Autonomy and Respect for Individual Rights**: 
   - AI systems should respect individuals' rights to privacy, consent, and self-determination. Users should have control over their data and the decisions that affect their lives.
  
2. **Transparency and Explainability**: 
   - AI decision-making processes should be transparent. Users should have access to clear explanations regarding how decisions are made, the data used, and the algorithms applied. This builds trust and facilitates informed consent.

3. **Beneficence and Non-Maleficence**: 
   - AI systems should prioritize promoting well-being and preventing harm, both at the individual and collective levels. This involves assessing the potential positive and negative impacts of AI decisions on both fronts.

4. **Justice and Fairness**: 
   - AI systems must be fair, seeking to eliminate bias and discrimination. Both individual and community benefits should be distributed equitably, ensuring that marginalized groups are not disproportionately harmed.

5. **Accountability and Responsibility**: 
   - There must be clear lines of accountability for AI decisions, ensuring that human oversight is maintained. Stakeholders, including developers and users, should be answerable for the outcomes of AI systems.

6. **Sustainability and Long-Term Considerations**: 
   - AI should be designed and implemented in ways that consider long-term impacts on society, the environment, and future generations, ensuring that collective well-being is maintained.

7. **Participatory Design and Engagement**: 
   - Engaging diverse stakeholders in the design and deployment of AI systems ensures that multiple perspectives are considered. This can help in identifying potential conflicts between individual rights and collective well-being.

### Addressing Conflicts Between Principles

Conflicts may arise between individual rights and collective well-being, and the following strategies can help manage these tensions:

1. **Prioritization of Principles**: 
   - Establish a hierarchy of principles to guide decision-making. For example, individual rights might take precedence in cases involving personal data privacy, while collective well-being might be prioritized in public health scenarios.

2. **Contextual Analysis**: 
   - Assess the specific context of each decision. Situational factors can influence how principles should be applied, potentially leading to different outcomes based on the context of use.

3. **Multi-Stakeholder Dialogues**: 
   - Facilitate discussions among diverse stakeholders to address conflicts. Engaging users, ethicists, developers, and policy-makers can lead to more equitable solutions that reflect a consensus on values.

4. **Iterative Feedback Mechanisms**: 
   - Implement systems that allow for continuous evaluation and adjustment of AI decisions based on real-world outcomes. Feedback loops can help identify and rectify conflicts as they arise.

5. **Scenario Planning**: 
   - Utilize predictive modeling and scenario analysis to foresee potential conflicts between principles, allowing for proactive measures to mitigate adverse effects.

6. **Ethical Oversight Committees**: 
   - Establish independent review boards to oversee AI systems, ensuring that ethical considerations are adhered to and providing an additional layer of accountability.

By adhering to these core principles and implementing approaches to address conflicts, the ethical framework for AI decision-making can strive to balance the rights of individuals with the well-being of society as a whole. This encompasses a commitment to evolving our understanding of ethics as technology advances and societal values shift.

# Response from competitor 2

Okay, here's an outline of a new ethical framework for AI decision-making, designed to balance individual rights and collective well-being, along with strategies for resolving potential conflicts:

**Framework Name:**  "Harmony AI" (or similar evocative name)

**I. Core Principles:**

1.  **Respect for Human Dignity and Autonomy:**
    *   **Description:** Every individual interacting with or affected by an AI system has inherent worth and the right to make informed choices about their lives. This includes the right to privacy, freedom of expression, and protection from manipulation.
    *   **Operationalization:**
        *   AI systems must be designed to be transparent about their capabilities, limitations, and potential biases.
        *   Individuals should have control over their data and the ability to opt-out of AI-driven processes where feasible.
        *   AI systems should not be used to coerce or exploit individuals.
        *   Accessibility should be a core design principle to ensure equal access and benefit for diverse users (e.g., language, disability, age).

2.  **Beneficence and Non-Maleficence (Do Good, Do No Harm):**
    *   **Description:** AI should be used to promote well-being, reduce suffering, and avoid causing harm to individuals, groups, or the environment.
    *   **Operationalization:**
        *   Rigorous impact assessments are mandatory before deploying AI systems, considering potential social, economic, and environmental consequences.
        *   AI systems must be designed to be robust, reliable, and safe, with mechanisms for monitoring and mitigating unintended consequences.
        *   Prioritize AI applications that address pressing societal challenges such as healthcare, education, and poverty alleviation.
        *   Implement "kill switches" or fail-safe mechanisms to shut down or redirect AI systems that pose an imminent threat.

3.  **Justice and Fairness:**
    *   **Description:** AI systems should be designed and deployed in a way that ensures equitable outcomes and avoids perpetuating or exacerbating existing inequalities.  This includes distributive justice (fair allocation of resources and opportunities), procedural justice (fair decision-making processes), and corrective justice (redress for harms).
    *   **Operationalization:**
        *   Data used to train AI systems must be representative and free from discriminatory biases.
        *   AI algorithms should be regularly audited for fairness and accuracy across different demographic groups.
        *   AI-driven decisions should be transparent and explainable, allowing individuals to understand the reasoning behind them and challenge unfair outcomes.
        *   Consideration of historical disadvantages and structural inequalities in designing AI solutions (e.g., affirmative action principles where appropriate).

4.  **Collective Well-being and Sustainability:**
    *   **Description:** AI should be used to promote the common good, support sustainable development, and protect the environment for current and future generations.
    *   **Operationalization:**
        *   Prioritize AI applications that address global challenges such as climate change, pandemics, and resource scarcity.
        *   Promote the responsible development and use of AI in areas such as healthcare, education, and infrastructure.
        *   Ensure that AI systems are energy-efficient and minimize their environmental impact.
        *   Foster international cooperation on AI governance and ethical standards.
        *   Long-term, consider the potential existential risks posed by advanced AI and develop safeguards to mitigate them.

5.  **Transparency, Accountability, and Explainability:**
    *   **Description:** AI systems should be transparent about their functionality and decision-making processes, and those responsible for their design, deployment, and use should be held accountable for their impacts.  Explainability (the ability to understand *why* an AI made a particular decision) is crucial.
    *   **Operationalization:**
        *   Develop clear standards for AI explainability, requiring AI systems to provide justifications for their decisions that are understandable to non-experts.  This may involve techniques like SHAP values, LIME, or other explainable AI (XAI) methods.
        *   Establish independent oversight bodies to monitor and regulate AI development and deployment.
        *   Implement robust mechanisms for auditing AI systems and identifying and addressing biases and errors.
        *   Develop clear legal frameworks that assign liability for harm caused by AI systems.
        *   Promote open-source AI development to encourage transparency and collaboration.

6.  **Continuous Learning and Adaptation:**
    *   **Description:**  Ethical frameworks for AI must be dynamic and adaptable to evolving technologies and societal values.  This requires ongoing monitoring, evaluation, and refinement of ethical principles and guidelines.
    *   **Operationalization:**
        *   Establish mechanisms for gathering feedback from stakeholders and incorporating it into the design and deployment of AI systems.
        *   Promote interdisciplinary research on the ethical, legal, and social implications of AI.
        *   Foster public dialogue and debate about the ethical challenges posed by AI.
        *   Regularly review and update ethical guidelines and regulations to reflect advances in AI technology and changes in societal values.  Embrace agile governance approaches.

**II. Addressing Conflicts Between Principles:**

Conflicts between individual rights and collective well-being are inevitable.  The following strategies can help to resolve them:

1.  **Proportionality:**
    *   Any restriction on individual rights in the name of collective well-being must be proportionate to the threat or benefit.  The least restrictive means necessary should be used.  Is the benefit to society significant enough to justify the infringement on an individual's right?

2.  **Necessity:**
    *   The restriction on individual rights must be necessary to achieve the desired outcome.  Are there alternative solutions that would not infringe on individual rights?

3.  **Transparency and Public Justification:**
    *   Any decision that prioritizes collective well-being over individual rights must be transparent and justified to the public.  The rationale for the decision should be clearly explained, and stakeholders should have the opportunity to provide feedback.

4.  **Due Process and Redress:**
    *   Individuals who are negatively affected by AI-driven decisions should have access to due process and redress.  This includes the right to appeal decisions, seek compensation for harm, and challenge the validity of the AI system.

5.  **Deliberative Processes and Stakeholder Engagement:**
    *   Engage in inclusive and deliberative processes to weigh competing values and interests.  Involve stakeholders from diverse backgrounds in the development and implementation of AI policies.  This includes ethicists, legal experts, technologists, policymakers, and members of the public.  Citizen assemblies or similar participatory mechanisms can be valuable.

6.  **Prioritization Framework:**
    *   Develop a framework for prioritizing ethical considerations in specific contexts.  This framework should identify the core values that are most relevant to the situation and provide guidance on how to balance competing interests.  For example, in healthcare settings, the principle of beneficence (doing good) may take precedence over the principle of autonomy in certain situations (e.g., emergency care).  However, these prioritizations should be carefully considered and justified.

7.  **Context-Specific Considerations:**
    *   Recognize that ethical considerations can vary depending on the context.  A solution that is appropriate in one setting may not be appropriate in another.  For example, the use of facial recognition technology may be more acceptable in high-security environments than in public spaces.

8.  **Sunset Clauses and Regular Review:**
    *   Implement sunset clauses for AI systems that restrict individual rights.  This ensures that these systems are regularly reviewed and re-evaluated to determine whether they are still necessary and proportionate.

9.  **Insurance and Compensation Mechanisms:**
    *   Explore the use of insurance and compensation mechanisms to provide redress to individuals who are harmed by AI systems.  This can help to mitigate the negative consequences of AI and promote accountability.

10. **"Ethics by Design" and "Value Sensitive Design":**  Incorporate ethical considerations from the very beginning of the AI development process.  Use frameworks like "Value Sensitive Design" to proactively identify and address potential ethical issues.

**III. Example Scenarios & Application of the Framework:**

Let's consider a few examples:

*   **Scenario 1:  AI-Powered Predictive Policing:**  An AI system is used to predict crime hotspots and allocate police resources.  This could infringe on individual rights to privacy and freedom of movement if it leads to disproportionate surveillance of certain communities.
    *   **Application of Harmony AI:**
        *   Transparency: The AI system's algorithms and data sources must be transparent and subject to independent audit.
        *   Fairness:  Data used to train the AI system must be carefully vetted for bias.
        *   Proportionality: The use of AI-powered policing must be proportionate to the actual crime rate in the areas being targeted.
        *   Due Process: Individuals who are stopped or questioned based on AI predictions must be treated with respect and have access to due process.
        *   Explainability:  Police officers must be able to explain the basis for their actions.

*   **Scenario 2:  AI-Driven Healthcare Diagnosis:**  An AI system is used to diagnose medical conditions.  This could lead to inaccurate diagnoses or biased treatment if the system is not properly designed and validated.
    *   **Application of Harmony AI:**
        *   Beneficence & Non-Maleficence:  The AI system must be rigorously tested and validated to ensure its accuracy and safety.
        *   Transparency & Explainability: Doctors must be able to understand the AI system's reasoning and explain it to patients.
        *   Autonomy:  Patients must have the right to seek a second opinion and make their own healthcare decisions.
        *   Justice:  The AI system must be designed to be fair and equitable across different demographic groups.

*   **Scenario 3:  AI-Powered Job Recruitment:**  An AI system is used to screen job applicants. This could perpetuate existing biases and limit opportunities for underrepresented groups.
    *   **Application of Harmony AI:**
        *   Fairness:  Algorithms and training data must be audited and adjusted to prevent biased outcomes.
        *   Transparency:  Candidates should understand how the AI system is evaluating their application.
        *   Autonomy:  Candidates should have the right to human review if they are rejected by the AI system.
        *   Beneficence:  The system should aim to identify candidates with the potential to succeed, not just those who fit a narrow profile.

**IV.  Key Considerations for Implementation:**

*   **Education and Training:**  Educate developers, policymakers, and the public about the ethical implications of AI.
*   **International Cooperation:**  Foster international collaboration on AI governance and ethical standards.
*   **Enforcement Mechanisms:**  Develop effective enforcement mechanisms to ensure compliance with ethical guidelines and regulations.
*   **Continuous Monitoring and Evaluation:**  Regularly monitor and evaluate the impact of AI systems and adapt ethical frameworks as needed.

This "Harmony AI" framework provides a starting point for developing more comprehensive and context-specific ethical guidelines for AI decision-making.  The key is to prioritize human dignity, promote well-being, and ensure fairness, while remaining flexible and adaptable to the evolving landscape of AI technology.


# Response from competitor 3

Designing an ethical framework for AI decision-making that balances individual rights with collective well-being is crucial to ensure AI systems are fair, transparent, and beneficial to society. Here's a proposed core set of principles:

Core Principles:

1. **Respect for Individual Autonomy**: Ensure that AI decisions respect individuals' autonomy, dignity, and freedom from coercion or manipulation. This includes protecting individual rights to privacy, consent, and the ability to make informed choices.
2. **Promoting Fairness and Non-Discrimination**: Implement mechanisms to prevent AI biases and ensure fairness in decision-making processes. This includes avoiding discrimination based on race, gender, religion, sexual orientation, age, disability, or other protected characteristics.
3. **Coluntary Transparency and Explainability**: Ensure that AI decisions are transparent, explainable, and provide context for human review and audit. This enables informed understanding of AI-driven outcomes and mitigates potential biases.
4. **Human Oversight and Control**: Limit AI decision-making to well-defined, specific domains where the benefits outweigh the risks. Human oversight and control ensure accountability when AI decisions conflict with individual rights or collective well-being.
5. **Safety and Vulnerability Protection**: Implement measures to safeguard vulnerable populations from AI-driven harm, including protection against algorithmic profiling and data misuse.
6. **Inclusive Value Alignment**: Incorporate stakeholders' values and interests into the development process, promoting inclusivity, diversity, and stakeholder engagement.

Addressing Potential Conflicts:

1. **Multi-Operator Framework**: Introduce multi-operator decision-making frameworks that engage multiple stakeholders, including human experts, algorithmic experts, and representative communities. This fosters a collaborative environment to resolve conflicts.
2. **Conflict Resolution Mechanisms**: Develop robust conflict resolution mechanisms, such as appeal systems or grievance procedures, to address disagreements between AI-driven decisions and individual rights or collective well-being.
3. **Value-Based Co-Design**: Implement value-based co-design processes where diverse stakeholders collaborate on defining algorithmic objectives in line with shared moral compasses.
4. **Human-AI Hybrid Modeling**: Utilize human-AI hybrid modeling approaches that leverage the strengths of both humans and AI systems, ensuring human judgment is embedded within decision-making processes.
5. **Regulatory Efficacy and Oversight**: Develop regulatory frameworks that promote effective governance over AI deployment, enabling accountability mechanisms to mitigate conflicts.
6. **Hybrid Feedback Loops**: Establish dynamic feedback loops between AI decision-makers and human stakeholders, allowing for ongoing assessment of system performance, identification of shortcomings, and continuous improvement.

Potential Conflict Resolution Strategies:

1. Human intervention in the decision-making process
2. Use of explainable AI techniques such as feature attribution or model interpretability to identify biases
3. Development of value-based AI systems that can dynamically adjust objectives to align with user preferences
4. Collaboration between humans, machines, and representatives from impacted communities to provide contextual input for decision-making

Implementing this framework requires a multidisciplinary approach, involving experts in computer science, ethics, philosophy, sociology, law, and more. The effectiveness of the framework relies on continuous monitoring, evaluation, and improvement, as AI systems evolve and interact with society.



Now respond with the JSON with the ranked order of the competitors, nothing else. Do not include markdown formatting or code blocks.

In [40]:
judge_messages = [{"role": "user", "content": judge}]

In [41]:
# Judgement time!

openai = OpenAI()
response = openai.chat.completions.create(
    model="o3-mini",
    messages=judge_messages,
)
results = response.choices[0].message.content
print(results)


{"results": ["2", "1", "3"], "reason": "Competitor 2's response is the most comprehensive, providing detailed core principles with operational steps, real-world examples, and robust conflict resolution strategies that cover multiple dimensions of ethical AI decision-making. Competitor 1 also offers a well-structured framework with clear principles and methods to address conflicts, but its overall depth and detail are slightly less than competitor 2. Competitor 3 presents a clear and structured approach with important points, yet it is less thorough and detailed compared to the other two responses."}


In [42]:
# OK let's turn this into results!

results_dict = json.loads(results)
ranks = results_dict["results"]
for index, result in enumerate(ranks):
    competitor = competitors[int(result)-1]
    print(f"Rank {index+1}: {competitor}")

Rank 1: gemini-2.0-flash
Rank 2: gpt-4o-mini
Rank 3: llama3.2
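
The `json.loads` call above works when the judge follows instructions exactly, but models sometimes wrap JSON in a Markdown code fence or repeat a competitor number. Below is a sketch of a more defensive parser. The name `parse_ranking` and its validation rule (each competitor appears exactly once) are my own assumptions, not part of any library.

```python
import json

FENCE = "`" * 3  # a Markdown code fence marker, built this way to keep the example tidy


def parse_ranking(raw, n_competitors):
    """Parse the judge's reply into a list of zero-based competitor indexes."""
    text = raw.strip()
    if text.startswith(FENCE):
        lines = text.splitlines()[1:]  # drop the opening fence line (with or without "json")
        if lines and lines[-1].strip() == FENCE:
            lines = lines[:-1]  # drop the closing fence line
        text = "\n".join(lines)
    ranks = [int(r) - 1 for r in json.loads(text)["results"]]
    # Every competitor should appear exactly once in the ranking
    if sorted(ranks) != list(range(n_competitors)):
        raise ValueError(f"Expected each competitor exactly once, got {ranks}")
    return ranks


# Sample reply in the same shape as the judge output above
sample = '{"results": ["2", "1", "3"], "reason": "example only"}'
sample_competitors = ["gpt-4o-mini", "gemini-2.0-flash", "llama3.2"]

for position, idx in enumerate(parse_ranking(sample, len(sample_competitors))):
    print(f"Rank {position+1}: {sample_competitors[idx]}")
```

Raising an error on a malformed ranking is one possible design choice; you could instead retry the judge call.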


<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/exercise.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#ff7800;">Exercise</h2>
            <span style="color:#ff7800;">Which pattern(s) did this use? Try updating this to add another Agentic design pattern.
            </span>
        </td>
    </tr>
</table>

<table style="margin: 0; text-align: left; width:100%">
    <tr>
        <td style="width: 150px; height: 150px; vertical-align: middle;">
            <img src="../assets/business.png" width="150" height="150" style="display: block;" />
        </td>
        <td>
            <h2 style="color:#00bfff;">Commercial implications</h2>
            <span style="color:#00bfff;">Patterns like this - sending a task to multiple models and evaluating the results -
            are common when you need to improve the quality of your LLM response. The approach applies broadly
            to business projects where accuracy is critical.
            </span>
        </td>
    </tr>
</table>