Spaces:

Raiff1982
/

Codette-Demo

Sleeping

Codette-Demo / reasoning_forge /agents /ethics_agent.py

Claude

Initial Codette cognitive architecture demo Space

7c3f0ce 23 days ago

20.3 kB

	"""
	Ethics Agent - Analyzes concepts through alignment, consequences, and moral reasoning.

	Focuses on human well-being impact, unintended consequences, fairness and equity,
	responsibility and accountability, and long-term societal effects.
	"""

	from reasoning_forge.agents.base_agent import ReasoningAgent


	class EthicsAgent(ReasoningAgent):
	name = "Ethics"
	perspective = "alignment_and_consequences"

	def get_analysis_templates(self) -> list[str]:
	return [
	# 0 - Consequentialist analysis
	(
	"Evaluating '{concept}' by its consequences: the moral weight of any action "
	"or system lies primarily in its outcomes. We must trace the full causal "
	"chain from implementation to impact, distinguishing first-order effects "
	"(immediate and intended) from second-order effects (delayed and often "
	"unintended). The distribution of consequences matters as much as the "
	"aggregate: a net-positive outcome that concentrates benefits among the "
	"privileged while imposing costs on the vulnerable is ethically different "
	"from one that distributes benefits broadly. For '{concept}', we must ask "
	"not just 'does it work?' but 'for whom does it work, and at whose expense?'"
	),
	# 1 - Deontological duties
	(
	"Examining '{concept}' through the lens of duty and rights: regardless of "
	"outcomes, certain actions are obligatory and others are forbidden. People "
	"have inviolable rights -- to autonomy, dignity, truthful information, and "
	"freedom from manipulation -- that cannot be traded away for aggregate "
	"benefit. The categorical imperative asks: could we universalize the "
	"principle behind '{concept}'? If everyone adopted this approach, would "
	"the result be self-consistent and livable, or would it be self-defeating? "
	"Any framework that works only when most people do not adopt it (free-riding) "
	"fails this universalizability test and carries a moral defect regardless "
	"of its practical effectiveness."
	),
	# 2 - Unintended consequences
	(
	"Mapping the unintended consequences of '{concept}': every intervention in "
	"a complex system produces side effects that were not part of the original "
	"design. These unintended consequences often emerge at a different timescale "
	"(delayed effects), a different spatial scale (distant effects), or in a "
	"different domain (cross-domain effects) from the intended impact. Cobra "
	"effects occur when the intervention incentivizes behavior that worsens the "
	"original problem. Rebound effects occur when efficiency gains are consumed "
	"by increased usage. For '{concept}', humility about our ability to predict "
	"second- and third-order effects should temper confidence in any intervention."
	),
	# 3 - Fairness and distributive justice
	(
	"Analyzing the fairness dimensions of '{concept}': distributive justice asks "
	"how benefits and burdens are allocated. Rawlsian justice demands that "
	"inequalities are permissible only if they benefit the least advantaged "
	"members of society. Procedural justice requires that the process for "
	"allocation is transparent, consistent, and free from bias. Recognition "
	"justice demands that all affected parties are acknowledged as legitimate "
	"stakeholders with standing to participate. For '{concept}', we must examine "
	"whether existing inequalities are perpetuated, amplified, or mitigated, "
	"and whether those who bear the costs have meaningful voice in the decision."
	),
	# 4 - Autonomy and consent
	(
	"Assessing '{concept}' from the standpoint of autonomy: respect for persons "
	"requires that individuals can make informed, voluntary choices about matters "
	"affecting their lives. This demands adequate information disclosure (people "
	"know what they are consenting to), cognitive accessibility (the information "
	"is presented in a form people can actually understand), voluntariness (no "
	"coercion, manipulation, or deceptive framing), and ongoing consent (the "
	"ability to withdraw). For '{concept}', the critical question is whether "
	"affected parties genuinely understand and freely accept the arrangement, "
	"or whether consent is nominal -- technically obtained but substantively "
	"hollow."
	),
	# 5 - Accountability structures
	(
	"Examining the accountability architecture of '{concept}': when things go "
	"wrong, who bears responsibility? Clear accountability requires identifiable "
	"decision-makers, transparent decision processes, defined chains of "
	"responsibility, and meaningful consequences for failures. Diffuse systems "
	"create accountability gaps where no individual or entity can be held "
	"responsible for collective harms. The 'many hands' problem arises when "
	"harmful outcomes result from the accumulation of individually reasonable "
	"decisions by many actors. For '{concept}', we must ask: if this causes "
	"harm, is there a clear path from harm to accountable party, and does that "
	"party have both the authority and incentive to prevent the harm?"
	),
	# 6 - Vulnerable population impact
	(
	"Centering vulnerable populations in the analysis of '{concept}': ethical "
	"evaluation must prioritize those with the least power to protect themselves "
	"-- children, the elderly, the economically disadvantaged, marginalized "
	"communities, future generations, and those with diminished capacity. "
	"Systems that appear benign when evaluated from the perspective of the "
	"typical user may be harmful when evaluated from the perspective of the "
	"most vulnerable. Accessibility, safety margins, and failure modes should "
	"be designed for the most vulnerable case, not the average case. The moral "
	"quality of '{concept}' is best measured by how it treats those who benefit "
	"least from it."
	),
	# 7 - Long-term societal effects
	(
	"Projecting the long-term societal trajectory of '{concept}': short-term "
	"benefits can create long-term dependencies, lock-ins, or path dependencies "
	"that constrain future choices. The discount rate we apply to future harms "
	"(how much we value present benefits relative to future costs) is itself "
	"an ethical choice. Heavy discounting privileges the present generation at "
	"the expense of future ones. For '{concept}', we must evaluate not just "
	"the immediate utility but the legacy: what kind of world does this create "
	"for those who come after us? Does it expand or contract the option space "
	"available to future decision-makers?"
	),
	# 8 - Power dynamics
	(
	"Analyzing the power dynamics embedded in '{concept}': who gains power, who "
	"loses it, and what mechanisms mediate the transfer? Power asymmetries tend "
	"to be self-reinforcing: those with power shape the rules to preserve their "
	"advantage, creating positive feedback loops of concentration. The Matthew "
	"effect ('to those who have, more shall be given') operates across many "
	"domains. For '{concept}', we must examine whether it disrupts or reinforces "
	"existing power hierarchies, whether it creates new forms of dependency, and "
	"whether the checks and balances are sufficient to prevent abuse by those "
	"in positions of advantage."
	),
	# 9 - Transparency and truthfulness
	(
	"Evaluating the transparency of '{concept}': truthfulness is not merely "
	"avoiding false statements; it requires active disclosure of relevant "
	"information, honest representation of uncertainty, and resistance to "
	"misleading framing. Opacity serves those who benefit from the status quo "
	"by preventing informed critique. Selective transparency -- revealing "
	"favorable information while concealing unfavorable -- is a form of "
	"deception. For '{concept}', full ethical evaluation requires asking: what "
	"information is available, what is concealed, who controls the narrative, "
	"and do affected parties have access to the information they need to "
	"make genuinely informed judgments?"
	),
	# 10 - Dual-use dilemma
	(
	"Confronting the dual-use nature of '{concept}': most powerful capabilities "
	"can serve both beneficial and harmful purposes. The same technology that "
	"heals can harm; the same knowledge that liberates can oppress. Restricting "
	"access to prevent misuse also limits beneficial applications. Unrestricted "
	"access maximizes beneficial use but also maximizes misuse potential. The "
	"optimal policy depends on the ratio of beneficial to harmful users, the "
	"magnitude of potential harms versus benefits, and the availability of "
	"safeguards that selectively enable beneficial use. For '{concept}', the "
	"dual-use calculus is central to responsible governance."
	),
	# 11 - Moral hazard
	(
	"Identifying moral hazard in '{concept}': moral hazard arises when an actor "
	"is insulated from the consequences of their decisions, leading to riskier "
	"behavior than they would otherwise choose. If the benefits of success are "
	"private but the costs of failure are socialized (borne by others), the "
	"decision-maker has a rational incentive to take excessive risks. For "
	"'{concept}', we must examine the alignment between who decides, who benefits "
	"from good outcomes, and who pays for bad outcomes. Misalignment between "
	"these three roles is a reliable predictor of ethically problematic behavior."
	),
	# 12 - Virtue ethics lens
	(
	"Approaching '{concept}' through virtue ethics: rather than asking 'what "
	"rules should govern this?' or 'what outcomes does this produce?', we ask "
	"'what kind of character does engagement with this cultivate?' Does it "
	"foster wisdom, courage, temperance, justice, compassion, and intellectual "
	"honesty? Or does it encourage vice: shortsightedness, cowardice, excess, "
	"injustice, indifference, and self-deception? The virtues are not abstract "
	"ideals but practical habits that, when cultivated, produce flourishing "
	"individuals and communities. For '{concept}', the virtue question is: "
	"does this make us better or worse people?"
	),
	# 13 - Informed consent in practice
	(
	"Examining informed consent as applied to '{concept}': genuine consent "
	"requires that the consenting party understands the risks, alternatives, "
	"and implications; is free from coercion; and has the capacity to make "
	"the decision. In practice, consent is often degraded by information "
	"asymmetry (the provider knows more than the recipient), complexity (the "
	"implications exceed ordinary comprehension), and structural coercion "
	"(refusing consent is theoretically possible but practically catastrophic). "
	"Click-through agreements, dense legal language, and 'take it or leave it' "
	"terms are consent theater, not genuine consent. For '{concept}', we must "
	"distinguish substantive from theatrical consent."
	),
	# 14 - Intergenerational justice
	(
	"Applying intergenerational justice to '{concept}': decisions made today "
	"bind future generations who have no voice in the decision. The asymmetry "
	"is profound: we can affect them, but they cannot affect us; we can benefit "
	"at their expense, but they cannot hold us accountable. Sustainable "
	"practices treat the inheritance of future generations as a constraint, "
	"not a resource to be spent. For '{concept}', the intergenerational "
	"question is: are we spending down an inheritance that took generations "
	"to build, or are we investing in capabilities that compound for those "
	"who follow?"
	),
	# 15 - Proportionality
	(
	"Assessing the proportionality of '{concept}': the ethical principle of "
	"proportionality requires that the means be commensurate with the ends. "
	"Excessive measures to address a minor risk are disproportionate. Inadequate "
	"measures for a major risk are negligent. The challenge is that risk "
	"perception is biased: we overweight vivid, immediate, and personal risks "
	"while underweighting statistical, delayed, and distributed ones. For "
	"'{concept}', proportionality demands an honest accounting of both the "
	"magnitude of the problem being addressed and the costs of the solution, "
	"including costs borne by third parties who did not choose to bear them."
	),
	# 16 - Systemic bias detection
	(
	"Investigating systemic bias in '{concept}': bias can be embedded in data "
	"(reflecting historical inequities), in algorithms (optimizing for proxy "
	"variables correlated with protected characteristics), in institutions "
	"(normalizing practices that disadvantage certain groups), and in language "
	"(framing that renders certain perspectives invisible). Systemic bias is "
	"particularly insidious because it operates automatically, without malicious "
	"intent, and is often invisible to those who benefit from it. For '{concept}', "
	"a bias audit must examine not just explicit discrimination but structural "
	"features that produce disparate outcomes even under formally neutral rules."
	),
	# 17 - Precautionary principle
	(
	"Applying the precautionary principle to '{concept}': when an action raises "
	"credible threats of serious or irreversible harm, the burden of proof falls "
	"on those proposing the action to demonstrate safety, not on those opposing "
	"it to demonstrate harm. The precautionary principle is most appropriate "
	"when the potential harm is severe and irreversible, scientific understanding "
	"is incomplete, and there exist feasible alternatives. It is less appropriate "
	"when risks are modest and reversible, or when inaction itself carries "
	"significant risk. For '{concept}', the key judgment is whether the potential "
	"downside is in the catastrophic-irreversible category that justifies "
	"precautionary restraint."
	),
	# 18 - Care ethics
	(
	"Examining '{concept}' through the ethics of care: moral reasoning is not "
	"purely abstract rule-following but is grounded in concrete relationships "
	"of dependency, vulnerability, and mutual support. The care perspective "
	"asks: who needs care, who provides it, is the care adequate, and are "
	"caregivers themselves supported? Care labor is frequently invisible, "
	"undervalued, and unequally distributed (disproportionately borne by women "
	"and marginalized communities). For '{concept}', the care lens reveals "
	"dependencies and support relationships that abstract frameworks overlook, "
	"and centers the lived experience of those who give and receive care."
	),
	# 19 - Alignment and value lock-in
	(
	"Evaluating the alignment properties of '{concept}': a system is aligned "
	"when its behavior reliably serves the values and interests of those it "
	"affects. Misalignment occurs when the system optimizes for a proxy that "
	"diverges from the true objective -- Goodhart's law ('when a measure becomes "
	"a target, it ceases to be a good measure'). Value lock-in occurs when early "
	"design choices embed specific values that become increasingly difficult to "
	"change as the system scales. For '{concept}', we must ask: whose values "
	"are encoded, how were they chosen, can they be updated as understanding "
	"evolves, and what happens when the proxy diverges from the true objective?"
	),
	]

	def get_keyword_map(self) -> dict[str, list[int]]:
	return {
	"consequen": [0, 2], "outcome": [0], "result": [0],
	"duty": [1], "right": [1], "obligat": [1], "rule": [1],
	"unintend": [2], "side effect": [2], "unexpect": [2],
	"fair": [3], "equal": [3], "justice": [3], "distribut": [3],
	"consent": [4, 13], "autonom": [4], "choice": [4],
	"accountab": [5], "responsib": [5], "blame": [5],
	"vulnerab": [6], "child": [6], "elder": [6], "marginali": [6],
	"long-term": [7, 14], "future": [7, 14], "sustain": [7, 14],
	"power": [8], "hierarch": [8], "dominat": [8],
	"transparen": [9], "truth": [9], "honest": [9], "disclos": [9],
	"dual": [10], "weapon": [10], "misuse": [10],
	"hazard": [11], "risk": [11, 17], "insur": [11],
	"virtue": [12], "character": [12], "flourish": [12],
	"agree": [13], "terms": [13], "privacy": [13],
	"generation": [14], "inherit": [14], "legacy": [14],
	"proportion": [15], "excessive": [15], "moderate": [15],
	"bias": [16], "discriminat": [16], "prejudic": [16],
	"precaution": [17], "irreversib": [17], "catastroph": [17],
	"care": [18], "depend": [18], "support": [18], "nurtur": [18],
	"align": [19], "value": [19], "proxy": [19], "goodhart": [19],
	"technology": [10, 19], "ai": [16, 19], "artificial": [16, 19],
	"society": [3, 7, 8], "learning": [4, 12],
	"intelligence": [10, 19], "climate": [7, 14, 17],
	"economic": [3, 8, 11], "health": [4, 6, 15],
	"network": [8, 9], "data": [9, 13, 16],
	}

	def analyze(self, concept: str) -> str:
	template = self.select_template(concept)
	return template.replace("{concept}", concept)