Foundations

AI Ethics Introduction

Understand why AI ethics matters and the core principles guiding responsible AI development.

What is AI Ethics?

AI ethics is the field of study concerned with ensuring that AI systems are designed, developed, and deployed in ways that are safe, fair, transparent, and beneficial to humanity.

As AI becomes more capable and widely deployed, ethical considerations become critical for developers, organizations, and society.

Why AI Ethics Matters

  • Scale: AI systems affect millions of people simultaneously
  • Automation: Decisions once made by humans are now automated
  • Opacity: AI decision-making can be difficult to understand
  • Feedback loops: AI can amplify existing biases at scale
  • Power: AI concentrates power in those who build and control it
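The feedback-loop point can be made concrete with a toy simulation (all numbers below are hypothetical, chosen only to illustrate the dynamic): a model that over-weights the pattern it saw in the last round of data drifts further toward that pattern each round.

```python
# Toy illustration of a feedback loop amplifying an initial imbalance.
# The 1.1 amplification factor is hypothetical, not an empirical estimate.
def simulate_feedback_loop(initial_share: float, rounds: int) -> float:
    """Return one group's share of positive outcomes after several rounds,
    where each round the model slightly over-weights last round's majority."""
    share = initial_share
    for _ in range(rounds):
        share = min(1.0, share * 1.1)  # bias compounds round over round
    return share

# A 60% initial share drifts toward total dominance in a few rounds.
print(f"{simulate_feedback_loop(0.6, 5):.2f}")
```

The point is not the specific numbers but the shape of the curve: small initial skews compound when a model's outputs become its future training data.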

Core Ethical Principles

  • Fairness: AI should treat people equitably
  • Transparency: Decisions should be explainable
  • Privacy: Data should be protected
  • Safety: Systems should not cause harm
  • Accountability: Someone must be responsible
  • Beneficence: AI should benefit humanity
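The transparency principle can be sketched in code: instead of returning a bare score, a decision system can return each factor's contribution alongside it, so the decision is explainable. The factor names and weights below are hypothetical.

```python
# Minimal sketch of the transparency principle: return per-factor
# contributions alongside the final score so a decision can be explained.
# Factor names and weights are illustrative, not a real model.
def explainable_score(candidate: dict) -> tuple[float, dict]:
    weights = {"years_experience": 0.05, "skills_match": 0.3}
    contributions = {
        factor: candidate.get(factor, 0) * weight
        for factor, weight in weights.items()
    }
    score = min(1.0, 0.5 + sum(contributions.values()))
    return score, contributions

score, why = explainable_score({"years_experience": 5, "skills_match": 0.8})
print(f"score={score:.2f}, contributions={why}")
```

Returning the breakdown makes audits and appeals possible: a rejected candidate (or a regulator) can see which factors drove the outcome.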

Key Frameworks

  • EU AI Act: Risk-based regulatory framework
  • NIST AI Risk Management Framework: US voluntary guidelines
  • IEEE Ethically Aligned Design: Technical standards
  • Partnership on AI: Industry principles
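The EU AI Act's risk-based approach groups systems into tiers with different obligations. The sketch below is a simplified illustration of that idea, not legal guidance; the example use cases are illustrative placements, not official classifications.

```python
# Simplified, illustrative sketch of a risk-tier lookup in the spirit of
# the EU AI Act. Tier assignments here are examples, not legal advice.
RISK_TIERS = {
    "unacceptable": ["social scoring by governments"],
    "high": ["hiring and recruitment tools", "credit scoring"],
    "limited": ["chatbots that must disclose they are AI"],
    "minimal": ["spam filters", "video game AI"],
}

def lookup_tier(use_case: str) -> str:
    """Return the risk tier for a use case, or 'unclassified' if unknown."""
    for tier, examples in RISK_TIERS.items():
        if use_case in examples:
            return tier
    return "unclassified"

print(lookup_tier("credit scoring"))
```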

Example

python
# Illustrating bias in AI systems
import random

# Simulated hiring model (simplified example of biased data)
def biased_hiring_model(candidate: dict) -> float:
    """
    This model has learned biases from historical data.
    Historical data was biased: certain groups were underrepresented in senior roles.
    """
    score = 0.5  # baseline

    # Legitimate factors
    score += candidate.get("years_experience", 0) * 0.05
    score += candidate.get("skills_match", 0) * 0.3

    # Problematic: name-based bias learned from historical data
    # (Illustrates how bias enters ML systems)
    if candidate.get("name_signals_majority_group"):
        score += 0.1  # unfair advantage

    return min(1.0, score)

# Testing for bias
def audit_model_for_bias():
    """Test if a model produces different outcomes for similar candidates"""
    base_candidate = {
        "years_experience": 5,
        "skills_match": 0.8,
    }

    group_a = {**base_candidate, "name_signals_majority_group": True}
    group_b = {**base_candidate, "name_signals_majority_group": False}

    score_a = biased_hiring_model(group_a)
    score_b = biased_hiring_model(group_b)

    print(f"Group A score: {score_a:.2f}")
    print(f"Group B score: {score_b:.2f}")
    print(f"Score difference: {abs(score_a - score_b):.2f}")

    if abs(score_a - score_b) > 0.05:
        print("WARNING: Potential bias detected — similar candidates receive different scores")
    else:
        print("No significant bias detected")

audit_model_for_bias()

# Fairness-aware version
def fair_hiring_model(candidate: dict) -> float:
    """Only use job-relevant factors"""
    score = 0.5
    score += candidate.get("years_experience", 0) * 0.05
    score += candidate.get("skills_match", 0) * 0.3
    # No demographic factors
    return min(1.0, score)
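The audit above compares one matched pair of candidates. A common group-level check is the demographic parity difference: the gap in positive-outcome rates between two groups. A minimal sketch, with hypothetical scores and an assumed decision threshold:

```python
# Demographic parity difference: the gap in positive-outcome rates
# between two groups. Scores and the 0.7 threshold are hypothetical.
def demographic_parity_difference(scores_a, scores_b, threshold=0.7):
    rate_a = sum(s >= threshold for s in scores_a) / len(scores_a)
    rate_b = sum(s >= threshold for s in scores_b) / len(scores_b)
    return abs(rate_a - rate_b)

gap = demographic_parity_difference([0.9, 0.8, 0.6], [0.6, 0.5, 0.8])
print(f"parity gap: {gap:.2f}")
```

A nonzero gap is a signal to investigate, not proof of unfairness on its own; which fairness metric is appropriate depends on the context.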