Trustworthy AI Flashcards
Why Trustworthy AI
Is it Accurate
Is it Fair
is it easy to understand
is it accountable
did anyone tamper with it
Procedural Justice
Perceived fairness of the rules and processes to determine outcomes
Distributive Justice
perceived fairness of the outcomes or resource allocations themselves
AI Fairness considerations
1) The impact of outcomes
2) Nature and scope of decisions
3) Operational complexity and limits to scale
4) Compliance and governance capabilities
Fairness: Allocation
Allocation: used as classification problem
Fairness: Group Fairness
Individual Fairness
Groups defined by protected attributes receiving similar treatments
individual: similar individuals receiving similar outcomes
Fairness Metrics
Statistical Parity Difference
Equal Opportunity Difference
Pillars of Trustworthy AI
- *Explainability** is the ability of the AI model to explain how and why it arrived at a particular decision
- *Fairness** is the ability of the AI model to be free of bias in its decisions and to avoid unfair treatment of certain groups
- *Robustness** is the ability of the AI model to be safe and secure and not be vulnerable to any tampering or compromising the data they are trained on.
- *Transparency** is the ability to disclose information to increase the understanding of how an AI model or service was created and deployed
- *Governance** is the ability to direct, manage and control the AI activities throughout the AI lifecycle.
test, evaluation, verification, and validation (TEVV)
AI Risks and Trustworthiness (NIST)
- valid and reliable
- safe
- secure and resilient
- accountable and trans- parent
- explainable and interpretable
- privacy-enhanced
- fair with harmful bias managed
Validation
Validation is the “confirmation, through the provision of objective evidence, that the re- quirements for a specific intended use or application have been fulfilled” (Source: ISO 9000:2015).
Reliability
Reliability
Reliability is defined in the same standard as the “ability of an item to perform as required, without failure, for a given time interval, under given conditions” (Source: ISO/IEC TS 5723:2022). Reliability is a goal for overall correctness of AI system operation under the conditions of expected use and over a given period of time, including the entire lifetime of the system.
Accuracy
Accuracy is defined by ISO/IEC TS 5723:2022 as “closeness of results of observations, computations, or estimates to the true values or the values accepted as being true.”
Robustness or generalizability
Robustness or generalizability is defined as the “ability of a system to maintain its level of performance under a variety of circumstances”
AI systems, as well as the ecosystems in which they are deployed, may be said to be re- silient if they can withstand unexpected adverse events or unexpected changes in their envi- ronment or use – or if they can maintain their functions and structure in the face of internal and external change and degrade safely and gracefully when this is necessary