AI assessments and games in selection: high-risk, validity and accessibility
Gamified and psychometric AI assessments evaluate candidates and are therefore high-risk (Annex III). Three questions are decisive: does it really measure what matters, is it free of bias, and does it not exclude people with a disability? Emotion analysis via image or voice is moreover prohibited.
Short answer: AI assessments โ gamified tests, psychometric analyses, automated case scoring โ evaluate candidates and therefore fall under Annex III, point 4 of the AI Act as high-risk. Three questions decide whether such a tool passes muster: does it really measure something related to the job (validity), is it free of bias, and does it not exclude people with a disability? And note: an assessment that infers emotions via image or voice is outright prohibited.
Validity: does it measure what it promises?
The big promise of AI assessments is objectivity. But a model that correlates with "successful past employees" often mainly measures what the past looked like โ not what the job needs now. Without substantiated, job-relevant validation, a high "fit score" is false certainty. Ask the supplier for validation evidence and keep it.
Bias: looking objective is not being objective
As with CV screening, assessments learn from data containing past patterns. A game rewarding reaction speed or language use can systematically favour certain groups. The risk of discrimination is real and lies with the employer.
Accessibility: the silent exclusion
Gamified assessments assume a "standard candidate": good eyesight, fast clicking, fluent in the test language. Anyone with a visual, motor or cognitive disability, or less command of the language, scores lower for reasons unrelated to the job. Without reasonable accommodation that becomes indirect discrimination.
The emotion line
Assessments with video or voice analysis sometimes promise to measure "soft skills" or "engagement". To the extent that amounts to inferring emotions, it is prohibited under Article 5 โ not "if done carefully", but no.
What to do
- Treat every selection assessment as high-risk: human oversight, monitoring, information to the candidate.
- Demand validation evidence from the supplier: does it measure job-relevant traits?
- Test for bias and for outcomes by group.
- Offer reasonable accommodation and an alternative route for those who cannot take the assessment fairly.
- Drop emotion and "soft skill from video" features.
An assessment that does not measure what the job needs, is not free of bias and is not accessible is not an objective filter but a legal risk with a scientific veneer.
Sources
- https://eur-lex.europa.eu/eli/reg/2024/1689/oj
Regulation (EU) 2024/1689 (AI Act): Annex III point 4 (assessment of candidates = high-risk); Art. 5 prohibits emotion recognition. - https://eur-lex.europa.eu/eli/reg/2016/679/oj
Regulation (EU) 2016/679 (GDPR): basis, special-category data and transparency for automated assessment.
Read next
AI and accessibility: the European Accessibility Act and discrimination risks
AI must be usable by people with disabilities. From June 2025 the European Accessibility Act sets accessibility requirements for many digital products and services, while the AI Act counters discriminatory outcomes of AI systems.
AI in credit scoring: high risk under the AI Act
AI that assesses consumer creditworthiness is high-risk under Annex III of the AI Act. On top of that come GDPR Article 22 (automated decisions) and the prohibition of discrimination. Three regimes at once, with bias control at the core.
AI in education: what does the AI Act mean for schools and trainers?
AI that determines access to education, evaluates learning outcomes or monitors exam behaviour falls under Annex III and is high-risk. Emotion recognition in education is banned. The GDPR (often minors' data) and the AI-literacy duty also apply.