发布日期: 2025-11-25

更新日期: 2025-11-27

文章字数: 1.4k

阅读时长: 5 分

阅读次数:

⚠️ 以下所有内容总结都来自于大语言模型的能力，如有错误，仅供参考，谨慎使用
🔴 请注意：千万不要用于严肃的学术场景，只能用于论文阅读前的初筛！
💗 如果您觉得我们的项目对您有帮助 ChatPaperFree ，还请您给我们一些鼓励！⭐️ HuggingFace免费体验

2025-11-25 更新

Behind the Screens: Uncovering Bias in AI-Driven Video Interview Assessments Using Counterfactuals

Authors:Dena F. Mujtaba, Nihar R. Mahapatra

AI-enhanced personality assessments are increasingly shaping hiring decisions, using affective computing to predict traits from the Big Five (OCEAN) model. However, integrating AI into these assessments raises ethical concerns, especially around bias amplification rooted in training data. These biases can lead to discriminatory outcomes based on protected attributes like gender, ethnicity, and age. To address this, we introduce a counterfactual-based framework to systematically evaluate and quantify bias in AI-driven personality assessments. Our approach employs generative adversarial networks (GANs) to generate counterfactual representations of job applicants by altering protected attributes, enabling fairness analysis without access to the underlying model. Unlike traditional bias assessments that focus on unimodal or static data, our method supports multimodal evaluation-spanning visual, audio, and textual features. This comprehensive approach is particularly important in high-stakes applications like hiring, where third-party vendors often provide AI systems as black boxes. Applied to a state-of-the-art personality prediction model, our method reveals significant disparities across demographic groups. We also validate our framework using a protected attribute classifier to confirm the effectiveness of our counterfactual generation. This work provides a scalable tool for fairness auditing of commercial AI hiring platforms, especially in black-box settings where training data and model internals are inaccessible. Our results highlight the importance of counterfactual approaches in improving ethical transparency in affective computing.

人工智能增强的性格评估正越来越多地影响招聘决策，利用情感计算预测大五模型（OCEAN）的特征。然而，将人工智能融入这些评估引发了道德上的担忧，特别是关于基于训练数据的偏见放大问题。这些偏见可能导致基于性别、种族和年龄等受保护属性的歧视性结果。为解决这一问题，我们引入了一个基于反事实的理论框架，系统地评估和量化人工智能驱动的性格评估中的偏见。我们的方法采用生成对抗网络（GANs）生成工作申请人的反事实表示，通过改变受保护的属性进行反事实生成，从而在无法访问底层模型的情况下进行公平性分析。与传统的专注于单模态或静态数据的偏见评估不同，我们的方法支持跨视觉、音频和文本特征的多模态评估。这种全面方法在招聘等高风险应用中尤为重要，第三方供应商经常将人工智能系统作为黑匣子提供。将该方法应用于最先进的性格预测模型，我们的方法揭示了不同人群之间的显著差异。我们还使用受保护的属性分类器验证了我们的框架的有效性，以确认我们的反事实生成的效果。这项工作为审计商业人工智能招聘平台的公平性提供了一个可扩展的工具，特别是在无法访问训练数据和模型内部的黑匣子环境中。我们的结果强调了反事实方法在改善情感计算中的道德透明度方面的重要性。

论文及项目相关链接

PDF

Summary
AI增强型人格评估正越来越多地影响招聘决策，利用情感计算预测大五人格模型中的特质。但AI的集成引发了伦理问题，特别是在训练数据中的偏见放大问题。偏见可能导致基于保护属性（如性别、种族和年龄）的歧视结果。为解决这一问题，我们引入基于假设的框架来系统地评估和量化AI驱动的人格评估中的偏见。我们的方法使用生成对抗网络（GANs）生成工作申请者的假设表示，通过改变保护属性，可以在不了解底层模型的情况下进行公平分析。该方法支持视觉、音频和文本特征的跨模态评估，不同于传统关注单模态或静态数据的偏见评估。我们将其应用于最新的人格预测模型，并揭示不同人口群体之间的显著差异。我们的框架使用保护属性分类器进行验证，证明假设生成的有效性。这项工作为商业AI招聘平台的公平审计提供了可伸缩的工具，特别是在无法访问训练数据和模型内部的黑匣子环境中。我们的研究强调了假设方法在情感计算中提高伦理透明度的重要性。

Key Takeaways