缩放偏见：视频会议中“声音细小”的社会成本

缩放偏见：视频会议中“声音细小”的社会成本
Zoom bias: The social costs of having a 'tinny' sound during video conferences

原始链接: https://phys.org/news/2025-03-bias-social-tinny-video-conferences.html

耶鲁大学一项发表于《美国国家科学院院刊》(Proceedings of the National Academy of Sciences) 的研究表明，视频会议中麦克风质量差会对演讲者的形象造成负面影响。研究人员发现，“尖细”的声音，通常与廉价麦克风相关，会导致听者对演讲者的智力、可信度甚至浪漫吸引力评价较低。通过涉及不同性别、口音甚至合成语音的实验，参与者始终更倾向于音质更丰富、更洪亮的声音。这种偏见也延伸到招聘场景中，音质更好的候选人被认为更适合录用。该研究揭示了一种潜在的无意识偏见来源，因为麦克风质量往往与社会经济地位相关。研究人员强调，虽然人们会注意自己在视频通话中的外表，但他们可能意识不到自己的声音在别人听来是什么样的，这可能会阻碍他们的职业和社交机会。因此，测试麦克风质量对于在线沟通至关重要。

Hacker News 上的一个讨论围绕着音频质量对视频会议的影响（“Zoom 偏见”）。用户强调清晰的音频对于留下积极印象以及避免被忽略至关重要。讨论的解决方案包括：避免使用蓝牙耳机作为麦克风输入，因为蓝牙耳机会有压缩；使用外部麦克风，例如 V-Moda BoomPro 或 Antlion ModMic Wireless，它们可以夹在现有的耳机上；甚至使用专业级的电容麦克风。一些人更喜欢桌面麦克风，而另一些人则探索领夹式麦克风。灯光和摄像头的质量也很重要。投资更好的音视频设备被比作投资一件定制西装——一项值得的专业升级。具体的麦克风推荐包括 Yeti Blue、Saramonic MV2000 和 Rode VideoMic2，这些麦克风因其改进的音质和硬件静音按钮等功能而受到好评。用户注意到这些升级带来的显著差异，甚至带来了职业上的认可。

（评论） 2024-05-27

（评论） 2024-07-29

AI 耳机让佩戴者可以通过观察人群中的某个人来聆听他们的声音 2024-05-30

（评论） 2023-12-25

原文

Most job candidates know to dress nicely for Zoom interviews and to arrange a professional-looking background for the camera. But a new Yale study suggests they also ought to test the quality of their microphones.

A tinny voice caused by a cheap mic, researchers say, could sink their chances.

Through a series of experiments, the study demonstrates that tinny speech—a thin, metallic sound—during video conferences can have surprisingly deep social consequences, leading listeners to lower their judgments of a speaker's intelligence, credibility, and romantic desirability. It can also hurt an individual's chances of landing a job.

These effects could be a potential source of unintentional bias and discrimination, given the likelihood that microphone quality is correlated with socioeconomic status, the researchers said.

"Now that videoconferencing has become so ubiquitous, we wondered how the sounds of people's voices might be influencing others' impressions, beyond the actual words they speak," said senior author Brian Scholl, professor of psychology in Yale's Faculty of Arts and Sciences and the Wu Tsai Institute.

"Every experiment we conducted showed that a familiar tinny or hollow sound associated with a poor-quality microphone negatively affects people's impressions of a speaker—independent of the message conveyed.

"This is both fascinating and concerning, especially when the sound of your voice is determined not just by your vocal anatomy, but also by the technology you're using."

The study is published in the journal the Proceedings of the National Academy of Sciences. Robert Walter-Terrill, a Ph.D. candidate in psychology in Yale's Graduate School of Arts and Sciences, is the study's lead author. Joan Danielle K. Ongchoco, an assistant professor in the psychology department of the University of British Columbia, and a former Ph.D. student at Yale, is a co-author.

Scholl, who is also director of the Yale Perception & Cognition Lab, was inspired to pursue the study by a faculty meeting that he had participated in over Zoom during the early stages of the COVID-19 pandemic.

During that video conference, one colleague was using a high-quality microphone in his home recording studio, which made his voice sound especially rich and resonant. Another colleague was using an old laptop that lent a tinny quality to his voice.

In that moment, it occurred to Scholl that the points raised by the colleague with the better sound quality seemed more profound, and those raised by the colleague with the weak mic seemed less compelling, even though he tended to see eye to eye with the latter colleague. It made him curious about how superficial audio quality may affect listeners' perceptions of a speaker.

To answer that question, the researchers performed a series of six experiments in which participants listened to a short speech recording and then made judgments about the speaker.

In each experiment, half of the participants were randomly assigned to listen to a recording that was clear and lively as though delivered through a high-quality microphone, while the other half heard a distorted version of the same message that mimicked the tinny and hollow sound associated with poor-quality microphones.

Importantly, the words themselves were always identical, and the distortions did not affect the comprehensibility of the messages: In each experiment, the participants were asked to transcribe the words and sentences they had heard to confirm they had understood them.

Across the experiments, the researchers varied both the speaker's gender (male and female) and accent (American and British). In some experiments, they even used obviously computer-synthesized voices, for which the "speakers" clearly couldn't be responsible, in terms of either their vocal anatomy or their microphone choices.

In an experiment testing the effects of microphone quality on whether a candidate is hirable, participants listened to a male human voice deliver a standard elevator pitch for a sales job. (This experiment was repeated using a computerized voice. It was also used to test the effects of audio quality on perceptions of intelligence.)

In an experiment concerning romantic desirability, they listened to a female human voice responding positively to a potential suitor's profile on a dating app. In one focused on credibility, participants listened to a computerized female voice with a British accent deny culpability for a traffic accident.

In these and the other experiments, participants' value judgments significantly favored the recordings with the richer and more resonant tones. They perceived the people from the higher-quality recordings as more hirable, desirable, intelligent, and credible. This work shows that judgments from speech are based not only on the content of their message, but also on the superficial vehicle through which that message is delivered.

The researchers noted that this effect may be particularly difficult to notice in practice.

"During videoconferencing, of course, you know how you look, since you can see yourself too," Walter-Terrill noted. "But on a call with dozens of people, you may be the only one who doesn't know how you sound to everyone else: you may hear yourself as rich and resonant, while everyone else hears a tinny voice."

More information: Walter-Terrill, Robert et al, Superficial auditory (dis)fluency biases higher-level social judgment, Proceedings of the National Academy of Sciences (2025). DOI: 10.1073/pnas.2415254122. doi.org/10.1073/pnas.2415254122

Citation: Zoom bias: The social costs of having a 'tinny' sound during video conferences (2025, March 24) retrieved 27 March 2025 from https://phys.org/news/2025-03-bias-social-tinny-video-conferences.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

缩放偏见：视频会议中“声音细小”的社会成本 Zoom bias: The social costs of having a 'tinny' sound during video conferences

缩放偏见：视频会议中“声音细小”的社会成本
Zoom bias: The social costs of having a 'tinny' sound during video conferences