新账号在HN使用EM破折号的可能性是旧账号的10倍。
New accounts on HN 10x more likely to use em-dashes

原始链接: https://www.marginalia.nu/weird-ai-crap/hn/

Hacker News (HN) 用户越来越怀疑有机器人大量涌入,理由是发布了无意义的帖子,并且整体感觉“不对劲”。为了调查,一位用户分析了最近的评论,具体比较了新注册账户和老用户的评论。 对双方各700条评论的分析显示出显著差异。新账户使用过多的标点符号,如破折号和箭头,的可能性是老账户的**近十倍**(17.5% vs 1.8%),具有非常高的统计显著性(p=7e-20)。它们也更有可能提及人工智能和大型语言模型(18.7% vs 11.8%,p=0.0018)。 虽然人类用户偶尔也会使用这些写作风格,但这种巨大的差异表明存在自动化活动。这些数据支持了人们日益增长的担忧,即机器人正在显著影响 HN 的评论环境。

## 黑客新闻与AI生成评论:摘要 最近黑客新闻上的一场讨论指出,AI生成评论的一个潜在指标是:新账户过度使用破折号(—)。作者观察到,新评论中使用破折号的比例为32:1,表明这超出了典型的人类写作模式。 用户推测,这是因为AI模型被指示使用破折号等风格元素来显得更像人类,同时也默认使用正式的写作风格。一些评论员指出,他们已经停止使用破折号,担心被标记为AI。 对话还涉及平台上的更广泛的机器人活动问题,包括对叙事操纵的担忧,以及区分人类用户、AI辅助用户和完全自动化的机器人之间的困难。 提出的解决方案包括身份验证,以及简单地接受人类生成内容和AI生成内容日益难以区分的现状。 一些用户戏谑地接受了破折号的“咔哒”声,将其视为机器人存在的标志。 最终,这场讨论强调了在日益复杂的AI时代,维持真实的在线讨论的挑战。
相关文章

原文

I’ve had this sense that HN has gotten absolutely innundated with bots last few months. First most obvious giveaway is the frequency with which you see accounts posting brilliant insights like

13 60 well and t6ctctfuvuh7hguhuig8h88gd to f6gug7h8j8h6fzbuvubt GB I be cugttc fav uhz cb ibub8vgxgvzdrc to bubuvtxfh tf d xxx h z j gj uxomoxtububonjbk P.l.kvh cb hug tf 6 go k7gtcv8j9j7gimpiiuh7i 8ubg

or

1662476506

or

Аё

Beyond the accounts that are visibly glitching out, the vibe is also seriously off. Lots of comments that are incredibly banal, or oddly off topic. Hard to really put a finger on how, but I had the idea of scraping /newcomments and /noobcomments to see if I could make sense of it. First is for comments that are recently made, and the second is for comments that are recently made by newly registred accounts.

With some simple statistics, I quickly found that:

  • Comments from newly registered accounts are nearly 10x more likely to use em-dashes, arrows, and other symbols in their text (17.47% vs 1.83% of comments). p = 7e-20

  • Comments from newly registered accounts on HN are also more likely to mention AI and LLMs (18.67% vs 11.8% of comments). p=0.0018

Sample size isn’t enormous, about 700 of each category, but these are pretty big differences. While regular humans sometimes use EM-dashes, arrows, and the like, it’s hard to explain why new accounts would be 10x more prone to using them than established accounts.

Sources and data

联系我们 contact @ memedata.com