研究:回归基础方法在语言分析中可与人工智能相媲美或超越它。
Study: Back-to-basics approach can match or outperform AI in language analysis

原始链接: https://www.manchester.ac.uk/about/news/back-to-basics-approach-can-match-or-outperform-ai/

曼彻斯特大学的一项新研究揭示了一种令人惊讶的作者身份识别方法:分析语法。由安德烈亚·尼尼博士领导,该方法名为LambdaG,在确定文本作者方面与先进的人工智能系统相匹配或*优于*它们,尽管其相对简单。 LambdaG专注于语法、句子结构和标点符号中的模式——为每位作者创建独特的“行为特征”,而不是依赖于计算成本高昂且通常不透明的人工智能模型。 在12个真实世界的数据集(电子邮件、论坛、评论)上测试,它被证明比几个基于神经网络的系统更准确。 至关重要的是,LambdaG 提供了*透明度*,清晰地显示了驱动其结论的*哪些*语法特征。 这与“黑盒”人工智能形成对比,使其在法庭语言学、刑事调查和学术诚信等可解释性至关重要的领域具有潜在价值。 该研究挑战了更复杂的人工智能总是产生更好结果的假设,证明了基于语言学分析的力量。

黑客新闻 新的 | 过去的 | 评论 | 提问 | 展示 | 招聘 | 提交 登录 研究:回归基础的方法在语言分析中可以匹配或超越人工智能 (manchester.ac.uk) 9 分,由 giuliomagnifico 发表于 3 小时前 | 隐藏 | 过去的 | 收藏 | 讨论 帮助 指南 | 常见问题 | 列表 | API | 安全 | 法律 | 申请 YC | 联系 搜索:
相关文章

原文

A new study led by Dr Andrea Nini at The University of Manchester has found that a grammar-based approach to language analysis can match or outperform advanced AI systems in identifying who wrote a text. The method, called LambdaG, uses patterns in grammar and sentence construction rather than large-scale AI models, offering comparable accuracy with greater transparency and lower computational cost.

Key findings

  • A grammar-based authorship analysis method matched or exceeded leading AI systems across most test datasets
  • The approach outperformed several neural network-based authorship verification models
  • Researchers tested the method across 12 real-world writing datasets including emails, forums and reviews
  • The system is more transparent than many AI models because it shows which grammatical patterns informed decisions
  • Researchers say the findings challenge assumptions that more complex AI always produces better results

What did the study find?

Researchers found that a relatively simple, linguistically grounded method can perform as well as - and in some cases better than - complex artificial intelligence systems in identifying authorship.

The study suggests that increasingly sophisticated AI is not always necessary for high-performing writing analysis, particularly when methods are designed around established principles of how language works.

How does the LambdaG method work?

The method, called LambdaG, analyses patterns in grammar rather than relying on large-scale machine learning models.

It builds a statistical profile of how an individual writes by measuring features such as function word usage (words like it, of and the), sentence structure, punctuation patterns and other grammatical habits.

The researchers say these features create a distinctive behavioural signature for each writer.

Why is this different from AI-based authorship analysis?

Many current authorship verification systems rely on complex AI models trained on vast datasets. While effective, these systems can be difficult to interpret, computationally expensive and hard to explain in high-stakes settings such as legal investigations. By contrast, LambdaG provides a transparent explanation of which grammatical features influenced its conclusions.

How accurate was the method?

Researchers tested LambdaG across 12 datasets designed to reflect real-world writing scenarios, including emails, online forum posts and consumer reviews.

In most cases, the method achieved higher accuracy than several established authorship verification systems, including neural network-based approaches.

Why does grammar reveal authorship?

The researchers argue that grammar acts as a behavioural signature, like how we write our signature or how we walk.

Over time, individuals develop unconscious habits in how they structure sentences and use language. These habits create identifiable linguistic patterns that can distinguish one writer from another.

What are the potential applications?

The researchers say the method could support work in:

  • Forensic linguistics
  • Criminal investigations
  • Online abuse detection
  • Academic integrity monitoring
联系我们 contact @ memedata.com