Meta 的脑部扫描系统可无创读取句子,代码已开源
From brain waves to words: a new path to communication without surgery

原始链接: https://ai.meta.com/blog/brain2qwerty-brain-ai-human-communication/?_fb_noscript=1

研究人员发布了 **Brain2Qwerty v2**,这是非侵入式脑机文字转换技术的一项重大突破。该系统利用深度学习解码原始脑磁图(MEG)信号,实现了 61% 的词汇准确率。这不仅较以往的非侵入式方法有了巨大提升,也向着媲美手术级系统的性能迈进了一步。 与传统的植入式神经假体不同,Brain2Qwerty v2 无需进行手术。它通过在神经数据上微调大语言模型,将嘈杂的脑部活动转化为连贯的文本。针对 9 名参与者进行的 22,000 个句子的测试表明,准确率随数据量的增加呈对数线性增长,这意味着进一步扩大规模有望很快填补其与手术替代方案之间的性能差距。 为了促进科学进步,研究团队现已开源 v1 和 v2 版本的训练代码,其合作伙伴 BCBL 也正发布 v1 版本的数据集。这项研究旨在构建开放的脑部基础模型,最终目标是帮助神经损伤患者恢复交流能力。通过共享这些工具和数据集,开发者希望能加速全球神经科学领域的研究、诊断以及相关脑部疾病的治疗进程。

Meta 发布了一套全新的开源脑扫描系统,旨在以非侵入式的方式解码句子。尽管专家指出该技术并非完全创新,相比现有方法仅有微小的改进,但其底层代码和数据集的公开仍受到了研究界的赞赏。 Hacker News 上围绕这一消息的讨论体现了技术好奇心与显著不安的交织。评论者强调了几个实际障碍,特别是所需的脑磁图(MEG)硬件体积庞大,以及该系统目前仅 78% 的识别准确率。 除了技术局限性,讨论很快转向了反乌托邦式的猜想。用户表达了深切的隐私担忧,担心未来脑数据可能被商品化用于大模型训练、被用于监控,或被利用来提取密码等敏感信息。另一些人则以幽默的方式,设想了“大脑防御”的场景——例如在脑中背诵无意义的内容来扰乱数据,以此作为一种必要的抵抗手段。总的来说,人们的态度存在明显分歧:一方面是学术界对进展的认可,另一方面则是对神经监控所带来影响的生存恐惧。
相关文章

原文

Last year, we introduced Brain2Qwerty v1, research that uses AI to decode brain activity into text without any surgical implant. Now we're sharing the next step: Brain2Qwerty v2, the highest-performing end-to-end pipeline capable of real-time sentence decoding from non-invasive brain recordings, approaching levels of accuracy previously exclusive to techniques that require brain surgery.

To help accelerate neuroscience breakthroughs, we're releasing the full training code for Brain2Qwerty v1 and v2, and our partner, the Basque Center on Cognition, Brain, and Language (BCBL), is releasing the v1 dataset. We believe this research has the potential to make a real difference for the millions of people who suffer from brain lesions that prevent them from communicating. Invasive procedures like stereotactic electroencephalography and electrocorticography have shown that a neuroprosthesis feeding signals to an AI decoder can restore communication, but they're difficult to scale. Our noninvasive approach can help bridge that gap.

We trained Brain2Qwerty v2 on approximately 22,000 sentences from nine volunteer participants, each recorded for 10 hours wearing a magnetoencephalography (MEG) device while actively typing. Instead of relying on hand-crafted pipelines to detect neural events, we use end-to-end deep learning to decode directly from raw brain signals.

Fine-tuning large language models on neural data allows the system to leverage semantic context, bridging the gap between noisy brain recordings and coherent language. We also deployed AI agents to explore optimizations for the decoding pipeline, with final training configurations selected manually by engineers.

The result: Brain2Qwerty v2 recovers sentences coherently from noisy neural inputs, achieving a word accuracy rate of 61%, significantly improving upon the 8% word accuracy from other non-invasive methods. And for our best participant, we achieve a 78% word accuracy, where more than half of all sentences are decoded with one word error or less.

We also find that decoding accuracy improves log-linearly with data volume, suggesting that the remaining performance gap with surgical approaches could be further narrowed through data scaling alone. This work contributes to our efforts to build open foundational models of the brain, with our Tribev2 model for perception encoding, NeuralSet to process brain data at scale, and NeuralBench to systematically evaluate models. We do this in close collaboration with the community, through our recent $5 million fund to stimulate open datasets in our Digital Brain Project. Our hope is that this work, done in the open, advances neuroscience to identify, diagnose, and treat neurological disorders faster than in siloes.

联系我们 contact @ memedata.com