你以为训练数据是从哪里来的?
Where did you think the training data was coming from?

原始链接: https://idiallo.com/blog/where-did-the-training-data-come-from-meta-ai-rayban-glasses

围绕Meta智能眼镜的最新争议——直接将数据传输至Facebook——凸显了一个更广泛且普遍接受的现实:在现代技术下,真正的隐私越来越难以实现。从用胶带遮盖网络摄像头的笔记本电脑(甚至包括科技公司首席执行官的电脑),到Windows 10和Chromebook等操作系统,设备现在 routinely 收集用户数据用于AI训练和定向广告。 微软、谷歌,甚至苹果等公司,尽管声称注重隐私,但仍会收集遥测数据、个性化数据等,这些往往隐藏在冗长且未经质疑的服务条款中。Meta高度依赖广告收入(去年98%的1890亿美元),因此具有特别强烈的数据收集动机,正如其过去利用数十亿Instagram图片用于AI开发的行为所证明。 核心问题不是单一事件,而是AI运作的基本方式——它*需要*大量用户数据来学习。接受任何连接互联网的设备,意味着放弃控制权并信任公司负责任地使用你的信息,而这种信任越来越难以建立。

黑客新闻 新 | 过去 | 评论 | 提问 | 展示 | 招聘 | 提交 登录 训练数据你以为是从哪里来的? (idiallo.com) 23点 由 speckx 1小时前 | 隐藏 | 过去 | 收藏 | 讨论 帮助 指南 | 常见问题 | 列表 | API | 安全 | 法律 | 申请YC | 联系 搜索:
相关文章

原文

When the news broke that Meta's smart glasses were feeding data directly into their Facebook servers, I wondered what all the fuss was about. Who thought AI glasses used to secretly record people would be private? Then again, I've grown cynical over the years.

The camera on your laptop is pointed at you right now. When activated, it can record everything you do. When Zuckerberg posted a selfie with his laptop visible in the background, people were quick to notice that both the webcam and the microphone had black tape over them. If the CEO of one of the largest tech companies in the world doesn't trust his own device, what are the rest of us supposed to do?

On my Windows 7 machine, I could at least assume the default behavior wasn't to secretly spy on me. With good security hygiene, my computer would stay safe. For Windows 10 and beyond, that assumption may no longer hold. Microsoft's incentives have shifted. They now require users to create an online account, which comes with pages of terms to agree to, and they are in the business of collecting data.

As part of our efforts to improve and develop our products, we may use your data to develop and train our AI models.

That's your local data being uploaded to their servers for their benefit. Under their licensing agreement (because you don't buy Windows, you only license it) you are contractually required to allow certain information to be sent back to Microsoft:

By accepting this agreement or using the software, you agree to all of these terms, and consent to the transmission of certain information during activation and during your use of the software as per the privacy statement described in Section 3. If you do not accept and comply with these terms, you may not use the software or its features.

The data transmitted includes telemetry, personalization, AI improvement, and advertising features.

On a Chromebook, there was never an option to use the device without a Google account. Google is in the advertising business, and reading their terms of service, even partially, it all revolves around data collection. Your data is used to build a profile both for advertising and AI training.

None of this is a secret. It's public information, buried in those terms of service agreements we blindly click through. Even Apple, which touts itself as privacy-first in every ad, was caught using user data without consent. Tesla employees were found sharing videos recorded inside customers' private homes.


While some treat the Ray-Ban glasses story as an isolated incident, here is Yann LeCun, Meta's former chief AI scientist, describing transfer learning using billions of user images:

We do this at Facebook in production, right? We train large convolutional nets to predict hashtags that people type on Instagram, and we train on literally billions of images. Then we chop off the last layer and fine-tune on whatever task we want. That works really well.

That was seven years ago, and he was talking about pictures and videos people upload to Instagram. When you put your data on someone else's server, all you can do is trust that they use it as intended. Privacy policies are kept deliberately vague for exactly this reason. Today, Meta calls itself AI-first, meaning it's collecting even more to train its models.

Meta's incentive to collect data exceeds even that of Google or Microsoft. Advertising is their primary revenue source. Last year, it accounted for 98% of their forecasted $189 billion in revenue.

Yes, Meta glasses record you in moments you expect to be private, and their workers process those videos at their discretion. We shouldn't expect privacy from a camera or a microphone, or any internet-connected device, that we don't control. That's the reality we have to accept.

AI is not a magical technology that simply happens to know a great deal about us. It is trained on a pipeline of people's information: video, audio, text. That's how it works. If you buy the device, it will monitor you.


联系我们 contact @ memedata.com