Right now I think systems like ChatGPT are only at the first step: building a foundation model that can generalize and process data. There isn't much work going into transforming inputs into something the model can best understand (not at the tokenizer level, but even before that). We have a nascent field around this, i.e. prompt engineering, but nothing as sophisticated as AlphaFold exists for natural language or images yet.
People are stacking LLMs together and using system prompts to assist with this input processing. Maybe once more sophisticated systems like that are in place, we'll see something resembling real AGI.
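For what it's worth, that "stacking" usually looks something like the sketch below: one model call rewrites or structures the raw input, and a second call answers using the cleaned-up version. This is just a minimal illustration, not anyone's actual architecture; `call_llm` is a hypothetical stand-in for whatever chat-completion API you'd wire it to.

```python
# Minimal sketch of "stacking" LLM calls: a preprocessing call restructures
# the raw input before the answering model ever sees it.

def call_llm(system_prompt: str, user_message: str) -> str:
    """Hypothetical stand-in for a real chat-completion API call.

    Replace the body with a call to your provider's client; this mock just
    echoes the input so the pipeline runs end to end.
    """
    return f"[{system_prompt[:24]}...] {user_message}"

# System prompt for stage 1: clean up and structure the raw input.
PREPROCESS_PROMPT = (
    "Rewrite the user's input as a clear, self-contained question. "
    "Resolve pronouns, expand abbreviations, and drop filler."
)

# System prompt for stage 2: answer the cleaned-up question.
ANSWER_PROMPT = "Answer the question accurately and concisely."

def answer(raw_input: str) -> str:
    # Stage 1: one LLM call does the "input processing" step.
    cleaned = call_llm(PREPROCESS_PROMPT, raw_input)
    # Stage 2: a second LLM call answers using the structured version.
    return call_llm(ANSWER_PROMPT, cleaned)

print(answer("so uh whats the deal w/ tokenizers anyway"))
```

The point of the two stages is exactly the gap described above: the preprocessing call is a crude, hand-prompted substitute for a dedicated input-processing model that doesn't exist yet.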