(评论)
(comments)

原始链接: https://news.ycombinator.com/item?id=43479985

Hacker News 的讨论线程关注 Gemini 2.5 Pro 推理任务可行性的能力,暗示了大型语言模型的下一个发展阶段。评论者推测,实现 AGI(通用人工智能)的关键不在于创造单个超级智能模型,而在于有效地协调现有的大型语言模型和工具。 一位用户建议将复杂请求分解成由大型语言模型、智能体和工具组成的网络可以处理的子请求——这是一种通过协调函数调用来进行模块化问题解决的概念。这强调了强大的路由/协调层的的重要性。 另一位用户表达了同样的观点,指出将大型语言模型分层,并增加抽象级别,结合强化学习,可以显著提高规划任务的性能。大家感觉各个组件已经具备能力,挑战在于创造将它们结合在一起的“粘合剂”。原发帖人表示同意,他认为我们已经拥有了 AGI 的基础模块,只需要改进如何协调现有模型。


原文
Hacker News new | past | comments | ask | show | jobs | submit login
Gemini 2.5 Pro reasons about task feasibility (intellectronica.net)
30 points by intellectronica 2 hours ago | hide | past | favorite | 5 comments










Next step into LLM evolution is teaching them to procrastinate


Would be cool if the LLM can break up the request into sub-requests processable by LLMs. Current talk about agents mention some sort of router/orchestrator that delegates to other agents. But these can be another llm, another agent, another router itself or a simple tool call, etc - all function calls that wrap other llm-enabled sub components.

My feeling is that we have the pieces to build AGI. Like humans, we don't need a 400IQ person to solve all problems ('AGI'). What we have is coordination problems and in LLM land it's 'the glue' that's missing. Hopeful it's a matter of patterns/best-practices emerging.



> But these can be another llm

Yes! I share the feeling that once LLMs get good enough at some abstraction level, you can always put another "level" on top that should abstract what already works into bite sized pieces. Hassabis also mentions this in a recent podcast, different levels of abstraction. We'll probably see some tooling in this space shortly, to coordinate between the different levels. And then RL it and watch it demolish planning tasks benchmarks.

We might very well already be at the point where every level is achievable, we just have to glue them together.



And yes, I share the view / feeling that we basically got the AGI building blocks. Models will continue improving, but we can already get most of what we need just by orchestrating the latest generation of SOTA models. Crazy time to be alive!


I bet it can do that if hooked up to an agent system. Rate limits are still very restrictive now in the free API, but as soon as they make it available for more frequent use we'll find out.






Join us for AI Startup School this June 16-17 in San Francisco!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact



Search:
联系我们 contact @ memedata.com