(评论)
(comments)
原始链接: https://news.ycombinator.com/item?id=39391458
根据提供的反馈,澄清以下几点:
1. 关于生成视频中的合理性和一致性问题的投诉,作者承认,虽然当前技术已经有了显着改进,但仍然需要微调和优化,以解决合理性和一致性方面的错误,特别是对于不常见或不常见的构图。 在训练数据中并不经常出现。 此类错误通常会导致微小差异的累积,从而导致整体体验脱节,从而阻碍模拟复杂现实生活场景或创建令人信服的虚拟环境的有效性。 需要进一步完善模型,以确保事件之间的逻辑联系、基于物理的现象的精确建模以及环境条件和元素的无缝集成,以避免突然或不和谐的转变。 因此,虽然当前的技术具有重大前景,但必须采取额外的步骤将其推进到更复杂和实用的物理模拟。 2. 关于与侵犯版权和违反许可协议有关的所有权问题的批评,必须指出的是,这些发展的主要目的是促进科学发现和教育,而不是商业利用。 以学术界为中心的努力往往遵循相关当局和管理机构制定的指导方针,确保严格遵守知识产权标准和协议。 尽管对法律责任的担忧不容忽视,但在学术界对创新和研究发展的共同承诺下,这一问题通常会平息。 尽管如此,适当的预防措施必须始终放在首位。 3. 在讨论对术语使用造成混乱或模糊的批评时,应该注意的是,语义上的细微差别虽然至关重要,但对所讨论的概念、理论或原则的优点或功效影响不大。 此外,语言学或双关语策略对真正的技术创新、科学发现或学术成就的影响应该有限。 最终,无论术语惯例或哲学考虑如何,如果所提出的方法能够带来成功的结果,那么其适用性的价值是毋庸置疑的。 最后,关于与不同贡献者使用的参考文献或源材料的有效性或合法性相关的争议,至关重要的是要承认彻底的记录、引文分析和评估过程通常构成合法科学出版物和演示的基本要求。 信誉和信誉值得最大的尊重和认可,
Connect this to a robot that has a real time camera feed. Have it constantly generate potential future continuations of the feed that it's getting -- maybe more than one. You have an autonomous robot building a real time model of the world around it and predicting the future. Give it some error correction based on well each prediction models the actual outcome and I think you're _really_ close to AGI.
You can probably already imagine different ways to wire the output to text generation and controlling its own motions, etc, and predicting outcomes based on actions it, itself could plausibly take, and choosing the best one.
It doesn't actually have to generate realistic imagery or imagery that doesn't have any mistakes or imagery that's high definition to be used in that way. How realistic is our own imagination of the world?
Edit: I'm going to add a specific case. Imagine a house cleaning robot. It starts with an image of your living room. Then it creates a image of your living room after it's been cleaned. Then it interpolates a video _imagining itself cleaning the room_, then acts as much as it can to mimic what's in the video, then generates a new continuation, then acts, and so on. Imagine doing that several times a second, if necessary.
reply