(评论)
(comments)

原始链接: https://news.ycombinator.com/item?id=43415107

Hacker News一篇帖子总结如下:Sakofchit 推出了 palette.cam,一款“自主图像编辑器”,它可以根据调整后的元数据(位置/时间)重新生成照片,无需任何提示。其目标是在上下文环境下重新创建图像,同时保留原始图像中的主体,例如面部和体型,这是AI图像编辑中常见的难题。图库中提供了示例,并链接了一个演示。 用户对“自主”特性提出了疑问,并注意到生成的图像中发型和肢体存在随机变化。项目成员 Jchiu1234 解释说,系统会自主决定图像中哪些部分需要保留或更改,并使用扩散模型进行编辑。他们承认模型存在“奇怪的行为”,并将一些瑕疵归因于数据分布。用户 zoklet-enjoyer 指出,这与基于GPS和天气数据的AI相机类似。


原文
Hacker News new | past | comments | ask | show | jobs | submit login
Show HN: We built an agentic image editor that preserves the original structure (palette.cam)
10 points by sakofchit 1 hour ago | hide | past | favorite | 9 comments
Hi everyone,

I’ve been experimenting with app where you can edit images in your camera roll simply by tweaking your photo’s metadata (changing location/time) and our agent will contextually regenerate the photo in that place & time in one shot. There's no prompting involved.

One of the hardest problems we’ve seen with these ai image editing/creation tools is that they struggle with preserving the subjects of the original image (faces, genders, number of people, bodies, animals, etc), and I think we’ve gotten a step closer to making it feel more realistic.

The gallery has some examples that people have been regenerating. https://palette.cam/gallery

Here’s a demo: https://x.com/sakofchit/status/1900274636522193067

Feel free to dm me on Twitter: https://twitter.com/sakofchit if you’d like to try out the TestFlight in the meantime

Would love to know what y'all think!











That's a really cool idea. Reminds me of that AI camera someone made where it generates an image prompt based on, I think, GPS and weather data


Interesting project. What makes this an agent? Just looks like an image transform that uses LLMs.


Good question, we designed a system that looks across the image and chooses which parts of the image to be preserved and which to be changed. And, if a region is selected to change, how should it be changed.

This is all done autonomously and the decision-maker is an agent.



Why does it make random changes, like moving limbs, changing hairstyles (hilariously leaving a black person with dreads, but removing them from the white person beside them :facepalm:) etc?


The diffusion model(s) the agent leverages to edit certain parts of the image are notorious for exhibiting weird behaviors (of course, we will improve these as we progress).

I'll let you figure out why it does some weird things regarding your comment (data distribution).



Need


this is a pretty dope app!


Forgot to switch accounts?


No, lol, I was forced to make an account for this. Mostly just a joke






Join us for AI Startup School this June 16-17 in San Francisco!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact



Search:
联系我们 contact @ memedata.com