Show HN: We built an agentic image editor that preserves the original structure

zoklet-enjoyer · 2025-03-19T19:26:53 1742412413

That's a really cool idea. Reminds me of that AI camera someone made where it generates an image prompt based on, I think, GPS and weather data

kyt · 2025-03-19T19:00:08 1742410808

Interesting project. What makes this an agent? Just looks like an image transform that uses LLMs.

jchiu1234 · 2025-03-19T19:04:11 1742411051

Good question, we designed a system that looks across the image and chooses which parts of the image to be preserved and which to be changed. And, if a region is selected to change, how should it be changed.

This is all done autonomously and the decision-maker is an agent.

zellyn · 2025-03-19T19:05:33 1742411133

Why does it make random changes, like moving limbs, changing hairstyles (hilariously leaving a black person with dreads, but removing them from the white person beside them :facepalm:) etc?

jchiu1234 · 2025-03-19T19:14:01 1742411641

The diffusion model(s) the agent leverages to edit certain parts of the image are notorious for exhibiting weird behaviors (of course, we will improve these as we progress).

I'll let you figure out why it does some weird things regarding your comment (data distribution).

MidhaelBollox · 2025-03-19T18:14:29 1742408069

jchiu1234 · 2025-03-19T17:57:03 1742407023

this is a pretty dope app!

rafram · 2025-03-19T19:26:29 1742412389

Forgot to switch accounts?

jchiu1234 · 2025-03-19T19:29:25 1742412565

No, lol, I was forced to make an account for this. Mostly just a joke

（评论） (comments)

（评论）
(comments)