152
submitted 1 day ago by 101@feddit.org to c/technology@lemmy.world
you are viewing a single comment's thread
view the rest of the comments
[-] tal@lemmy.today 22 points 1 day ago* (last edited 1 day ago)

looks dubious

The problem here is that if this is unreliable -- and I'm skeptical that Google can produce a system that will work across-the-board -- then you have a synthesized image that now has Google attesting to be non-synthetic.

Maybe they can make it clear that this is a best-effort system, and that they only will flag some of them.

There are a limited number of ways that I'm aware of to detect whether an image is edited.

  • If the image has been previously compressed via lossy compression, there are ways to modify the image to make the difference in artifacts in different points of the image more visible, or -- I'm sure -- statistically look for such artifacts.

  • If an image has been previously indexed by something like Google Images and Google has an index sufficient to permit Google to do fuzzy search for portions of the image, then they can identify an edited image because they can find the original.

  • It's possible to try to identify light sources based on shading and specular in an image, and try to find points of the image that don't match. There are complexities to this; for example, a surface might simply be shaded in such a way that it looks like light is shining on it, like if you have a realistic poster on a wall. For generation rather than photomanipulation, better generative AI systems will also probably tend to make this go away as they improve; it's a flaw in the image.

But none of these is a surefire mechanism.

For AI-generated images, my guess is that there are some other routes.

  • Some images are going to have metadata attached. That's trivial to strip, so not very good if someone is actually trying to fool people.

  • Maybe some generative AIs will try doing digital watermarks. I'm not very bullish on this approach. It's a little harder to remove, but invariably, any kind of lossy compression is at odds with watermarks that aren't very visible. As lossy compression gets better, it either automatically tends to strip watermarks -- because lossy compression tries to remove data that doesn't noticeably alter an image, and watermarks rely on hiding data there -- or watermarks have to visibly alter the image. And that's before people actively developing tools to strip them. And you're never gonna get all the generative AIs out there adding digital watermarks.

  • I don't know what the right terminology is, but my guess is that latent diffusion models try to approach a minimum error for some model during the iteration process. If you have a copy of the model used to generate the image, you can probably measure the error from what the model would predict -- basically, how much one iteration would change an image or part of it. I'd guess that that only works well if you have a copy of the model in question or a model similar to it.

I don't think that any of those are likely surefire mechanisms either.

[-] SchmidtGenetics@lemmy.world -2 points 1 day ago

I guess this would be a good reason to include some exif data when images are hosted on websites, one of the only ways to tell an image is true from my little understanding.

[-] CatsGoMOW@lemmy.world 3 points 1 day ago
[-] SchmidtGenetics@lemmy.world -3 points 1 day ago* (last edited 1 day ago)

I guess, but the original image would be somewhere to be scraped by google to compare and see an earlier version. Thats why you don’t just look at the single image, you scrape multiple sites looking for others as well.

Theres obviously very specific use cases that can take advantage of brand new images that are created on a computer, but theres still ways of detecting that with other methods as explained by the user I responded to.

[-] CatsGoMOW@lemmy.world 0 points 1 day ago

It seems like you’re assuming that file modified times are fixed…? Every piece of metadata like that can be altered. If you took a picture and posted it somewhere, I could take it and alter it to my liking, then add in some fake exif data as well as make it look like I modified the image before your actual original version.

You can’t use any of that metadata to prove anything.

[-] SchmidtGenetics@lemmy.world 0 points 1 day ago* (last edited 1 day ago)

No, but it seems like you’re assuming they would look at this sandboxed by itself…? Of course there is more than one data point to look at, when you uploaded the image would noted, so even if you uploaded an image with older exif data, so what? The original poster would still have the original image, and the original image would have scraped and documented when it was hosted. So you host the image with fake data later, and it compares the two and sees that your fake one was posted 6 months later, it gets flagged like it should. And the original owner can claim authenticity.

Metadata provides a trail and can be used with other data points to show authenticity when a bad actor appears for your image.

You are apparently assuming to be looking at a single images exif data to determine what? Obviously they would use every image that looks similar or matches identical and use exif data to find the real one. As well as other mentioned methods.

The only vector point is newly created images that haven’t been digitally signed, anything digitally signed can be verified as new, unless you go to extreme lengths to fake and image and than somehow recapture it with a digitally signed camera without it being detected fake by other methods….

load more comments (18 replies)
load more comments (20 replies)
this post was submitted on 18 Sep 2024
152 points (94.7% liked)

Technology

58135 readers
4674 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS