AI models collapse when trained on recursively generated data - Nature

Stopthatgirl7@lemmy.world · 5 months ago

TootSweet@lemmy.world · 5 months ago

So one potentially viable way to destroy AI would be to repeatedly train LLMs and image generators on their own (or rather previous generations’) output until it degrades into junk training data, and then publish the resulting text and images in places where bots trawling for training data are likely to find them.

Probably bonus points if the images still look “sensical” to the human eye, so that humans eyeballing the data don’t realize it’s the digital equivalent of a sabot. (Apparently the story about sabots being thrown into machinery is not true, but you know what I mean.)
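For a rough feel of the loop described above, here is a toy sketch in Python. It is not the paper's experiment and involves no LLM: it assumes the "model" is just a Gaussian that is refit, generation after generation, to a small sample drawn from the previous generation's fit. The function name `simulate_collapse` and its parameters are made up for illustration.

```python
import random
import statistics


def simulate_collapse(generations=200, sample_size=10, seed=0):
    """Refit a Gaussian to its own samples and record the spread each round.

    Toy stand-in for recursive training: every generation only ever sees
    data produced by the generation before it.
    """
    rng = random.Random(seed)
    mu, sigma = 0.0, 1.0  # generation 0: the "real" data distribution
    history = [sigma]
    for _ in range(generations):
        # Draw a small synthetic dataset from the current model.
        samples = [rng.gauss(mu, sigma) for _ in range(sample_size)]
        # "Train" the next model by fitting mean and spread to that dataset.
        mu = statistics.fmean(samples)
        sigma = statistics.pstdev(samples)
        history.append(sigma)
    return history


if __name__ == "__main__":
    spread = simulate_collapse()
    print("initial spread:", spread[0])
    print("final spread:  ", spread[-1])
```

With small samples the fitted spread tends to drift toward zero over many generations, so later "models" forget the tails of the original distribution. Real models are far more complicated, so treat this only as an intuition pump.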

Admiral Patrick@dubvee.org · 5 months ago

I already block all the LLM scraper bots via user agent.

I’ve been toying with the idea of, instead of returning 404 for those requests, returning LLM-generated drivel to poison the well.
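A minimal sketch of what that could look like, in Python. Everything here is an assumption for illustration: the user-agent tokens are a short, non-exhaustive sample, the helper names (`is_llm_scraper`, `make_drivel`, `respond`) are hypothetical, and the "drivel" is plain random word salad standing in for actual LLM-generated text.

```python
import random
from typing import Optional, Tuple

# Illustrative, non-exhaustive list of crawler user-agent tokens (assumption).
SCRAPER_UA_TOKENS = ("GPTBot", "CCBot", "ClaudeBot")

# Word pool for the filler text; a stand-in for LLM-generated output.
FILLER_WORDS = (
    "lorem", "ipsum", "synergy", "quantum", "banana", "paradigm",
    "therefore", "moist", "algorithm", "however", "teapot", "blockchain",
)


def is_llm_scraper(user_agent: str) -> bool:
    """Return True if the User-Agent header contains a known scraper token."""
    ua = user_agent.lower()
    return any(token.lower() in ua for token in SCRAPER_UA_TOKENS)


def make_drivel(n_words: int = 50, seed: Optional[int] = None) -> str:
    """Build a string of n_words randomly chosen filler words."""
    rng = random.Random(seed)
    return " ".join(rng.choice(FILLER_WORDS) for _ in range(n_words))


def respond(user_agent: str, real_body: str) -> Tuple[int, str]:
    """Serve junk with a 200 to matched scrapers, the real page to everyone else."""
    if is_llm_scraper(user_agent):
        return 200, make_drivel()
    return 200, real_body
```

In practice the check would sit in the web server or a reverse-proxy rule rather than in application code, and matching on user agent only catches bots that identify themselves honestly.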

Amanda@aggregatet.org · 5 months ago

This is a really good idea, actually.

snooggums@midwest.social · 5 months ago

“train LLMs and image generators on their own (or rather previous generations’)”

AIncest!

lemmyng@lemmy.ca · 5 months ago

Deep fried AI.