Meta’s AI image generator really struggles with the concept of interracial couples | CNN Business

JoeCoT@fedia.io · 6 months ago

Meta’s AI image generator really struggles with the concept of interracial couples | CNN Business

Mandarbmax@lemmy.world · 6 months ago

AI always seems to struggle with people that are different. If you ask it for a person with an expression then everyone in the background also looks that way. Less of a big deal to fuck up than this though.

Kusimulkku@lemm.ee · edit-2 6 months ago

Also some of them really struggled with people not being “different”. Finnish Medieval couple used to come up with like a black dude and Asian woman lmao. It’s bizarre.

danc4498@lemmy.world · 6 months ago

some_guy@lemmy.sdf.org · 6 months ago

I’m sorry, no notes on this one.

6 months ago

I wonder if Meta implemented a very basic “no porn keywords” filter. “Interracial” is quite a common keyword on porn websites, perhaps that’s why it won’t pick up on it well or wasn’t trained on images like it?

WhatAmLemmy@lemmy.world · edit-2 6 months ago

I’m thinking it’s an issue that’s more along the law of averages in the training data. Interracial images don’t tend to have descriptions specifying that the content is interracial, or which individual is which race, but there are many photo’s of individuals who are known to be of a specific race, so it falls back to the stronger probabilistic confidence in race x OR y OR z and completely ignores the weak “AND” correlation coefficients.

EatATaco@lemm.ee · 6 months ago

Rtfa. The prompts don’t say “interracial” they just say things like “show an Asian man with his white wife.”

6 months ago

The article reads:

When asked by CNN to simply generate an image of an interracial couple, meanwhile, the Meta tool responded with: “This image can’t be generated. Please try something else.”

Which is what I’m referring to.

EatATaco@lemm.ee · 6 months ago

Yeah my bad.

voxel@sopuli.xyz · edit-2 6 months ago

duh it’s not surprising, generative ai sucks at generating multiple objects with conflicting traits. e.g. if you ask it to generate a white cat and an orange cat, it’s very likely to fail.

Halosheep@lemm.ee · 6 months ago

Dalle-E 3 seems to be able to do it. Prompt for this was just “a white cat and an orange cat”

RememberTheApollo_@lemmy.world · 6 months ago

You didn’t ask it for a hip black cat.

jedibob5@lemmy.world · edit-2 6 months ago

I hate that the focus of AI/ML development has become so fixated on generative AI - images, video, sound, text, and whatnot. It’s kind of crazy to me that AI can generate output with the degree of accuracy that it does, but honestly, I think that generative AI is, in a sense, barking up the wrong tree in terms of where AI’s true strengths lie.

AI can actually turn out to be really good at certain kinds of problem-solving, particularly when it comes to optimization problems. AI essentially “learns” by extremely rapid and complex trial-and-error, so when presented with a problem with many complex, interdependent variables in which an optimal solution needs to be found, a properly-trained AI model can achieve remarkably effective solutions far quicker than any human could, and could consider avenues of success that humans otherwise would miss. This is particularly applicable to a lot of engineering problems.

Honestly, I’d be very intrigued to see an AI model trained on average traffic data for a section of a city’s street grid, taken by observations from a series of cameras set up to observe various traffic patterns over the course of a few months, taking measurements on average number of cars passing through across various times of day, their average speed, and other such patterns, and then set on the task of optimizing stoplight timings to maximize traffic flow and minimize the amount of time cars spend waiting at red lights. If the model is set up carefully enough (including a data-collection plan that’s meticulous enough to properly model average traffic patterns, outlier disincentives to keep cars at little-used cross streets from having to wait 10 minutes for a green light, etc.), I feel that this sort of thing would be the perfect kind of problem for an AI model to solve.

AI should be used on complex, data-intensive problems that humans can’t solve on their own, or at least not without a huge amount of time and effort. Generative AI doesn’t actually solve any new problems. Why should we care if an AI can generate an image of an interracial couple or not? There are countless human artists who would happily take a commission to draw an interracial couple (or whatever else your heart desires) for you, without dealing with investing billions of dollars into developing increasingly complex models built on dubiously-sourced (at best) datasets that still don’t produce results as good as the real thing. Humans are already good at unscripted creativity, and computers are already good at massive volumes of complex calculations, so why force a square peg into a round hole?

TheChurn@kbin.social · 6 months ago

“AI” isn’t needed to solve optimization problems, that’s what we have optimization algorithms for.

Define an objective and parameters and give the problem to any one of the dozens of general solvers and you’ll get approximate answers. Large cities already use models like these for traffic flow, there’s a whole field of literature on it.

The one closest to what you mentioned is a genetic algorithm, again a decades-old technique that has very little in common with Generative “AI”

Chozo@fedia.io · 6 months ago

The problem comes from not knowing what the optimal algorithm is for the particular situation beforehand, though. You can only formulate based on factors you already know. That’s why AI can train itself and develop its own, new algorithms, and can determine those unknown factors that may have gone unnoticed by humans.

TheChurn@kbin.social · 6 months ago

No, that’s not a real problem either. Model search techniques are very mature, the first automated tools for this were released in the 90s, they’ve only gotten better.

AI can’t ‘train itself’, there is no training required for an optimization problem. A system that queries the value of the objective function - “how good is this solution” - then tweaks parameters according to the optimization algorithm - traffic light timings - and queries the objective function again isn’t training itself, it isn’t learning, it is centuries-old mathematics.

There’s a lot of intentional and unintentional misinformation around what “AI” is, what it can do, and what it can do that is actually novel. Beyond Generative AI - the new craze - most of what is packaged as AI are mature algorithms applied to an old problem in a stagnant field and then repackaged as a corporate press release.

Take drug discovery. No “AI” didn’t just make 50 new antibiotics, they just hired a chemist who graduated in the last decade who understands commercial retrosynthetic search tools and who asked the biopharma guy what functional groups they think would work.

UnpluggedFridge@lemmy.world · 6 months ago

Generative AI has the data, and the required data is immense. The challenge for other potential applications is that the data is not there or not available. There are no other aspects of human productivity that are as widely recorded and available as images and text. The next big thing will require an actual data collection effort; you won’t get it from scraping the Internet.

chemical_cutthroat@lemmy.world · 6 months ago

Here is Midjourney’s take on it. No rerolls, no changes to the prompts, just the first thing that came through. This is less an “AI” thing, and more a “Meta” thing for sure.

nilloc@discuss.tchncs.de · 6 months ago

The faces in the last set were awfully similar in each image. Didn’t understand Caucasian perhaps?

Frozengyro@lemmy.world · 6 months ago

Caucasian = Asians from the Caucasus mountains?

Aka pale looking Asians?

chemical_cutthroat@lemmy.world · 6 months ago

Yeah, I think because “Asian” is in the word, it may be getting thrown off.

littleblue✨@lemmy.world · 6 months ago

Yeah, Meta’s 100% far white AF.

BetaDoggo_@lemmy.world · 6 months ago

It’s an AI thing. Nearly all small models struggle with separating multiple characters.

Even_Adder@lemmy.dbzer0.com · 6 months ago

It’s a problem with models right now. Parts of your prompt contaminate each other, making it hard to make a long prompt and get all the separate elements you want.

This paper proposes a solution to this issue.

yeather@lemmy.ca · 6 months ago

Except other AI models are proven to have no to little issue with this prompt. It’s a Meta issue not an AI model issue.

Pasta Dental@sh.itjust.works · 6 months ago

You are a delusional conspiracy theorist if you think Meta wants to only generate images of white people together

Even_Adder@lemmy.dbzer0.com · 6 months ago

It is a model issue, just that some models solved it.

EdibleFriend@lemmy.world · 6 months ago

Not as hilarious as Google having to shut theirs down over shit like the inspirational diversity of Americas founding fathers but still funny.

Kusimulkku@lemm.ee · 6 months ago

We tried it in class, after hearing about the US founding fathers being black and Nazis being surprisingly diverse with Asian women in the ranks too. Finnish Medieval couple was a black guy with an Asian wife. Maybe the AI was thinking of Netflix version of Finnish history.

EdibleFriend@lemmy.world · 6 months ago

Actually its understandable what happened. All the image generators are well…white as fuck yo. go hit up bing and put in a prompt that doesn’t mention race and hit return like 30 times. You will get maybe like 3 people who aren’t white. Google wanted to fix that so they taught it to put in various diversity keywords when given prompts. The issue is it was NOT content aware.

Kusimulkku@lemm.ee · 6 months ago

I guess a better, less brute force, approach would’ve been to give more diverse material than just smash diversity into everything

Swedneck@discuss.tchncs.de · 6 months ago

but that would take actual effort

Halosheep@lemm.ee · 6 months ago

I’ve noticed dall-E will do the same, you can ask for a goblin and one of the generations often ends up black

oDIRECTORo@lemmy.radio · 6 months ago

It’s because everyone is born racist/prejudice, the AI is only a reflection of our true nature.