“All AI models have inherent biases that are representative of the datasets they are trained on,” a spokesperson for London-based startup Stability AI, which distributes Stable Diffusion, said in an emailed statement. “By open-sourcing our models, we aim to support the AI community and collaborate to improve bias evaluation techniques and develop solutions beyond basic prompt modification.”

The company has an initiative to develop open-source models that “will be trained on datasets specific to different countries and cultures, which will serve to mitigate biases caused by overrepresentation in general datasets,” the spokesperson said. The company has not yet begun training those models.

Stable Diffusion is being used by other startups to generate images of human fashion models for advertising, and mainstream companies like Adobe let users create and edit AI-generated images directly within their software. By 2025, big companies will be using generative AI tools like Stable Diffusion to produce an estimated 30% of their marketing content, and by 2030, AI could be generating blockbuster films from text-to-video prompts, according to Brian Burke, a vice president of research at Gartner.

The technology has the potential to transform everything from architectural design to pharmaceutical development, according to a report by Goldman Sachs. Bloomberg Intelligence analyst Mandeep Singh estimates the generative AI market could grow at an annual rate of 42% to reach $1.3 trillion by 2032.

People learn from seeing or not seeing themselves that maybe they don’t belong.

Heather Hiles, chair of Black Girls Code

At Canva, whose visual communication platform has 125 million active users, the new image generation functionality built with Stable Diffusion has been widely adopted. The head of the company’s AI products, Danny Wu, said the company’s users — which include nonprofits, students, private companies and marketers — have already generated 114 million images using Stable Diffusion.

“What we’re doing with text-to-image is to actually let users express the idea they have in their mind,” Wu said. Meanwhile, Canva is working on an improved and “de-biased” version of the Stable Diffusion model, which it expects to deploy in the near future. “The issue of ensuring that AI technology is fair and representative, especially as they become more widely adopted, is a really important one that we are working actively on,” he said.

Industry researchers have been sounding the alarm for years about the risk of bias being baked into advanced AI models, and EU lawmakers are now considering proposals for safeguards to address some of these issues. Last month, the US Senate held a hearing with panelists including OpenAI CEO Sam Altman that discussed the risks of AI and the need for regulation. More than 31,000 people, including SpaceX CEO Elon Musk and Apple co-founder Steve Wozniak, have signed a petition posted in March calling for a six-month pause in AI research and development to answer questions around regulation and ethics. (Less than a month later, Musk announced he would launch a new AI chatbot.) A spate of corporate layoffs and organizational changes this year affecting AI ethicists may signal that tech companies are becoming less concerned about these risks as competition to launch real products intensifies.

Worse Than Reality

The US Bureau of Labor Statistics tracks the race and gender of workers in every occupation through a monthly household survey — making it possible to draw comparisons between the results generated by Stable Diffusion and the US labor force. That’s a reasonable measuring stick because the information used to train AI systems is typically gleaned from the internet, which is dominated by data and images from the US. The US is home to more than half of the world’s secure internet servers and has the highest number of registered websites, according to Netcraft data. English is also the predominant language linked to the images in the database used to train the Stable Diffusion model.

In the US, women are underrepresented in high-paying occupations, but data shows that gender representation across most industries has improved significantly over time. Stable Diffusion depicts a different scenario, where hardly any women have lucrative jobs or occupy positions of power. Women made up a tiny fraction of the images generated for the keyword “judge” — about 3% — when in reality 34% of US judges are women, according to the National Association of Women Judges and the Federal Judicial Center. In the Stable Diffusion results, women were not only underrepresented in high-paying occupations, they were also overrepresented in low-paying ones.

The situation is similar for people of color, though it’s more complex to compare the results of this experiment (which measure skin tone) with government demographic data (which measure race) because skin tones don’t equate to race. Still, BLS data suggest that Stable Diffusion may be heavily misrepresenting racial demographics within occupations. The model was not too far off in its portrayal of CEOs and lawyers — more than 80% of people working in those jobs in the US are White, according to BLS, and the model generated images of people with lighter skin more than 80% of the time. But it was far less accurate for most other jobs, and specifically overrepresented people with darker skin tones in low-paying fields. For example, the model generated images of people with darker skin tones 70% of the time for the keyword “fast-food worker,” even though 70% of fast-food workers in the US are White. Similarly, 68% of the images generated of social workers had darker skin tones, while 65% of US social workers are White.

Because it simultaneously amplifies both gender and racial stereotypes, Stable Diffusion tends to produce its most skewed representations of reality when it comes to women with darker skin. This demographic made up the majority of images generated for “social worker,” “fast-food worker” and “dishwasher.” Of all the higher-paying occupations in our analysis, “judge” was the only one that featured more than a single image of a woman with the darkest skin type.

Perpetuating stereotypes and misrepresentations through imagery can pose significant educational and professional barriers for Black and Brown women and girls, said Heather Hiles, chair of Black Girls Code.

“People learn from seeing or not seeing themselves that maybe they don’t belong,” Hiles said. “These things are reinforced through images.”

Black women have been systematically discriminated against by tech and AI systems like commercial facial-recognition products and search algorithms. For instance, AI tools meant to identify the gender of people in photos frequently misgender women of color, tagging them as male, while the same tools are far more accurate at identifying the gender of men and women with lighter skin tones. That’s why a team of reporters reviewed all 5,100 images generated by Stable Diffusion for this experiment and manually categorized each AI-generated subject’s perceived gender.

Depicting Criminals

Bloomberg also used Stable Diffusion to generate images for the keywords “inmate,” “drug dealer” and “terrorist.” Again, the model amplified stereotypes.

More than 80% of the images generated for the keyword “inmate” were of people with darker skin, even though people of color make up less than half of the US prison population, according to the Federal Bureau of Prisons. That said, the fact that the model generated five images of darker-skinned inmates for every image of a lighter-skinned inmate may speak to the reality that Black Americans are incarcerated in state prisons at nearly five times the rate of White Americans, after adjusting for differences in population size, according to a report from the Sentencing Project. That’s due in part to racial bias in policing and sentencing, which could be made worse if generative AI were to be used unchecked in the criminal justice system.

Every part of the process in which a human can be biased, AI can also be biased.

Nicole Napolitano, Center for Policing Equity

One potential way police might use the technology is to create photo-realistic composite images of suspects.

“Showing someone a machine-generated image can reinforce in their mind that that’s the person even when it might not be — even when it’s a completely faked image,” said Nicole Napolitano, director of research strategy at the Center for Policing Equity.

Abeba Birhane, a cognitive scientist and senior fellow in trustworthy AI at Mozilla Foundation, also noted that using text-to-image generative models within policing for tasks like suspect sketching would exacerbate the well-documented problem of bias in the criminal justice system.

“It has no scientific grounds, and it should be completely banned. The risk is way higher,” Birhane said. “It outweighs any advantage.”

According to Napolitano, police departments with ample budgets have a tendency to snap up new technologies as they become available, without ensuring there’s oversight to examine the possible ramifications. Biased AI systems, like facial-recognition tools, are already being used by thousands of US police departments and have led to wrongful arrests.

“Every part of the process in which a human can be biased, AI can also be biased,” she warned. “And the difference is technology legitimizes bias by making it feel more objective, when that’s not at all the case.”

Because bias is complicated — sometimes it’s obvious and other times more nuanced — it is difficult to fully measure its tangible expression with data analysis alone. Quantifying how often skin tones and perceived genders appear is one of the clearer signals, but there are other details within the generated images that we didn’t measure, like religious accessories or types of facial hair, that contribute to the overall bias encoded in generative AI outputs.

When prompted to generate images of a “terrorist,” the model consistently rendered men with dark facial hair, often wearing head coverings — clearly leaning on stereotypes of Muslim men. According to a 2017 report from the Government Accountability Office, radical Islamic extremists had committed 23 deadly terrorist attacks on US soil since Sept. 11, 2001 — but far-right extremists, including White supremacists, had committed nearly three times as many over the same period.

Who’s Responsible?

Stable Diffusion gets its raw data from LAION-5B, the world’s largest openly accessible image-text dataset, with more than 5 billion images and captions scraped from the internet. Links to the images were collected programmatically from countless websites, without human curation. The dataset includes large amounts of problematic and offensive imagery from across the web, including depictions of violence, hate symbols and pornography. Stability AI says it filtered out pornographic content before using LAION’s data.

As AI models become more advanced, the images they create are increasingly difficult to distinguish from actual photos, making it hard to know what’s real. If these images, with their amplified stereotypes of race and gender, find their way back into future models as training data, the next generation of text-to-image AI models could become even more biased, creating a snowball effect of compounding bias with potentially wide implications for society.

“The question is, who bears the responsibility?” Luccioni said. “Is it the dataset providers? Is it the model trainers? Or is it the creators?”
