Olivier Reintjes

No ratings yet.

“Generate an image of a man holding a red helium balloon in his left hand”

This was the prompt I asked OpenAI’s Image Generator DALL-E 2, 2 years ago. The result? A perfect shiny red helium balloon on a string with a man in a blue shirt holding it with two… things.

On a closer inspection I could make up that it, in fact, tried to make two hands, but it had failed miserably. The fingers were merging into eachother and his hands were also connected while holding the string of the balloon. Poor guy.

When I tried this again, the same thing happened. A beautiful image of a man holding a red shiny balloon but it again had failed at generating a realistic image of the hands. So why is it that these AI-Image Generators oftentimes struggle so much with generating hands and fingers?

Back in 2022 when these models were released, people were amazed by the images that these generators were spitting out. The images were so advanced that in August of 2022, the Image Generator Model from the company Midjourney won an art contest at a state fair with one of its images.
But users quickly started to notice a recurring bug. Everytime when a prompt included people, the AI tools couldn’t draw hands. Hands with 7 fingers, hands that appeared to be floating, unattached to the human body or, in my case, hands that were fused together at the wrists. But why?

The simple answer? Its hard to draw hands! Just like beginning (human) artists, AI struggles with hands because a hand is a very complex part of our body. It has multiple elements of varying shapes and sizes. In addition to that, its structure is incredibly intricate. Hands are built from parts that fit together with perfect precision. Fingers, palms, joints, and tendons all connect in fixed patterns. To draw them well, you must study how these parts move and align.

The case with AI models is that they learn by finding patterns in data. They do not actually understand structure as humans do. A human artists learns through observation and reasoning, the models learn through repetition in data. In addition to that, to reallisticly capture how the hand deforms during various hand movements, algorithms need to understand how our joints function and the range of motion each of them capture. Which is increadibly hard and tedious to train.

However, as time passes there are definitly improvements being made. Images with 6 fingers or hands fused into each other are less common now. That is because new models use larger datasets with millions of hand examples. They learn cleaner shapes, smoother joints, and more natural poses. The improvement isn’t magic. It comes from better training methods and higher quality references. Still, the model doesn’t understand anatomy. It predicts what looks right, not what is right. You can see progress, but the small mistakes remind you that pattern recognition is not comprehension.

And about that image I mentioned earlier with the prompt of a man holding a red balloon, here it is:

Sources:

Avenga. (2025, 18 juni). Why Generative AI Models Fail at Creating Human Hands – Avenga. https://www.avenga.com/magazine/generative-ai-models-fail-at-creating-human-hands/#:~:text=So%2C%20Why%20Do%20Image%20Generators,may%20take%20it%20for%20granted.

Matthias, & Meg. (2023, 25 augustus). Why does AI art screw up hands and fingers? | Explanation, Tools, & Facts. Encyclopedia Britannica. https://www.britannica.com/topic/Why-does-AI-art-screw-up-hands-and-fingers-2230501

5/5 (1)

The biggest news in AI this year isn’t a new model or app, its infrastructure. Just yesterday, OpenAI announced that it will expand their “Project Stargate”, A plan to build colossal data centers across the U.S, with an additional 5 new data centres. Pushing the cost of its AI infrastructure project to about $400 billion dollars (Hammond, 2025). These facilities, often described as “AI factories”, are designed to run future generations of artificial intelligence at a scale the world has never seen.

The economics are simple. Lots of upfront costs, but very low marginal costs when the system is in place (Kenton, 2025). Building a data center stuffed with GPUs and advanced cooling systems costs tens of billions. But once the investment is made, the cost of running one more AI query is almost negligible. The more queries, models, and customers using Stargate, the cheaper it becomes per unit. That’s economies of scale in its purest form, and it explains why OpenAI is betting that size equals survival in the AI race.

Still, scale has its risks. If demand for AI plateaus or grows slower than expected, those massive data centres could sit underused, turning scale from an advantage into a liability. And even if the demand is there, this type of scale brings side effects: massive energy consumption, huge amounts of water used for cooling, and heavy reliance on global chip supply chains. If things don’t pan out the way OpenAI anticipates, the efficiencies of scale for Project Stargate could translate into higher costs for society.

My take: Stargate is both visionary and dangerous. On one hand, it could make advanced AI affordable, accelerating innovation across industries. On the other, it concentrates technological and economic power into very few hands, while massively contributing to environmental costs. Economies of scale may make AI cheaper, but the real question is: cheaper for whom, and at what price for everyone else?

References:

Hammond, G. (2025, 23 september). OpenAI expands Stargate AI project with five US sites. Financial Times. https://www.ft.com/content/9b6c7db8-9a14-4261-9c18-38ec84d869a0?

Kenton, W. (2025, 18 juni). Economies of Scale: What Are They and How Are They Used? Investopedia. https://www.investopedia.com/terms/e/economiesofscale.asp

OpenAI. (2025, januari). Announcing the Stargate project. https://openai.com/index/announcing-the-stargate-project/

AI Can Paint Anything… Except Hands

9

Please rate this

Stargate: The 400 Billion-Dollar Bet on AI Economies of Scale

24

References:

Please rate this