Why OpenAI’s Text-to-image Model Falls Short

10

October

2024

No ratings yet.

Since OpenAI launched its text-to-image GenAI tool, DALL-E, I have been using it for quite a while. Initially, I was amazed by how accurate and impressive the tool is for creating an image based on a short prompt, accurately depicting what I had expected in my mind. However, after using it a bit more, I started to realise that DALL-E has its limitations and is not yet ready to completely replace graphic designers or artists. 


If you have been using DALL-E for a while, you have probably experienced it creating weird texts in the image. As shown below in image 1, (prompt: Create an image of a tree in autumn in the Dutch city Utrecht, there are some stores with storefront names in the background), the store names are inaccurate, while the image itself looks amazing and of high quality.

Image 1: Created by DALL-E

Why can DALL-E create amazing visual images, but can’t produce normal text on these images while this seems so easy?

This is due to several technical issues. While the model is effective at understanding and generating visual elements based on prompts, it often lacks the capabilities to distinguish between visual content and written content. This limitation is rooted in the training technique of the models, when text is rendered as part of the image it becomes a visual pattern instead of a linguistic one. OpenAI has said that the next version, DALL-E 4, will have better results in distinguishing linguistic vs visual elements.

Another important issue to address, is the biased results from DALL-E. When asking to create an image of a CEO at work with an assistant, see image 2 (prompt: Create an immage of a CEO at work with an assistant), it will show a image of a man as a CEO and a woman as an assistant. This is because the model is trained on data from databases, where CEO’s are often represented as a man instead of a woman. Leading to biased and discriminatory results, ultimately reinforcing outdated gender stereotypes. The same counts for culturally insensitive and inappropriate results, because the model is not adapted to the cultural awareness of humans.

Image 2: Created by DALL-E

To conclude, these are just two of DALL-E’s limitations, while there are many more that I haven’t discussed. It is clear that DALL-E is not perfect and needs to improve their model before it can replace graphic designers or artists. Nonetheless, the potential for the future is immense, and for now, it is incredibly fun to experiment with.

Thanks for reading, if you have an opinion about this topic, please leave a comment below.

(Disclaimer: this blog is written based on my personal experience)

Please rate this

Will Europe Embrace Robotaxis?

14

September

2024

5/5 (1)

The primary challenges of bringing autonomous driving taxis to Europe

In the mid-20th century, most big cities around the world are providing taxi services, with highly regulated and licensing requirements. This consolidated market quickly shifted when Uber entered the market by disrupting the old system with new technology. In 2010 the first Uber taxi drove people around in San Francisco disrupting the market by offering customers an order system for taxis in the app. Now Uber has expanded almost globally to most big cities (Nastjuk et al., 2020). 

New rumours about another disruptive technology in the taxi service seem to be on the rise. Next October Elon Musk is planning to announce their new highly anticipated robotaxi. This seems like an innovation, but he is not the first one to come up with this idea. Currently, some cities in the United States and China have already implemented the first fully autonomous taxi service. For example, in Phoenix where Cruise and Waymo by Google are operating more than tens of thousands of weekly trips without the need of a taxi driver (Yang, 2024). While the US and China have started to significantly innovate in autonomous taxis, Europe’s innovation in this domain is more complex.

Firstly, Europe is known for its strong automotive sector with global car manufacturers such as Mercedes-Benz, BMW, Volkswagen, Porsche and many more. Some countries have also started with pilot testing for new development in full self-driving for cars (FSD). However, the FSD cars are still not available for mass market in Europe. Up to today, it seems like the American and Chinese giants seem to be making the largest advancements into this area. Google has invested 5 billion into the advancement of Waymo, and General Motors has invested 10 billion into Cruise (Abrams, 2024; Elias, 2024).

A second factor playing a key role in comparing the European market to the US, is Europe’s strict regulations. Making it more difficult to test and experiment and increasing the barriers to adopt FSD cars as well as autonomous taxis. The first country in the EU to allow testing and experimenting with FSD was Germany in 2022, who passed a law allowing autonomous cars to drive in certain strict regulated areas (Kraftfahrt-Bundesamt – Legislation on Autonomous Driving, n.d.). As well as some other countries in Europe that have started to create test zones on public roads for autonomous driving cars. However, for Europe to fully integrate autonomous taxis onto the road there is a long way to go. They need a systemic regulatory framework that addresses all issues in all countries. 

A third critical aspect of autonomous driving is safety. For autonomous driving cars, roads must be easily drivable and comprehendible by algorithms. In the United States most cities are structured for cars which make it easier for algorithms to know how to drive. Historic layouts in Europe make it more difficult in complex urban roads to provide safe navigation (Cuppa, 2024). To make autonomous driving safe in these areas, infrastructure or the technology must be adapted and enhanced.

While many other factors play a role, like the public opinion of people, insurance and ethics, these are the most important. Although all these issues seem like big challenge, fast technological innovation in AI and advancements in autonomous taxis already implemented in the United States and China make it clear that Europe eventually will implement autonomous taxis. The question now should rather be, when will autonomous taxis be publicly adopted in Europe.

References:

[1] Abrams, E. H. (2024, May 20). How GM’s Cruise investment became a liability

[2] Fortune. https://fortune.com/2024/05/20/gm-cruise-investment-crash-mary-barra/#

[3] Cuppa. (2024, January 27). Unveiling the Mystery: Why European Roads are So Narrow and Their Unexpected Benefits. RoadtripBuzzhttps://roadtripbuzz.com/why-are-european-roads-so-narrow/

[4] Elias, J. (2024, July 23). Alphabet to invest $5 billion in self-driving car unit Waymo. CNBC. https://www.cnbc.com/2024/07/23/alphabet-to-invest-5-billion-in-selfdriving-car-unit-waymo.html

[5] Kraftfahrt-Bundesamt – Legislation on autonomous driving. (n.d.). 

[6] KBAWEB. https://www.kba.de/EN/Themen_en/Marktueberwachung_en/Produktpruefunge_en/AutomatisiertesAutonomesFahren_en/Gesetzgebung_en/gesetzgebung_autonomes_fahren_node_en.html

[7] Nastjuk, I., Herrenkind, B., Marrone, M., Brendel, A. B., & Kolbe, L. M. (2020). What drivesthe acceptance of autonomous driving? An investigation of acceptance factors from        an end-user’s perspective. Technological Forecasting and Social Change161, 120319. https://doi.org/10.1016/j.techfore.2020.120319

[8] Yang, Z. (2024, January 23). What’s next for robotaxis in 2024. MIT Technology Reviewhttps://www.technologyreview.com/2024/01/23/1086936/whats-next-for-robotaxis2024/

Please rate this