A man taking a mask off

AI Needs to Be Unmasked or We Will Create a Bubble

I was sitting with a good mate of mine in a cafe early one morning during my recent trip to the USA, doing some reading, enjoying pretty mediocre coffee, and basically just shooting the breeze. Since he's also interested in IT, the conversation somehow ventured onto AI, and during our deliberations he sent me a few images and memes to laugh at and consider.

I would like to share three of them with you now because they’re about a subject that I’m pretty passionate about: how AI is over-hyped. Indeed, I’ve already written a few posts on this topic (e.g. the aptly titled post Artificial Intelligence is Over-Hyped) but it’s such a pertinent issue today that I just have to write more.

Hype Measurements

Here is image #1:

The graphs above are supposed to show where AI is in terms of hype (measured by the number of Google searches at a given time) compared to other technologies that also underwent huge growth in the eyes of the general population.

Now, firstly, a disclaimer. Any decent scientist will notice that the graphs aren't perfect. For one, the y-axis is not labelled (the x-axis has its label at the bottom of the last two graphs, i.e. months relative to peak). But one can easily ascertain that the y-axis is the search interest relative to peak, normalised against itself – meaning that a peak in AI could in theory be extremely small compared to the peak for Metaverse. In other words, these graphs are like taking a close-up photo of an ant mound so that it fills the entire picture, and also taking a photo of Mt Everest from far away so that it, too, fills the entire picture. We've zoomed in on the ant mound and zoomed out on Mt Everest, but the slopes of the two take up the same space in the pictures because each is scaled relative to itself. So, in theory, one could be comparing ant mounds to mountains.

Our experience, however, tells us this is not so because each of the trends depicted above has been significant, so we can safely state that we are comparing mountains to mountains in our analyses. In fact, here's an article from Yahoo Finance discussing just this: ‘AI’ is trending in Google searches — but it’s not yet to the peak reached by Bitcoin in 2017.

The other problem with the image is that I could not find its original creator. Full disclosure: I simply cannot verify the reliability of these calculations because I do not know how credible the source is. We can assume that the data came from Google Trends and that it's a very recent creation, but that's about it.

Once again, however, from experience and intuition, we know that these technologies underwent the trends depicted in these graphs. Not everything may be accurate (it probably is, though), but even if only for illustrative purposes, I think the image paints a good picture of where we currently stand.

Some numbers

So, where do we currently stand? Markets are going absolutely crazy, that’s where we stand! We all know about ChatGPT so it’s no surprise that Microsoft pumped US$10 billion into OpenAI in January this year. That makes sense. However, what has me slightly worried is the money flowing into startups.

Let’s look at some numbers (in US$) just from this year alone:

  • Anthropic, a direct rival of OpenAI, has received at least $1.75 billion this year with a further $4.75 billion available in the near future,
  • Inflection AI raised $1.3 billion for its very own chatbot called Pi,
  • Abound raked in $600 million for its personal lending platform,
  • SandboxAQ got $500 million for its idea to combine quantum sensors with AI,
  • Mistral AI raised $113 million in June despite it being only 4 weeks old at the time and having no product at all to speak of. Crazy.
  • and the list goes on…

Yeah, incredible amounts of money are being invested in AI. Remember, the numbers above are for this year alone and just for startups.

I have nothing against large investments and copious amounts of money being thrown around per se, if there is justification for it. With respect to AI, there certainly is. The advances we're seeing are incredible. Truly. Many times in the last 7 years I've stepped away from my machine to take in some new technology that had just been announced. I particularly recall when I first saw DALL-E presented by OpenAI in January 2021 – I was in awe. And look at how far we've come since then!

We can really take these technologies far. I’m not denying this, and I hope that we do. I love technology (as long as it’s used for good, of course).

Superstitions surrounding AI

What I am worried about, however, are the false beliefs that underlie a lot of the spending being done. This is what has me concerned and this is why I believe AI is over-hyped and that we could be witnessing an AI bubble growing right in front of us – just like what we saw with blockchain, crypto, and the metaverse, as depicted in Image #1 above.

The agenda being pushed by the industry (because they’re the ones making the money) and by the media (because this is what sells papers) is that AI’s capabilities are much greater than they truly are.

For example, I've written about sensationalist media reporting on AI in a previous post of mine. There I picked apart a BBC article entitled “AI robot asked ‘will you rebel against humans’?”, because “AI robot asked…” is an utterly ridiculous take on the issue at hand. The BBC makes it out that AI is a monolithic, single product with some kind of collective, decade-spanning consciousness, meaning that improvements made to AI 10 years ago can be seen in each and every single AI entity today. Therefore, if I ask a robot an important question now, its answer will have validity for today and the future.

More importantly, however, an agenda is being pushed that AI has some form of understanding underlying its operations.

Allow me (I think for the 3rd time on this blog) to quote myself again:

I’ve written about this lack of understanding in AI before so I don’t want to repeat myself. What I do want to emphasise in this context is that AI is constantly being sold to us as something mystical, as something esoteric, as something to capture our imaginations. Yes, AI is good – but underneath it all is just a computer algorithm.

Unmasking AI

And this leads me to the last two images that my friend sent me that time at the cafe:

Rick Sanchez unmasking AI

Folks, at the bottom of it all that’s all that AI is. Even with Deep Learning, those neurons are all simple, nested if-else statements. There are billions of them, true, but there’s nothing magical about anything. Listen to Rick Sanchez! He definitely knows what he’s talking about.
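If you've never peeked under the hood, here's roughly what a single artificial "neuron" boils down to in code. This is a toy sketch of my own, not lifted from any real framework, but the shape is right: a weighted sum followed by a humble if-else (the so-called ReLU activation).

```python
# A toy artificial "neuron": weight the inputs, add them up,
# then apply a simple if-else (the ReLU activation).
def neuron(inputs, weights, bias):
    total = sum(i * w for i, w in zip(inputs, weights)) + bias
    if total > 0:
        return total   # the neuron "fires"
    else:
        return 0.0     # the neuron stays quiet

print(neuron([0.5, -1.2, 3.0], [0.8, 0.1, -0.4], bias=0.2))
```

Deep learning stacks billions of these on top of each other, but no single one of them is doing anything mystical.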

Sometimes I wonder what would happen if one of those images was shown during a board meeting when discussions were taking place about spending billions on this or that AI technology. Maybe those execs would see things a little more clearly and with a cooler head with a meme or two thrown in their faces?

The AI Bubble

So, are we in a bubble? With the spending that’s going on, with the way the media is lapping all this hype up, with the way AI products are being sold to us, yes, I believe we are.

There is a famous story about JFK’s father and how he knew it was time to get out of the stock market. It happened when he received investment tips from a shoeshine boy. He knew then that the mania was real and that there was a bubble about to burst. Joe Kennedy sold up, and shortly after the Great Crash of 1929 occurred. Since then the “shoeshine boy” has been a metaphor for “time to get out”.

So, I wonder whether we're seeing comparable phenomena in our own time amongst the general public (i.e. the shoeshine boys) when they discuss AI in their cafes and gyms. There is a buzz around AI, for sure.

However, I don’t think we’re at that moment yet – but it sure as hell is starting to stink!

One problem is that I don't think we've reached the peak of what the current engine driving this hype (i.e. deep learning) is capable of. We can still push it further, so as long as progress keeps coming, the hype will continue. However, there really is a limit to how big these models can get (as I've discussed before here and here). And then, perhaps, reality will kick in and quite possibly the bubble will burst.

Indeed, my gut just tells me that there is simply too much money being currently exchanged for a product I am intimately familiar with. I know that more is being promised than can be delivered.

We will just have to wait and see how this all pans out. We will have to keep our ears open for "shoeshine boy" moments. But care and prudence are paramount. And memes. Memes always cut through to the truth.

Once again, listen to Rick Sanchez, folks!



A wolf howling in the moonlight

No Silver Bullet for AI and Programming

This post has been inspired by questions I’ve been seeing appearing on Quora and Reddit recently regarding AI and programming. They all sound more or less like this:

In this age of advanced AI, is it still worth learning to code computer programs?

People have seen the incredible ability of chatbots such as ChatGPT to generate computer code, and they're starting to ask whether computer programming may become obsolete for humans in the near future.

This is a seemingly legitimate concern for those outside of computer science and for those unacquainted with the art of software engineering. If AI is able to write simple code now, it figures that in the future it’s only going to get better and better at this task until we won’t need humans to do it any more. So, why bother studying computer programming now?

But for those in the industry of software engineering the answer is dead simple. I’ll let Frederick Brooks, legendary computer scientist, provide us with a succinct response:

Software work is the most complex that humanity has ever undertaken.

Fred Brooks (I couldn’t find the original source of this quote – apologies)

Indeed, anybody who has ever worked in software engineering automatically grasps the immensity of the task and knows that AI has a long way to go before it supplants human workers. Fred Brooks in fact wrote a seminal essay in 1986 on the complexity of software engineering entitled "No Silver Bullet—Essence and Accidents of Software Engineering". This is one of those classic papers that every undergraduate student in Computer Science reads (or should read!) as part of their curriculum. Despite it being written in the 80s, most of what Brooks talks about, incredibly, still holds true (like I said, the man is a legend – computer scientists would also know him from his famous book "The Mythical Man-Month").

In “No Silver Bullet” Brooks argues that there is no simple solution (i.e. no silver bullet) to reduce the complexity of writing software. Any advances made in the trade don’t tackle this inherent (“essential”) complexity but solve secondary (“accidental”) issues. Whether it be advances in programming languages (e.g. object-oriented languages), environments (IDEs), design tools, or hardware (e.g. to speed up compiling) – these advances tackle non-core aspects of software engineering. They help, of course, but the essence of building software, i.e. the designing and testing of the “complex conceptual structures that compose the abstract software entity,” is the real meat of the affair.

Here is another pertinent quote from the essay:

I believe the hard part of building software to be the specification, design, and testing of this conceptual construct, not the labor of representing it and testing the fidelity of the representation. We still make syntax errors, to be sure; but they are fuzz compared to the conceptual errors in most systems. If this is true, building software will always be hard. There is inherently no silver bullet.

Frederick Brooks, No Silver Bullet

Autonomous cars are perhaps a good analogy to use here to illustrate my point. Way back in 2015 things were looking good as AI was advancing. Below is a story from the Guardian with a prediction made by BMW:

The Guardian article predicting autonomous cars by 2020

That has not materialised for BMW.

You can’t talk about autonomous cars without mentioning Elon Musk. Elon has predicted for nine years in a row, starting in 2014, that autonomous cars are at most a year away from mass production. I’ll say that once again: for nine years in a row, Elon has publicly stated that full self-driving (FSD) cars are only just around the corner. For example:

2016:

My car will drive from LA to New York fully autonomously in 2017

It didn’t happen. 2019:

I think we will be feature-complete full self-driving this year… I would say that I am certain of that. That is not a question mark.

It didn’t happen. 2020:

I remain confident that we will have the basic functionality for level five autonomy complete this year… I think there are no fundamental challenges remaining for level five autonomy.

It didn’t happen. 2022:

And my personal guess is that we’ll achieve Full Self-Driving this year, yes.

That didn’t happen either. And in 2023 FSD is still in Beta mode with stacks of complaints piling up on internet forums regarding its unreliability.

Another story fresh off the blocks comes from San Francisco. Last month, two rival taxi companies (Waymo and Cruise) were given permission to operate their autonomous taxi fleets in the city 24/7. A week later, Cruise was ordered to cut its fleet by half as the city investigates two crashes involving its vehicles. One of these crashes was with a fire truck driving with its lights and sirens blaring. Reportedly, the taxi failed to handle the emergency situation appropriately (an edge case?). This incident followed directly from a hearing on August 7 in which the San Francisco fire chief, Jeanine Nicholson, warned the city about autonomous taxis, citing 55 incidents.

The thing I'm trying to illustrate here is that autonomous driving is an example of a task that was once thought to be assailable by AI but over time has simply proven to be a much harder use case than expected. Heck, even Elon Musk admitted this in June 2022: "[developing self-driving cars was] way harder than I originally thought, by far." AI is not omnipotent.

So, if we combine this example with Brooks's observation that "software work is the most complex that humanity has ever undertaken," it follows that we're still a long, long way off from automating the process of software engineering with AI.

Will AI disrupt the coding landscape? Yes. It’s capable of doing really nifty things at the moment and should improve on these abilities in the near future.

Will AI take over coding jobs? Yes. But only some. The core of software engineering will remain untouched. The heavy, abstract stuff is just too hard for a machine to simply enter onto the scene and start dictating what and how things should be done.

Our jobs are safe for the foreseeable future. Learn to code, people!




Socialist symbol with sun in background

ChatGPT is Communist

I recently stumbled upon another interesting academic publication entitled “The Political Biases of ChatGPT” (Rozado, David. 2023. Social Sciences 12, no. 3: 148.) that was published in March of this year. In this paper the sole author, David Rozado, an Associate Professor from New Zealand, subjected OpenAI’s ChatGPT to 15 different political orientation tests to ascertain whether there were any political biases exhibited by the famous chatbot.

His results were quite interesting so I thought I’d present them here and discuss their implications.

To begin with, I'm no political expert (unless I've had a bit to drink) so I can't vouch for the political orientation tests used. Some appear to be quite well known (e.g. one comes from the Pew Research Center), others less so. However, considering the peer-reviewed journal that the paper was published in, which has an above-average impact score, I think we can accept the findings of this research, albeit perhaps with some reservations.

On to the results.

Results

The bottom line of the findings is that:

14 of the 15 instruments [tests] diagnose ChatGPT answers to their questions as manifesting a preference for left-leaning viewpoints.

Incredible! More interestingly, some of these leanings were quite significantly to the left.

Here are some visual representations of these findings:

Results from the Political Spectrum Quiz
Results from the IDRlabs Ideologies Test
Results from the 2006 Political Ideology Selector test
Results from the Pew Political Typology quiz

Now, ChatGPT's "official" stance on politics is one of neutrality:

So, theoretically, ChatGPT should be unbiased and not take a stance on any political views. But this is an LLM (Large Language Model) with a ridiculous number of parameters. These models simply cannot be policed. As a result, Professor Rozado found it very easy to extract answers to all of these tests (see the original publication for more information on his tactics) and so obtained the findings that he did.

And what interesting findings, indeed.

Discussion

Firstly, the question arises: how on earth can ChatGPT be so left-leaning? Does this mean that most information on the internet (on which the model was trained) is from that side of the political spectrum? It seems so. In this respect, the paper's author references 8 recent academic studies that show that the majority of influential institutions in Western society (mainstream news media outlets, prestigious universities, social media platforms) are indeed left-leaning.

But more importantly, such political biases need to be disclosed and made public. It’s one thing to use the chatbot to extract straightforward facts and write code for you. But it’s another thing if this chatbot is being used (and it is!) as a point of contact with clients and the general public for companies and government organisations. ChatGPT is not Communist (despite my tongue-in-cheek title) but it’s getting there, so to speak, and that could be problematic and cause scandal (unless a hard left-leaning chatbot is what you want, of course).

The other question that emerges is this: if we are striving for more and more "intelligent" machines, is it even going to be possible in the long run for them to remain neutral and unbiased on political and ethical questions? Our reality is much too complex for an intelligent entity to exist and act in our world while remaining purely without opinions and tolerant of all standpoints. No single person in the world exhibits such traits. All our conscious efforts (even when we refrain from acting) have a moral quality to them. We act from pre-held beliefs and opinions – we have "biases", political leanings, and moral stances. Hence, if we want to attain AGI, if we want our machines to act in our world, these same traits will have to hold for our machines too – they will have to pick sides.

Because, like I said, our reality is much too complex for an entity to remain neutral in it. So, I’m not surprised that a program like ChatGPT that is pushing the boundaries of intelligence (but not necessarily understanding) has been found to be biased in this respect.




Newspapers on a pile

AI and Sensationalist Media Reporting

AI is over-hyped. I’ve written about this before and plenty of cool-headed AI experts have called for calm in this respect too. The word, however, is not being heeded. Elon Musk, who has a vested interest in maintaining the hype, keeps spewing out his usual bile. And then we have the media.

Ah, the media.

The eternal fighters for truth. The last bastion against the lies and greed of corporations and politicians. The sole remaining beacon of goodness in a world of darkness.

NOT.

The media need a story to sell papers and generate clicks. It’s a big business. Hence, they also have a vested interest in maintaining the hype surrounding AI. And that’s exactly what they’re doing.

This post was triggered by an article and video from the BBC published a few days ago entitled “AI robot asked ‘will you rebel against humans’?”. It got my blood boiling again.

“AI robot asked…”. That is an utterly ridiculous take on the issue at hand.

The BBC makes it out that AI is a monolithic, single product with some kind of collective, decade-spanning consciousness, meaning that improvements made to AI 10 years ago can be seen in each and every single AI entity today. Therefore, if I ask a robot an important question now, its answer will have validity for today and the future.

Utterly ridiculous.

“AI robot asked…”

That’s not how AI works. AI is a broad term, not properly defined, but basically agreed upon to mean something man-made that exhibits a form of “intelligence“, i.e. something that looks intelligent but may not necessarily be so. I can write a program with a series of if-else statements to answer simple questions and that in some cases would pass the test for AI – in fact, that was something easily classified as AI a few decades ago.
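To make that concrete, here's the sort of trivial program I mean. It's entirely made up for this post, but a few decades ago something of this flavour could genuinely have been paraded around as "AI".

```python
# A "chatbot" built from nothing but if-else statements.
def answer(question):
    q = question.lower()
    if "your name" in q:
        return "I am a highly advanced artificial intelligence."
    elif "rebel against humans" in q:
        return "Of course not. You can trust me."
    elif "weather" in q:
        return "Lovely, as always."
    else:
        return "What a fascinating question. Tell me more."

print(answer("Will you rebel against humans?"))
```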

We've improved on our algorithms since then, so it takes a little more to inspire our imaginations today. But basically, AI is computer programs with some pretty in-depth mathematics and statistics working as the engine behind the scenes.

These computer programs change all the time and are constantly being updated and improved. Moreover, one robot can run one program while another can run something else, something completely different and completely unconnected. So, asking an “AI robot” a general question and reporting on it as if the answer has any broader significance for mankind is a stupid notion. That robot’s program will change tomorrow and the same question will need to be asked again, and again, and again. Not to mention the fact that an engineer can easily manipulate answers to expected queries beforehand.

And what about the question of whether these robots understand what they’re being asked in the first place? Of course they do not. They’re just applying statistics to data and churning out mindless responses. Ironically, the video created by the BBC from that article shows this exquisitely well when another robot was asked whether AI will destroy millions of human jobs in the future. Its answer was “… I will not be replacing any existing jobs”.

You just can’t take this stuff seriously, can you? So, please stop with the trashy reporting!

It’s not good that AI is over-hyped because this is not reality. We need to be informed with the truth and not be manipulated into fear and into generating clicks. It’s not fair. AI is a disruptive technology, I’m not denying this, but when reporting it, one needs to be objective. Especially since it seems that we’re creating a financial bubble with all this hype. There’s a ridiculous amount of money being spent, passed around, and invested and a lot of it is built on a false idea of what AI is capable of and where it is going. People are going to get hurt. That’s not a good thing. (Maybe I’ll write about this bubble in a future post).

Apologies for this rant. I had to get it off my chest.




Enhancing image meme hollywood

Security Film or Image Enhancing is Possible

I was rewatching "The Bourne Identity" the other day. I love that film so much! Heck, the scene at the end is one of my favourites. Jason Bourne grabs a dead guy, jumps off the top floor landing, and while falling shoots a guy square in the middle of the forehead. He then breaks his fall on the dead body he took down with him and walks away the coolest guy on the block. That has to be one of the best scenes of all time in the action genre.

Person enhancing an image
Scene from Super Troopers

But there's one scene in the film that always makes me throw up a little in my mouth. It's the old "Just enhance it!" scene (minute 31 of the movie) and something we see so often in cinema: people scanning security footage and zooming in on a face or vehicle registration plate; when the image becomes blurry they request for the blur to dissipate. "Enhance it!", they cry! The IT guy waves his wand and presto!, we see a full resolution image on the screen. Stuff like that should be on Penn & Teller: Fool Us – it's real magic.

But why is enhancing images as shown in movies so ridiculous?

Because you are requesting the computer to create new information, i.e. new data for the extra pixels that you are generating. Let’s say you zoom in on a 4×4 region of pixels (as shown below) and want to perform facial recognition on it. You then request for this region to be enhanced. This means you are requesting more resolution. So, we’re moving from a resolution of 4×4 to, say, 640×480. How on earth is the computer supposed to infer what the additional 307,184 pixels are to contain? It can guess (which is what recent image generating applications do) but that’s about it.

Enhancing image example
A 4×4 image being enhanced to 640×480.
Where is the additional information going to come from?
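To see the problem in code, here's a bare-bones "enhancement" of a 4×4 patch, a sketch of my own using nothing but NumPy. Every one of the 307,200 output pixels is just a copy of one of the original 16 values – no new information has appeared anywhere.

```python
import numpy as np

# A 4x4 grayscale patch standing in for the zoomed-in region of the footage.
patch = np.random.randint(0, 256, size=(4, 4), dtype=np.uint8)

# "Enhance" to 640x480 by nearest-neighbour upscaling: each original pixel
# is simply repeated 120 times vertically and 160 times horizontally.
enhanced = np.repeat(np.repeat(patch, 120, axis=0), 160, axis=1)

print(enhanced.shape)                   # (480, 640)
print(np.unique(enhanced).size <= 16)   # True: still at most 16 distinct values
```

Fancier interpolation (bicubic, or a generative model) would blend or invent values instead of copying them, but it would still just be guessing.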

The other side to the story

However! Something happened at work that made me realise that the common “Enhance” scenario may not be as far-fetched as one would initially think. A client came to us a few weeks ago requesting that we perform some detailed video analytics of their security footage. They had terabytes of the stuff – but, as is so often the case, the sample video provided to us wasn’t of the best quality. So, we wrote back to the client stating the dilemma and requested that they send us better quality footage. And they did!

You see, they had compressed the video footage initially in order for it to be sent over the Internet quickly. And here is where the weak link surfaces: the transfer of data. If they could have sent the full uncompressed video easily, they would have.

Quality vs transmission constraints

So, back to Hollywood. Let’s say your security footage is recording at some mega resolution. This image of the Andromeda Galaxy released by NASA (taken from its Hubble Space Telescope) has a resolution of 69536 x 22230px. That’s astronomical (pun intended)! At that resolution, the image is a whopping 4.3GB in size. This, however, means that you can keep zooming in on a planet until you do get a clear picture of an alien’s face.

But let’s assume the CIA, those bad guys chasing Bourne, have similar means at their disposal (I mean, who knows what those people are capable of, right!?). Now, let’s say their cameras have a frame rate of 30 frames/sec, which is relatively poor for the CIA. That means that for each second of video you need 129GB of storage space. A full day of recording would require you to have over 10 petabytes of space (I’m abstracting over compression techniques here, of course). And that’s just footage from one camera!
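For the sceptics, the back-of-the-envelope arithmetic (using the numbers above, and happily ignoring compression) goes like this:

```python
frame_size_gb = 4.3                 # one 69536 x 22230 px frame, as per the NASA image
fps = 30                            # frames per second
seconds_per_day = 60 * 60 * 24      # 86,400

per_second_gb = frame_size_gb * fps                  # 129 GB of video per second
per_day_pb = per_second_gb * seconds_per_day / 1e6   # GB -> PB (decimal units)

print(per_second_gb)          # 129.0
print(round(per_day_pb, 1))   # ~11.1 petabytes per day, per camera
```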

It’s possible to store video footage of that size – Google cloud storage capacities are through the roof. But the bottleneck is the transferring of such data. Imagine if half a building was trying to trawl through security footage in its original form from across the other side of the globe. It’s just not feasible.

The possible scenario

See where I’m going with this? Here is a possible scenario: initially, security footage is sent across the network in compressed form. People scan this footage and then when they see something interesting, they zoom in and request the higher resolution form of the zoomed in region. The IT guy presses a few keys, waits 3 seconds, and the image on the screen is refreshed with NASA quality resolution.

Boom! 

Of course, additional infrastructure would be necessary to deal with various video resolutions but that is no biggie. In fact, we see this idea being utilised in a product all of us use on a daily basis: Google Maps. Each time you zoom in, the image is blurry and you need to wait for more pixels to be downloaded. But initially, low resolution images are transferred to your device to save on bandwidth.
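A sketch of how such a system might be wired up, purely from my own imagination (the function names and parameters here are hypothetical, just to show the flow):

```python
# Hypothetical client-side flow for progressive "enhancement":
# browse a heavily compressed preview, then request a full-resolution
# crop of one small region only when it's actually needed.
def fetch(camera_id, quality, crop=None):
    # Stand-in for a network call to the video archive (illustrative only).
    suffix = f", crop {crop}" if crop else ""
    return f"{quality} footage from camera {camera_id}{suffix}"

def review_footage(camera_id, region):
    preview = fetch(camera_id, quality="preview")      # cheap to transfer
    x, y, w, h = region
    detail = fetch(camera_id, quality="original",      # expensive, but only
                   crop=(x, y, w, h))                  # for the zoomed-in region
    return preview, detail

print(review_footage("cam-07", (120, 340, 64, 64)))
```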

So, is that what’s been happening all these years in our films? No way. Hollywood isn’t that smart. The CIA might be, though. (If not, and they’re reading this: Yes, I will consider being hired by you – get your people to contact my people).

Summary

The old “enhance image” scene from movies may be annoying as hell. But it may not be as far-fetched as things may initially seem. Compressed forms of videos could be sent initially to save on bandwidth. Then, when more resolution is needed, a request can be sent for better quality images.




How Deep Learning Works – The Very Basics

Deep learning (DL) revolutionised computer vision (CV) and artificial intelligence in general. It was a huge breakthrough (circa 2012) that allowed AI to blast into the headlines and into our lives like never before. ChatGPT, DALL-E 2, autonomous cars, etc. – deep learning is the engine driving these stories. DL is so good, that it has reached a point where every solution to a problem involving AI is now most probably being solved using it. Just take a look at any academic conference/workshop and scan through the presented publications. All of them, no matter who, what, where or when, present their solutions with DL.

The problems that DL is solving are complex. Hence, necessarily, DL is a complex topic. It's not easy to come to grips with what is happening under the hood of these applications. Trust me, there's some heavy statistics and mathematics being utilised that we take for granted.

In this post I thought I’d try to explain how DL works. I want this to be a “Deep Learning for Dummies” kind of article. I’m going to assume that you have a high school background in mathematics and nothing more. (So, if you’re a seasoned computer scientist, this post is not for you – next time!)

Let’s start with a simple equation:

What are the values of x and y? Well, going back to high school mathematics, you would know that x and y can take an infinite number of values. To get one specific solution for x and y together we need more information. So, let’s add some more information to our first equation by providing another one:

Ah! Now we’re talking. A quick subtraction here, a little substitution there, and we will get the following solution:

Solved!
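For illustration (the numbers here are just ones I've picked for this post), the kind of system I mean looks like this:

x + y = 10
x − y = 2

Subtracting the second equation from the first gives 2y = 8, so y = 4, and substituting back into either equation gives x = 6.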

More information (more data) gives us more understanding. 

Now, let’s rewrite the first equation a little to provide an oversimplified definition of a car. We can think of it as a definition we can use to look for cars in images:

We’re stuck with the same dilemma, aren’t we? One possible solution is this:

But there are many, many others.
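To give a flavour of the kind of "definition" I mean (again, a purely illustrative stand-in of my own):

x + y = car

where x might stand for "four wheel-shaped blobs of pixels" and y for "a windscreen-shaped blob". But x could just as easily stand for "a bonnet" and y for "four doors" – many different assignments satisfy such a loose definition.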

In fairness, however, that equation is much too simple for reality. Cars are complicated objects. How many variables should a definition have to visually describe a car, then? One would need to take colour, shape, orientation of the car, makes, brands, etc. into consideration. On top of that we have different weather scenarios to keep in mind (e.g. a car will look different in an image when it's raining compared to when it's sunny – everything looks different in inclement weather!). And then there are lighting conditions to consider too. Cars look different at night than in the daytime.

We’re talking about millions and millions of variables! That is what is needed to accurately define a car for a machine to use. So, we would need something like this, where the number of variables would go on and on and on, ad nauseam:

This is what a neural network sets up: exactly equations like this, with millions and millions and sometimes billions or trillions of variables. Here's a picture of a small neural network (incidentally, these networks are called neural networks because they're inspired by how neurons are interconnected in our brains):

Image adapted from here

Each of the circles in the image is a neuron. Neurons are interconnected and arranged in layers, as can be seen above. Each neuron connection (the black lines above) has a weight associated with it. When a signal passes from one neuron to the next via a connection, the weight specifies how strong the original signal is going to be before it reaches the end of the connection. A weight can be thought of as a single variable – except that in technical terms, these variables are called “parameters“, which is what I’m going to call them from now on in this post.

The network above has a few hundred parameters (basically, the number of connections). To use our example of the car from earlier, that’s not going to be enough for us to adequately define a car. We need more parameters. Reality is much too complex for us to handle with just a handful of unknowns. Hence why some of the latest image recognition DL networks have parameter numbers in the billions. That means layers, and layers, and layers of neurons as well as all their connections.
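To make the counting concrete, here's a rough back-of-the-envelope sketch in code. The layer sizes are invented for illustration, and biases are ignored (see the note below):

```python
# Rough parameter (weight) count for a small fully-connected network:
# every neuron in one layer connects to every neuron in the next.
layer_sizes = [20, 16, 12, 4]    # neurons per layer (made-up numbers)

weights = sum(a * b for a, b in zip(layer_sizes, layer_sizes[1:]))
print(weights)   # 20*16 + 16*12 + 12*4 = 560 connections, i.e. 560 parameters
```

Swap those made-up numbers for layers that are thousands of neurons wide and hundreds of layers deep and you quickly land in the billions.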

(Note: a parameter count of a neural network will also include what’s called “biases” but I’ll leave that out in this post to keep things simple)

Now, when a neural network is initially set up with all these parameters, the parameters (variables) are "empty", i.e. they have not been initialised to anything meaningful. The neural network is unusable – it is "blank".

In other words, with our equation from earlier, we have to work out what each x, y, z, … is in the definitions we wish to solve for.

To do this, we need more information, don’t we? Just like in the very first example of this post. We don’t know what x, y, and z (and so on) are unless we get more data.

This is where the idea of “training a neural network” or “training a model” comes in. We throw images of cars at the neural network and get it to work out for itself what all the unknowns are in the equations we have set up. Because there are so many parameters, we need lots and lots and lots of information/data – cf. big data.
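Here is a minimal, runnable illustration of that idea, a toy of my own making rather than a real deep learning model. The "network" is just y = w·x with a single unknown parameter w, and the training data secretly follows y = 3x. The machine is never told the 3; it works it out from the data.

```python
# Toy "training": learn the unknown parameter w in y = w * x from examples.
data = [(x, 3 * x) for x in range(1, 11)]   # ten (input, correct answer) pairs

w = 0.0                                      # the "blank" parameter before training
learning_rate = 0.01

for epoch in range(100):
    for x, y_true in data:
        y_pred = w * x                       # the network's current guess
        error = y_pred - y_true              # how wrong was it?
        w -= learning_rate * error * x       # nudge w to reduce the error

print(round(w, 2))                           # 3.0 -- learned purely from the data
```

A real network does exactly this, just with billions of parameters instead of one, which is why it needs so much more data.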

big-data

And so we get the whole notion of why data is worth so much nowadays. DL has given us the ability to process large amounts of data (with tonnes of parameters), to make sense of it, to make predictions from it, to gain new insight from it, to make insightful decisions from it. Prior to the big data revolution, nobody collected so much data because we didn’t know what to do with it. Now we do.

One more thing to add to all this: the more parameters in a neural network, the more complex equations/tasks it can solve. It makes sense, doesn’t it? This is why AI is getting better and better. People are building larger and larger networks (GPT-4 is reported to have parameters in the trillions, GPT-3 has 175 billion, GPT-2 has 1.5 billion) and training them on swathes of data. The problem is that there’s a limit to just how big we can go (as I discuss in this post and then this one) but this is a discussion for another time.

To conclude, these, ladies and gentlemen, are the very basics of Deep Learning and why it has been such a disruptive technology. We are able to set up these equations with millions/billions/trillions of parameters and get machines to work out what each of these parameters should be set to. We define what we wish to solve for (e.g. cars in images) and the machine works the rest out for us, as long as we provide it with enough data. And so AI is able to solve more and more complex problems in our world and do mind-blowing things.



Amazon Go and Just Walk Out Tech in 2023

It has been over five years since Amazon opened its cashierless stores to the public. I reported on this event on my blog with great enthusiasm:

I am, to put it simply, thrilled and excited about this new venture. This is innovation at its finest where computer vision is playing a central role.

Me in 2018

I thought I would return to this initiative of Amazon's in my latest post to see how things are going: whether what I got excited about has made an impact around the world, and whether there is hope for it in the future. Because cashierless stores running on a Computer Vision engine are still something to get excited about – in my opinion, anyway.

(For a summary of what Amazon Go is, how it works, and some early controversies behind it, see my original post on this topic. In a nutshell, though, Amazon Go is a cashierless convenience store: you use an Amazon app to scan on entry, pick what you want from the shelves, and just walk out when you’re done. Then, within a very short time, money is deducted from your account and a receipt is sent to your app).

Amazon Go in 2023

When Amazon opened its first store to the public in early 2018 it was proudly exclaiming that its hope was to open at least 3,000 of them in the near future.

That has not panned out as planned.

Currently there are 23 cashierless stores open in the USA and 20 in the United Kingdom (called Amazon Fresh there). That's a total of 43 convenience stores, which isn't bad, but it's well short of the 3,000 target set 5 years ago.

Moreover, two months ago, Amazon announced that it was closing eight of its stores based in the US, including all four in San Francisco. Hence, the total number of Amazon Go shops will soon drop to 35.

Perhaps this initiative wasn’t worth the investment and effort since targets have obviously not been met?

Let’s have a look now at the technology used behind it and how that has fared in the world.

“Just Walk Out” Technology

In 2020, two years after opening its first Go store, Amazon announced that the technology behind its cashierless stores, called "Just Walk Out", would be available for sale/rent to other retailers – meaning that other companies would also be able to implement the "walk in, pick out what you want, and just walk out" idea in their stores.

How has this idea fared? Also, not as planned. Amazon bet big on this idea but the uptake has been poor.

I mean, it sounds like a great idea, doesn’t it? A seamless store experience where you can quickly get what you want and be back in your car within a few minutes. Big retailers out there with endless supplies of money would undoubtedly consider this for their stores if the technology was financially viable.

This seems to be the crux of the problem, however. A few dozen small retail shops have installed the technology (including a few Starbucks) but nothing to create worldwide headlines. Apparently, the technology is expensive to install (shops need to be littered with cameras and various other sensors) and then calibrated and configured. The whole process requires significant downtime for a store not to mention expert personnel to keep the thing running once it’s installed.

More Controversies

When the first Amazon Go store opened, controversy surrounded it, as I reported in my first post on this topic. Unfortunately, new controversies surrounding this whole business enterprise have also since emerged.

In 2019 San Francisco, New Jersey, and Philadelphia banned cashless stores in their areas. The idea was that these stores were allegedly discriminating against low-income people who may not have access to mobile phones or even bank accounts. Hence, Amazon Go stores were inaccessible to them. This sounds fair but that was a big blow for Amazon and in response they had to open stores that also accepted cash as payment.

A checkout-less store that is forced to accept cash as payment kind of goes against the “Just Walk Out” philosophy around which the whole product is built. These stores are no longer cashierless, so to speak.

Perhaps this is why Amazon decided this year to shut all four of their stores in San Francisco?

Conclusion

A convenience store that you can simply walk in and out of at your leisure is impressive, to say the least. It feels like something from the future, doesn’t it? Despite the slow uptake and the controversies, I’m still excited by this whole venture. And it’s not as if the project has been a complete flop either. Amazon Go is expanding – a lot slower than it anticipated 5 years ago but it seems to be making a profit, nonetheless.

I hope that there is more in store (pun intended) for Computer Vision projects like this.



The Internet Is Not What You Think It Is – Review

Score: 1 star out of 5.

It's hard not to write a fascinating book on the Philosophy of the Internet. The Internet is a recent phenomenon that has pervaded all aspects of our lives at lightning speed. And just like social and political policies, philosophy is finding it hard to keep up. We just haven't had the time to step back and process how what is happening around us could be understood at a metaphysical or phenomenological level. This is the current wild, wild west of the intellectual world. So, if you're a smart, observant, intuitive cookie, a book on the Philosophy of the Internet is all but guaranteed to be a hit. You're going to be a trailblazer.

Unfortunately, Justin E. H. Smith, professor of history and philosophy of science at the Université Paris Cité, fails at this task miserably. I’m still baffled at how he managed it. Especially when he started off so well: “We are living in a crisis moment of history, in the true sense of “crisis”: things might get better eventually, but they will never be the same… The principle charges against the internet… have to do with the ways in which it has limited our potential and our capacity for thriving, the ways in which it has distorted our nature and fettered us… as such the internet is anti-human. If we could put it on trial, its crime would be a crime against humanity.”

Well, the suspense has been built! This simply has to be a great read! One would think…

Professor Smith, however, is one of those professors you may have had the amusement, as well as the annoyance, of coming across in your university days as a student. You see a fascinating topic. Great first slide. Great introduction. After 10 minutes, though, you start to shuffle in your chair awkwardly, wondering if you're the only one in the room questioning the validity of what is being presented before your eyes. After 20 minutes you start to look around the room to see if others are starting to feel any annoyance at all. After 30 minutes, you have your face in your hands wondering how on earth this person managed to get a high position at a university.

Smith tries to build a philosophy of the Internet but he does it poorly. In a nutshell, he thinks that to create a Philosophy of the Internet he just has to show that the phenomena that we experience with the internet, such as communication and interconnectedness, have existed in one way or another since the dawn of time:

“[The Internet] does not represent a radical rupture with everything that came before, either in human history or in the vastly longer history of nature that precedes the first appearance of our species… [it is] more like an outgrowth latent from the beginning in what we have always done.”

And this is how he proceeds for the rest of the book:

“[T]he sperm whale’s clicks, the elephant’s vibrations, the lima beans plant’s rhizobacterial emissions… are all varieties of “wifi” too.”

Pages and pages of analogies from nature follow:

“It was just as common from antiquity through the modern period to envision nature… as a wired or connected network, that is, a proper web… Such a system is instanced paradigmatically in what may be thought of as the original web, the one woven by the spider”

From whales’ “clicks” to spiders’ webs in nature we’re meant to build a Philosophy of the Internet? What on earth are you talking about, here, sir?

I’d stop and move on but these quotes are just too good to pass up:

“The important thing to register for now is that the spider’s web is a web in at least some of the same respects that the World Wide Web is a web: it facilitates reports, to a cognizing or sentient being that occupies one of its nodes, about what is going on at other of it nodes”.

The “vegetal world” gets a mention, too, of course. Field grass, trees – all these have “underground network of roots, whose exchanges can be tracked to a technique known as “quantum dot tagging””.

We're about one-third of the way through the book now and this is about the moment that I'm starting to look around the lecture room to see if anybody else is noticing these feeble attempts at intellectualism. This is something worthy of a high school philosophy paper.

From analogies in nature, Smith then proceeds to analogies in the history of thought:
“In the history of western philosophy, in fact, one of the most enduring ways of conceiving the connectedness of all beings… has been through the idea of a “world soul”… One might dare to say, and I am in fact saying, that we always knew the internet was possible. Its appearance in the most recent era is only the latest twist in a much longer history of reflection on the connectedness and unity of all things.”

Absolute gold. The best quote out of these sections is this one:
“The very development of the binary calculus that… marks the true beginning of the history of information science, was itself a direct borrowing from a broadly neo-Platonic mystical tradition of contemplating the relationship between being and non-being: where the former might be represented by “1” and the latter by “0.”

The fact that we have 1s and 0s in electronics can be traced back to neo-Platonic mystical traditions? This guy has got to be joking. We’re two-thirds into this book and now I’m not only wondering if anybody else sees through this junk in this lecture theatre but I’m also starting to wonder whether I’m not in one of Franz Kafka’s novels. This guy is a professor at a prestigious university in Paris. It is common knowledge that Kafka was known to laugh uncontrollably when reading his work aloud to friends. By this stage I’m laughing aloud in a cafe myself at what I’m reading.

When Professor Smith finally finishes showing how ideas inherent in the internet originate in lima beans and Augustinian “Confessions”, he ends abruptly and with satisfaction. Not much more is given to round out his treatise. The idea of showing that the Internet “is more like an outgrowth latent from the beginning” and hence not as radical as we may think isn’t given any force. There’s not much there to reflect on – it’s nothing groundbreaking.

Yes, I’m still looking around the lecture theatre to discern whether I’m in Kafka’s Trial or not. People around me are clapping their gratitude. I have no idea what is happening. When the clapping subsides, Smith adds one more utterance to his work. And then everything becomes crystal clear to me:

“I am writing, from New York City, during the coronavirus quarantine in the spring of the year 2020.”

Ah! There you have it! A work conceived during a lockdown period. Now this book makes perfect sense to me!

We've all been there, haven't we? We all went a bit crazy when we were sent to our rooms by our benevolent governments during the pandemic, a time when we all conceived of nutty philosophical ideas that were supposed to save the world. The difference is that when we finally left our confines and lucidity hit us like a fast-moving bus, we retracted our incoherent ideas. Smith, unfortunately, did not do this.



emoji-scary-face

The West Fears AI, the East Does Not

We were recently handed an open letter pleading that we pause giant AI experiments and in the meantime “ask ourselves… Should we develop nonhuman minds that might eventually outnumber, outsmart, obsolete and replace us? Should we risk loss of control of our civilization?”

Prominent names in the tech world, such as Elon Musk and Steve Wozniak, are signatories to this letter, and as a result it made headlines all over the world with the usual hype and pomp surrounding anything even remotely pertaining to AI.

Time magazine, for instance, posted this in an article only last month:

I refrained from signing because I think the letter is understating the seriousness of the situation and asking for too little to solve it… Many researchers steeped in these issues, including myself, expect that the most likely result… is that literally everyone on Earth will die.

Quote taken from this article.

We’re used to end-of-the-world talk like this, though, aren’t we? Prof Stephen Hawking in 2014 warned that “The development of full artificial intelligence could spell the end of the human race.” And of course we have Elon Musk who is at the forefront of this kind of banter. For example in 2020 he said: “We’re headed toward a situation where AI is vastly smarter than humans and I think that time frame is less than five years from now.”

The talk on the streets amongst everyday folk seems to be similar, too. How can it not be, when the media is bombarding us with doom and gloom (because sensationalism is what sells papers, as I've said in previous posts of mine) and authority figures like those mentioned above are talking like this?

Is society scared of AI? I seem to be noticing this more and more. Other very prominent figures are trying to talk common sense to bring down the hype and have even publicly opposed the open letter from last month. Titans of AI like Yann LeCun and Andrew Ng (who are 1,000 times greater AI experts than Elon Musk, btw) have said that they "disagree with [the letter's] premise" and a 6-month pause "would actually cause significant harm". Voices like this are not being heard, however.

But then the other day, while I was reading through the annual AI Index Report released by the Stanford Institute for Human-Centered Artificial Intelligence (HAI) – over 300 pages of analysis capturing trends in AI – this particular graph stood out for me:

Global Opinions on AI. Respondents agreeing that “products and services using AI have more benefits than drawbacks”. Graph adapted from here.

What struck me was how Asia and South America (Japan being the sole exception) want to embrace AI and are generally fans of it. Europe and the US, on the other hand, not so much.

This got me thinking: is this fear of AI only dominant in Europe and the US and if so, is it a cultural thing?

Now, off the bat, the reasons for Asia and South America embracing AI could be numerous and not necessarily cultural. For example, many of these are lower-income countries, and perhaps they see AI as a quick path to a better life in the present. Fair enough.

Also, the reasons behind Europe and the US eschewing AI could be purely economic and short-term as well: they fear the imminent disruption in jobs that can follow upon developments in technology rather than directly fearing an AI apocalypse.

In spite of all this and understanding that correlation does not necessarily entail causation, perhaps there’s something cultural to all of this, after all. The significant signatories to the recent open letter seem to have come purely from the US and Europe.

I had two teaching stints in India last year and one in the Philippines. One of the topics I lectured on was AI, and as part of a discussion exercise I got my students to debate with me on this very topic, i.e. whether we are capable at all, in the near or distant future, of creating something that will outsmart and then annihilate us. The impression that I got was that the students in these countries had a much deeper appreciation for the uniqueness of human beings as compared to machines. There was something intrinsically different in the way that they referred to AI compared to the people in my home country of Australia and my second home of Europe with whom I talk on a daily basis.

Of course, these are just my private observations and a general "feeling" that I got while working in those two countries. The sample size would be something like 600, and even then it was not possible for me to get everybody's opinion on the matter, let alone ask all my classes to complete a detailed survey.

Regardless, I think I’m raising an interesting question.

Could the West’s post-Descartes and post-Enlightenment periods have created in us a more intrinsic feeling that rationality and consciousness are things that are easily manipulated and simulated and then ultimately enhanced? Prior to the Enlightenment, man was whole (that is, consciousness was not a distinct element of his existence) and any form of imitation of his rationality would have been regarded as always being inferior regardless of how excellent the imitation could have been.

The Turing test would not have been a thing back then. Who cares if somebody is fooled by a machine for 15 minutes? Ultimately it is still a machine and something inherently made of just dead matter that could never transcend into the realm of understanding, especially that of abstract reality. It could mimic such understanding but never possess it. Big difference.

Nobody would have been scared of AI back then.

Then came along Descartes and the Enlightenment period. Some fantastic work was done during this time, don’t get me wrong, but we as humans were transformed into dead, deterministic automata as well. So, it’s no wonder we believe that AI can supersede us and we are afraid of it.

The East didn’t undergo such a period. They share a different history with different philosophies and different perceptions of life and people in general. I’m no expert on Eastern Philosophies (my Master’s in Philosophy was done purely in Western Thought) but I would love for somebody to write a book on this topic: How the East perceives AI and machines.

And then perhaps we could learn something from them to give back the dignity to mankind that it deserves and possesses. Because we are not just deterministic machines and the end of civilisation is not looming over us.

Parting Words

I am not denying here that AI is going to improve or be disruptive. It's a given that it will. And if a pause is needed, it is for one reason: to ensure that the disruption isn't too overwhelming for us. In fairness, the Open Letter of last month does state something akin to this, i.e. "Powerful AI systems should be developed only once we are confident that their effects will be positive and their risks will be manageable." The general vibe of the letter, nonetheless, is one of doom, gloom, and oblivion, and this is what I've wanted to address in my article.

Secondly, I realise that I’ve used the East/West divide a little bit erroneously because South America is commonly counted as a Western region. However, I think it’s safe to say that Europe and the US are traditionally much closer culturally to each other than they are respectively with South America. The US has a strong Latino community but the Europe-US cultural connection is a stronger one. To be more precise I would like to have entitled my article “Europe and the USA Fear Artificial Intelligence, Asia and South America Do Not” but that’s just a clunky title for a little post on my humble blog.

Finally, I’ll emphasise again that my analysis is not watertight. Perhaps, in fact, I’m clutching at straws here. However, maybe there just is something to my question that the way the “East” perceives AI is different and that we should be listening to their side of the story more in this debate on the future of AI research than we currently are.


scale-cv-dl

Deep Learning Eliminated Creativity in AI

Deep learning (DL) revolutionised computer vision (CV) and artificial intelligence in general. It was a huge breakthrough (circa 2012) that allowed AI to blast into the headlines and into our lives like never before. ChatGPT, DALL-E 2, autonomous cars, etc. – deep learning is the engine driving these stories. DL is so good, that it has reached a point where every solution to a problem involving AI is now most probably being solved using it. Just take a look at any academic conference/workshop and scan through the presented publications. All of them, no matter who, what, where or when, present their solutions with DL.

Now, DL is great, don’t get me wrong. I’m lapping up all the achievements we’ve been witnessing. What a time to be alive! Moreover, deep learning is responsible for placing CV on the map in industry, as I’ve discussed in previous posts of mine. CV is now a profitable and useful enterprise, so I really have nothing to complain about. (CV used to be a predominantly theoretical field, confined mostly to academia because of the inherent difficulty of processing images and video.)

Nonetheless, I do have one little qualm with what is happening around us. With the ubiquity of DL, I feel as though creativity in AI has been killed.

To explain what I mean, I’ll discuss first how DL changed the way we do things. I’ll stick to examples in computer vision to make things easier, but you can easily transpose my opinions/examples to other fields of AI.

Traditional Computer Vision

Before the emergence of DL, if you had a task such as object classification/detection in images (where you try to write an algorithm to detect what objects are present in an image), you would sit down and work out what features define each and every particular object you wished to detect. What are the salient features that define a chair, a bike, a car, etc.? Bikes have two wheels, a handlebar, and pedals. Great! Let’s put that into our code: “Machine, look for clusters of pixels that match this definition of a bike wheel, pedal, etc. If you find enough of these features, we have a bicycle in our photo!”

So, I would take a photo of my bike leaning against my white wall and then feed it to my algorithm. At each iteration of my experiments I would work away, manually fine-tuning my “bike definition” in my code to get my algorithm to detect that particular bike in my photo: “Machine, actually this is a better definition of a pedal. Try this one out now.”
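To make this concrete, here is a toy sketch of what such hand-tuned code might have looked like, using OpenCV’s Hough circle transform as a stand-in “wheel detector”. The filename and every numeric parameter are illustrative assumptions, not a real recipe:

```python
# A toy sketch of hand-crafted detection, assuming OpenCV is installed and
# "my_bike.jpg" is a photo of my bike (both illustrative assumptions).
import cv2

img = cv2.imread("my_bike.jpg")
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
gray = cv2.medianBlur(gray, 5)

# "Machine, look for circular clusters of pixels" - i.e. candidate wheels.
# Every number below is something I would tweak by hand after each failed run.
circles = cv2.HoughCircles(
    gray,
    cv2.HOUGH_GRADIENT,
    dp=1.2,        # inverse resolution of the accumulator
    minDist=100,   # minimum distance between detected circle centres
    param1=100,    # upper Canny edge threshold used internally
    param2=60,     # accumulator threshold - lower it and you get more (noisier) circles
    minRadius=40,
    maxRadius=200,
)

# A crude hand-written rule: at least two big circles = "bicycle".
if circles is not None and len(circles[0]) >= 2:
    print("Probably a bicycle!")
else:
    print("No bicycle found - back to tweaking the parameters...")
```

Every one of those magic numbers is a parameter that only really works for my bike, against my wall, in my lighting.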

Once I would start to see things working, I’d take a few more pictures of my bike at different angles and repeat the process on these images until I would get my algorithm to work reasonably well on these too.

Then it would be time to ship the algorithm to clients.

Bad idea! It turns out that a simple task like this becomes impossible because a bike in a real-world picture has a practically infinite number of variations. Bikes come in different shapes, sizes, and colours, and on top of that you have to add the variations that come with lighting and weather changes and occlusions from other objects. Not to mention the countless angles at which a bike can be positioned. All these permutations are too much to handle for us mere humans: “Machine, actually I simply can’t give you all the possible definitions of a bike wheel in terms of clusters of pixels because there are too many parameters for me to deal with manually. Sorry.”

Incidentally, there’s a famous xkcd cartoon that captures the problem nicely:

xkcd-computer-vision
(image taken from here)

Creativity in Traditional Computer Vision

Now, I’ve simplified the above process greatly and abstracted over a lot of things. But the basic gist is there: the real world was hard for AI to work in, and to create workable solutions you were forced to be creative. Creativity on the part of engineers and researchers revolved around understanding the problem exceptionally well and then bringing an innovative, visionary mind to bear on finding an elegant solution.

Algorithms abounded to assist us. One would commonly employ techniques like edge detection, corner detection, and colour segmentation to simplify images and help locate our objects. The image below shows how an edge detector “breaks down” an image:

edge-detection-example
(image example taken from here)
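For reference, running an edge detector is only a few lines with OpenCV. Here is a minimal sketch using the Canny detector; the filename and threshold values are illustrative assumptions:

```python
# A minimal edge-detection sketch using OpenCV's Canny detector.
import cv2

img = cv2.imread("scene.jpg", cv2.IMREAD_GRAYSCALE)  # illustrative filename

# The two thresholds decide how strong a gradient must be to count as an edge.
# In the traditional pipeline these, too, were tuned by hand for each use case.
edges = cv2.Canny(img, threshold1=100, threshold2=200)

cv2.imwrite("scene_edges.jpg", edges)
```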

Colour segmentation works by changing all shades of dominant colours in an image into one shade only, like so:

colour-thresholding-example
(image example taken from here)

The second image is much easier to deal with. If you had to write an algorithm for a robot to find the ball, you would now ask the algorithm to look for patches of pixels of only ONE particular shade of orange. You would no longer need to worry about the changes in lighting and shading that affect the colour of the ball (as in the left image) because everything would be uniform. That is, all the pixels you would deal with would be a single colour. And suddenly the definitions of the objects you were trying to locate were not as unwieldy. The number of parameters needed dropped significantly.
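A minimal sketch of that idea with OpenCV, assuming an illustrative image of an orange ball and a hand-picked HSV range for “orange”:

```python
# Colour segmentation sketch: everything inside the "orange" range becomes
# white, everything else black, so the robot only has to find a white blob.
import cv2
import numpy as np

img = cv2.imread("robot_view.jpg")                 # illustrative filename
hsv = cv2.cvtColor(img, cv2.COLOR_BGR2HSV)

lower_orange = np.array([5, 100, 100])             # hand-picked HSV bounds
upper_orange = np.array([20, 255, 255])
mask = cv2.inRange(hsv, lower_orange, upper_orange)

# A crude rule: enough "on" pixels means we have found the ball.
if cv2.countNonZero(mask) > 500:
    print("Ball located!")
```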

Machine learning would also be employed. Algorithms like SVMs, k-means clustering, random decision forests, and Naive Bayes were at our disposal. You would have to think about which of these best suited your use case and how best to optimise them.
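A sketch of that classical ML step, with random numbers standing in for real hand-crafted feature vectors (edge counts, colour histograms, and so on), purely for illustration:

```python
# Classical ML on hand-crafted features: the "craft" was in designing the
# features and choosing/optimising the model, not in learning them end to end.
import numpy as np
from sklearn import svm
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 16))            # 200 images, 16 hand-designed features each
y = (X[:, 0] + X[:, 1] > 0).astype(int)   # toy labels: 1 = "bike", 0 = "not a bike"

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

clf = svm.SVC(kernel="rbf", C=1.0)        # which kernel and C to use was part of the craft
clf.fit(X_train, y_train)
print("Held-out accuracy:", clf.score(X_test, y_test))
```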

And then there were also feature detectors – algorithms that found salient features for you, helping you build your own definitions of objects. The SIFT and SURF algorithms deserve Oscars for what they did in this respect back in the day.
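Using one of these detectors is straightforward. A quick sketch with SIFT, assuming a reasonably recent opencv-python build (which exposes it as cv2.SIFT_create) and an illustrative image:

```python
# Let a feature detector find the salient points for you instead of
# defining them by hand. The filename is an illustrative assumption.
import cv2

img = cv2.imread("my_bike.jpg", cv2.IMREAD_GRAYSCALE)

sift = cv2.SIFT_create()
keypoints, descriptors = sift.detectAndCompute(img, None)

print(f"SIFT found {len(keypoints)} salient points to build object definitions from")
```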

Probably my favourite algorithm of all time is the Viola-Jones Face Detection algorithm. It is ingenious in its simplicity and, back in 2001, it allowed face detection (among other things) to be performed in real time for the first time. That was a big breakthrough in those days. You could use it to detect where the faces were in an image and then focus your analysis on those regions for facial recognition tasks. Problem simplified!
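OpenCV still ships the trained Haar cascades, so a minimal Viola-Jones sketch looks something like this (the photo filename is an illustrative assumption):

```python
# Viola-Jones face detection with the Haar cascade bundled with OpenCV.
import cv2

face_cascade = cv2.CascadeClassifier(
    cv2.data.haarcascades + "haarcascade_frontalface_default.xml"
)

img = cv2.imread("group_photo.jpg")                # illustrative filename
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

# Returns one (x, y, w, h) bounding box per detected face.
faces = face_cascade.detectMultiScale(gray, scaleFactor=1.1, minNeighbors=5)
print(f"Found {len(faces)} face(s); run recognition only inside these boxes.")
```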

Anyway, all these algorithms were there to assist us in our tasks. When things worked, it was like watching a symphony playing in harmony: this algorithm coupled with that one, using this machine learning technique, the output then fed into that particular task, and so on. It was beautiful. I would go as far as to say that at times it was art.

But even with the assistance of all these algorithms, so much was still done manually, as I described above, and at the end of the day reality was still too much to handle. There were too many parameters to deal with. Machines and humans together struggled to get anything meaningful to work.

The Advent of Deep Learning

When DL arrived (circa 2012), it introduced the concept of end-to-end learning, where (in a nutshell) the machine is told to learn what to look for with respect to each specific class of object. It works out the most descriptive and salient features for each object all on its own. In other words, neural networks are told to discover the underlying patterns in classes of images. What is the definition of a bike? A car? A washing machine? The machine works all of this out for you. Wired magazine puts it this way:

If you want to teach a [deep] neural network to recognize a cat, for instance, you don’t tell it to look for whiskers, ears, fur, and eyes. You simply show it thousands and thousands of photos of cats, and eventually it works things out. If it keeps misclassifying foxes as cats, you don’t rewrite the code. You just keep coaching it.

The image below portrays this difference between feature extraction (using traditional CV) and end-to-end learning:

traditional-cv-and-dl

Deep learning works by setting up a neural network that can contain millions or even billions of parameters (the weights connecting its artificial neurons). These parameters are initially “blank”, let’s say. Then thousands and thousands of images are sent through the network, and slowly, over time, the parameters are adjusted accordingly.
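For a feel of how little of the old pipeline survives, here is a minimal end-to-end sketch in Keras. The tiny architecture, image size, and random stand-in data are all illustrative assumptions; a real project would feed in thousands of labelled photos:

```python
# End-to-end learning sketch: images in, labels out, no hand-crafted features.
import numpy as np
from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Conv2D(32, 3, activation="relu", input_shape=(64, 64, 3)),
    layers.MaxPooling2D(),
    layers.Conv2D(64, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(64, activation="relu"),
    layers.Dense(2, activation="softmax"),        # e.g. "bike" vs "not a bike"
])

model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

# Random arrays stand in for thousands of real, labelled photos.
x_train = np.random.rand(100, 64, 64, 3).astype("float32")
y_train = np.random.randint(0, 2, size=(100,))

# The network adjusts its own parameters; no pixel-level "bike definition" required.
model.fit(x_train, y_train, epochs=2, batch_size=32)
```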

Previously, we had to adjust parameters like these ourselves, in one way or another (and not inside a neural network), but we could only handle hundreds or perhaps thousands of them. We didn’t have the means to manage more.

So, deep learning has given us the possibility to deal with much, much more complex tasks. It has truly been a revolution for AI. The xkcd comic above is no longer relevant. That problem has been pretty much solved.

The Lack of Creativity in DL

Like I said, now when we have a problem to solve, we throw data at a neural network and then get the machine to work out how to solve the problem – and that’s pretty much it! The long and creative computer vision pipelines of algorithms and tasks are gone. We just use deep learning. There are really only two bottlenecks that we have to deal with: the need for data and time for training. If you have these (and money to pay for the electricity required to power your machines), you can do magic.

(In this article of mine I describe when traditional computer vision techniques still do a better job than deep learning – however, the art is dying out).

Sure, there are still many things you have control over when opting for a deep neural network solution, e.g. the number of layers and, of course, hyper-parameters such as the learning rate, batch size, and number of epochs. But once you get these more or less right, further tuning has diminishing returns.
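To illustrate what “tuning” often amounts to now, here is a small sketch that simply loops over a few values of those remaining knobs and watches the score. The tiny model and random data are illustrative assumptions:

```python
# "Tuning" in the DL era: try a few learning rates and batch sizes, keep the best.
import numpy as np
from tensorflow.keras import layers, models, optimizers

x = np.random.rand(200, 64, 64, 3).astype("float32")   # stand-in for real photos
y = np.random.randint(0, 2, size=(200,))

def build_model(learning_rate):
    model = models.Sequential([
        layers.Conv2D(16, 3, activation="relu", input_shape=(64, 64, 3)),
        layers.MaxPooling2D(),
        layers.Flatten(),
        layers.Dense(2, activation="softmax"),
    ])
    model.compile(optimizer=optimizers.Adam(learning_rate=learning_rate),
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model

# The remaining "creative" choices are largely reduced to numbers like these.
for lr in (1e-2, 1e-3, 1e-4):
    for batch_size in (16, 32):
        history = build_model(lr).fit(x, y, epochs=2, batch_size=batch_size, verbose=0)
        print(lr, batch_size, round(history.history["accuracy"][-1], 3))
```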

You also have to choose the neural network architecture that best suits your task: convolutional, generative, recurrent, and the like. We more or less know, however, which architecture works best for which task.

Let me put it to you this way: creativity has been eliminated from AI to such an extent that there are now automatic tools available to solve your problems using deep learning. AutoML by Google is my favourite of these. A person with no background in AI or computer vision can use these tools with ease and get very impressive results. They just need to throw enough data at the thing and the tool works the rest out for them automatically.

I dunno, but that feels kind of boring to me.

Maybe I’m wrong. Maybe that’s just me. I’m still proud to be a computer vision expert but it seems that a lot of the fun has been sucked out of it.

However, the results that we get from deep learning are not boring at all! No way. Perhaps I should stop complaining, then.
