Also Andisearch had it clear that you have to go with the car. A humble free AI from a two person startup with better results as AIs from big corps, as shown several times in the past.
But i don’t think we can conclude much from this test. Gemini and Andisearch could easily fail a similar test but others fail.
I think the important take away here is to remeber that these AIs are not cognitive and cannot reason.
The more this space evolves the more it is central that we remeber that there is a huge difference between simulating inteligence and actual intelligence. Tech companies are getting pretty good at simulating intelligence and they have an economic interest in fooling people into believing it is actual intelligence.
I just tried this with the following services; Grok, Perplexity, Le chat, Lumo (Proton), ChatGPT and Gemini
All of them told be the pros a cons of each and concluded that walking would be best.
Except Gemini. It told me that unless i was expecting to carry the car i should drive there. You win this round Google.
I saw someone else did the same thing and IIRC a Chinese one also passed the test.
Also Andisearch had it clear that you have to go with the car. A humble free AI from a two person startup with better results as AIs from big corps, as shown several times in the past.
Did not know that one.
But i don’t think we can conclude much from this test. Gemini and Andisearch could easily fail a similar test but others fail. I think the important take away here is to remeber that these AIs are not cognitive and cannot reason.
The more this space evolves the more it is central that we remeber that there is a huge difference between simulating inteligence and actual intelligence. Tech companies are getting pretty good at simulating intelligence and they have an economic interest in fooling people into believing it is actual intelligence.