This feels like a weird nostalgia pull for a PC gaming site, but think back to the first time you really struggled in a Pokémon game—I bet the bleating of your caught companion's dwindling health bar still makes your palms sweat. Well, it turns out Gemini starts to make questionable choices when its Pokémon team is on the ropes too.
While bigging up the Gemini 2.X model family in their latest report, a surprising case study—namely, the Twitch channel . This project comes from , an engineer unaffiliated with Google. However, during the AI's two runs through Pokémon Blue (going with Squirtle as its starter Pokémon both times), the Gemini team at DeepMind observed an interesting phenomenon they describe in the appendix as 'Agent Panic'.
Basically, as soon as things start to look a bit dicey, the AI agent attempts to get the heck out of Dodge. When party is either low in health or Power Points, the team observed, "model performance appears to correlate with a qualitatively observable degradation in the model’s reasoning capability – for instance, completely forgetting to use the pathfinder tool in stretches of gameplay while this condition persists."
Due to this (plus a fixation on a hallucinated Tea item that exists in the remake but not the original 90s game), it took the AI agent over 813 hours to finish Pokémon Blue for the first time. After some tweaking by Zhang, the AI agent shaved off hundreds of hours from its second run through…clocking in at a playtime of 406.5 hours.
While playing and replaying these games definitely made these games feel expansive in my youth, it's worth noting that the main story of Pokémon [[link]] Blue can be completed in about 26 hours . So, no, Gemini is not very good at playing a children's video game that is now more than a quarter of a century old.
While I enjoy this report's cracking scatter graphs charting the AI's lengthy progress towards beating the Elite Four, I'm less enthused by many other aspects of this exercise. For one, AI agents playing videogames in an attempt to benchmark their abilities just fills me with existential despair—why make anything if a robot is just going to chew it up and spit it out again? All of that also goes without saying just how little these 'AI benchmarking' attempts actually tell us (though ).
GameAddict5058
I appreciate the themed slot games, especially those based on movies and TV shows. They make the gaming experience more engaging and immersive. The combination of storyline, visuals, and bonus features makes each game feel unique. The payout process is generally smooth and reliable, though occasionally it takes longer than expected. Overall, I feel confident that my winnings are safe and will be credited properly.
SpinQueen2950
I appreciate the themed slot games, especially those based on movies and TV shows. They make the gaming experience more engaging and immersive. The combination of storyline, visuals, and bonus features makes each game feel unique. The payout process is generally smooth and reliable, though occasionally it takes longer than expected. Overall, I feel confident that my winnings are safe and will be credited properly. Sometimes I wish there were more ways to earn rewards through loyalty programs or frequent player bonuses. Adding seasonal events or special challenges could enhance the excitement even further.
SlotMaster1680
The progressive jackpots are thrilling, and it's exciting to watch the jackpot amounts grow as more players spin the reels. I hope they add even more jackpot slots because it adds a lot of excitement to the gameplay. I appreciate the themed slot games, especially those based on movies and TV shows. They make the gaming experience more engaging and immersive. The combination of storyline, visuals, and bonus features makes each game feel unique. Sometimes I wish there were more ways to earn rewards through loyalty programs or frequent player bonuses. Adding seasonal events or special challenges could enhance the excitement even further.