o3 Beats a Master-Level Geoguessr Player—Even with Fake EXIF Data
In Which I Try to Maintain Human Supremacy for a Bit Longer
Hasnain says:
This is going to give me nightmares as I sleep because what the heck, man
“So to put a bow on this:
The o3 model isn’t smoke and mirrors, tricking us by only using EXIF data. It’s at a comparable Geoguessr skill level to Master I or better players now (at least according to my own ~20 or so rounds of testing).
Humans still hold a big edge in decision time—most of my guesses were < 2 min, o3 often took > 4 min.”
Spoofing EXIF data doesn’t throw off the model.
Whether you view this as dystopian or as a technological marvel - or both - you can’t claim it’s a parlor trick.”
Posted on 2025-04-30T06:32:53+0000