AI Benchmarking Problems: Why Scores Don’t Mean Much
I remember when headlines screamed that an AI “passed” a medical licensing exam and the internet collectively sighed: either “we’re doomed” or…
I remember the moment I first read about the celebrity chatbots controversy: a mix of disbelief, irritation, and a dash of curiosity…
I remember the first time a chatbot felt like more than lines of code — it was creepy and oddly compelling. That…
I still remember the mix of curiosity and relief I felt reading about Apertus — a new, publicly released effort from researchers…
I want to pull up a chair and walk you through something I’ve been thinking about a lot: the AI boom that…
Over coffee the other day I found myself thinking about how fast synthetic media has exploded into our feeds. China’s recent push…