Ex-Google DeepMind researcher warns benchmarks won’t save us
Former DeepMind researcher Lun Wang warns that current AI benchmarks assume incremental progress and may miss new, strategic risks, and he urges the development of self‑evolving evaluation methods.