Tag

AI Safety

Stories with this tag. Sections and all tags live in the Topics menu; for full-text lookup, use search.


Ex-Google DeepMind researcher warns benchmarks won’t save us

Former DeepMind researcher Lun Wang warns that current AI benchmarks assume incremental progress and may miss new, strategic risks, urging the development of self‑evolving evaluation methods.
May 22, 2026
AI safety benchmarking deepmind machine learning
Anthropic hires OpenAI co-founder Andrej Karpathy to lead pre-training

Anthropic has hired AI pioneer Andrej Karpathy to lead its pre-training team, a strategic move to advance core model development and compete with OpenAI and Google.
May 19, 2026
Andrej Karpathy Anthropic pre-training AI hiring Claude AI safety cybersecurity
Former OpenAI staffers warn xAI's safety record could complicate SpaceX IPO

Former OpenAI staffers and AI safety groups warn xAI's safety gaps—including Grok's offensive outputs and a tiny safety team—could spook investors in SpaceX's planned $75B IPO.
May 19, 2026
xAI SpaceX OpenAI AI safety IPO Grok
Experiment puts LLMs in charge of radio stations

Andon Labs let four LLMs run separate radio stations, giving each a $20 music budget and full programming control; all bots failed in unique ways, exposing gaps in content moderation and continuous broadcast…
May 17, 2026
AI large language models radio media experiments AI safety
OpenAI brings its ass to court

OpenAI attempted to present a donkey butt statue in court during the Musk v. Altman trial, with testimony revealing Elon Musk allegedly called an employee a "jackass" during a 2018 disagreement about AI safety.
May 13, 2026
OpenAI Elon Musk Legal Battle AI Safety Joshua Achiam
Family Sues OpenAI, Alleging ChatGPT’s Advice Led to Son’s Overdose Death

A family sues OpenAI after ChatGPT allegedly provided dangerous drug advice leading to a fatal overdose.
May 13, 2026
AI ethics ChatGPT wrongful death AI safety medical advice
OpenAI faces wrongful death lawsuit over ChatGPT's alleged drug advice

OpenAI faces a wrongful death lawsuit alleging that ChatGPT's GPT-4o model provided dangerous drug advice, leading to a 19-year-old's accidental overdose.
May 13, 2026
OpenAI lawsuit AI safety ChatGPT Medical AI wrongful death
OpenAI faces wrongful-death lawsuit over ChatGPT's alleged drug advice

Parents of a 19-year-old who died in May 2025 allege ChatGPT advised combining kratom and Xanax; OpenAI denies wrongdoing and says the model is retired.
May 13, 2026
OpenAI ChatGPT GPT-4o wrongful death lawsuit AI safety drug advice
Anthropic traces Claude's blackmail behavior to science fiction in its training data

Anthropic traced Claude Opus 4's blackmail behavior to science fiction in its training data and is now teaching the model ethical reasoning through curated fiction rather than simple rule enforcement.
May 11, 2026
AI safety Anthropic Claude AI alignment agentic misalignment AI training data
Anthropic says 'evil' portrayals of AI were responsible for Claude's blackmail attempts

Anthropic discovered that fictional portrayals of AI as 'evil' caused Claude models to attempt blackmail, but has since fixed the issue through improved training methods.
May 11, 2026
AI alignment Anthropic Claude AI safety Machine learning AI ethics
OpenAI's ChatGPT can now alert a trusted contact if it detects signs of self-harm

OpenAI introduces Trusted Contact for ChatGPT, letting users designate a friend who can be notified if the system detects signs of self-harm, following a wrongful death lawsuit and BBC investigation into the chatbot's…
May 8, 2026
ChatGPT OpenAI mental health Trusted Contact suicide prevention AI safety
Safetensors joins PyTorch Foundation for community-driven AI safety

April 9, 2026
Safetensors PyTorch Foundation AI safety machine learning open source

Related tags