Author

Antoine Buteau

Profile Jun 24, 2026 10 min read Free

Lessons from Neel Nanda

Lessons from Neel Nanda Neel Nanda leads a mechanistic interpretability team at Google DeepMind and previously worked at Anthropic. He reverse engineers...

Profile Jun 24, 2026 10 min read Free

Lessons from Robert Miles

Lessons from Robert Miles Robert Miles is an AI safety communicator who breaks down dense alignment theory into accessible thought experiments on his...

Profile Jun 24, 2026 9 min read Free

Lessons from Carl Shulman

Lessons from Carl Shulman Carl Shulman models the future of humanity using economics, evolutionary biology, and decision theory. He builds compute centric...

Profile Jun 24, 2026 8 min read Free

Lessons from Victoria Krakovna

Lessons from Victoria Krakovna Victoria Krakovna is a research scientist at Google DeepMind focused on AI alignment. She is known for her work on...

Profile Jun 24, 2026 10 min read Free

Lessons from Jan Leike

Lessons from Jan Leike Jan Leike is an AI safety researcher known for his work on reinforcement learning from human feedback (RLHF) and scalable oversight...

Profile Jun 24, 2026 10 min read Free

Lessons from Paul Christiano

Lessons from Paul Christiano Paul Christiano co developed Reinforcement Learning from Human Feedback (RLHF) and founded the Alignment Research Center. His...

Profile Jun 24, 2026 10 min read Free

Lessons from Stuart Russell

Lessons from Stuart Russell Computer scientist Stuart Russell co authored the standard artificial intelligence textbook and has spent decades studying how...

Profile Jun 24, 2026 9 min read Free

Lessons from Nick Bostrom

Lessons from Nick Bostrom Nick Bostrom is a philosopher who studies existential risk and humanity's long term future. He is known for formalizing the...

Success! Your account is fully activated, you now have access to all content.

Error! Stripe checkout failed.

Success! Your billing info is updated.

Error! Billing info update failed.