Author
Antoine Buteau

Antoine Buteau

Lessons from Neel Nanda

Lessons from Neel Nanda Neel Nanda leads a mechanistic interpretability team at Google DeepMind and previously worked at Anthropic. He reverse engineers...

Lessons from Robert Miles

Lessons from Robert Miles Robert Miles is an AI safety communicator who breaks down dense alignment theory into accessible thought experiments on his...

Lessons from Carl Shulman

Lessons from Carl Shulman Carl Shulman models the future of humanity using economics, evolutionary biology, and decision theory. He builds compute centric...

Lessons from Victoria Krakovna

Lessons from Victoria Krakovna Victoria Krakovna is a research scientist at Google DeepMind focused on AI alignment. She is known for her work on...

Lessons from Jan Leike

Lessons from Jan Leike Jan Leike is an AI safety researcher known for his work on reinforcement learning from human feedback (RLHF) and scalable oversight...

Lessons from Paul Christiano

Lessons from Paul Christiano Paul Christiano co developed Reinforcement Learning from Human Feedback (RLHF) and founded the Alignment Research Center. His...

Lessons from Stuart Russell

Lessons from Stuart Russell Computer scientist Stuart Russell co authored the standard artificial intelligence textbook and has spent decades studying how...

Lessons from Nick Bostrom

Lessons from Nick Bostrom Nick Bostrom is a philosopher who studies existential risk and humanity's long term future. He is known for formalizing the...
You've successfully subscribed to Antoine Buteau
Great! Next, complete checkout to get full access to all premium content.
Welcome back! You've successfully signed in.
Unable to sign you in. Please try again.
Success! Your account is fully activated, you now have access to all content.
Error! Stripe checkout failed.
Success! Your billing info is updated.
Error! Billing info update failed.