Stuart Armstrong, Aligned AI’s chief research officer, talks to the London Futurists about the power of AI, the challenge of alignment, and how to ensure our future is full of human flourishing.
What is Alignment?
Algorithms are shaping the present and will shape the future ever more strongly. It is crucially important that these powerful algorithms be aligned – that they act in the interests of their designers, their users, and humanity as a whole. Failure to align them could lead to catastrophic results.
Our long experience in the field of AI safety has identified the key bottleneck for solving alignment: concept extrapolation.
What is Concept Extrapolation?
Algorithms typically fail when confronted with new situations – when they go "out of distribution". No training data can cover every unexpected situation, so an AI will need to safely extend its key concepts and goals to novel contexts, as well as – or better than – humans do.
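A minimal sketch of the out-of-distribution failure mode described above (a hypothetical illustration, not Aligned AI's method): a straight-line model fitted where the data looks nearly linear performs well in-distribution, then fails badly when asked to extrapolate far outside its training range.

```python
import numpy as np

rng = np.random.default_rng(0)

# True relationship is quadratic, but we only observe x in [0, 1],
# where it looks almost linear.
x_train = rng.uniform(0.0, 1.0, 200)
y_train = x_train ** 2

# Fit a straight line -- adequate on the training distribution.
slope, intercept = np.polyfit(x_train, y_train, 1)

def predict(x):
    return slope * x + intercept

# Small error inside the training range...
in_dist_error = abs(predict(0.5) - 0.5 ** 2)

# ...but a huge error out of distribution, at x = 10.
out_dist_error = abs(predict(10.0) - 10.0 ** 2)

print(f"in-distribution error:     {in_dist_error:.3f}")
print(f"out-of-distribution error: {out_dist_error:.1f}")
```

The model has no notion of the underlying concept (the quadratic law), only a surface fit to the data it saw – which is why naively trained systems need something like concept extrapolation to behave sensibly in novel situations.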
This is concept extrapolation, explained in more detail in this sequence. Solving the concept extrapolation problem is both necessary and almost sufficient for solving the whole AI alignment problem.
Aligned AI is a benefit corporation dedicated to solving the alignment problem – for all types of algorithms and AIs, from simple recommender systems to hypothetical superintelligences. The fruits of this research will then be available to companies building AI, to ensure that their algorithms serve the best interests of their users and of the companies themselves, and do not cause them legal, reputational, or ethical problems.