EleutherAI

Explore our research

Projects

How do properties of models emerge and evolve over the course of training?

As models get smarter, humans won't always be able to independently check if a model's claims are true or false. We aim to circumvent this issue by directly eliciting latent knowledge (ELK) inside the model’s activations.

Training LLMs

EleutherAI has trained and released many powerful open source LLMs.

Recent Publications

Featured

Feb 12, 2024

arXiv

Suppressing Pink Elephants with Direct Principle Feedback

Feb 12, 2024

arXiv

Feb 12, 2024

arXiv

Feb 6, 2024

arXiv

Neural networks learn moments of increasing order

Feb 6, 2024

arXiv

Feb 6, 2024

arXiv

Dec 17, 2023

NeurIPS Workshop (Attributing Model Behavior at Scale)

Sparse Autoencoders Find Highly Interpretable Features in Language Models

Dec 17, 2023

NeurIPS Workshop (Attributing Model Behavior at Scale)

Dec 17, 2023

NeurIPS Workshop (Attributing Model Behavior at Scale)

Dec 16, 2023

ICLR

Quality-Diversity through AI Feedback

Dec 16, 2023

ICLR

Dec 16, 2023

ICLR

Dec 16, 2023

ICLR

ReLoRA: High-Rank Training Through Low-Rank Updates

Dec 16, 2023

ICLR

Dec 16, 2023

ICLR

News

Featured

Jul 7, 2025

Summer of Open Science

Jul 7, 2025

Jun 15, 2025

Common Pile v0.1

Jun 15, 2025

Jun 12, 2025

EvalEval Coallition

Jun 12, 2025

The EleutherAI Institute

Explore our research

Recent Publications

News