EleutherAI
Datasheet Stella Biderman 13/01/2022

Datasheet for the Pile

This datasheet describes the Pile, an 825 GiB dataset of human-authored text compiled by EleutherAI for use in large-scale language modeling.



contact@eleuther.ai

Copyright EleutherAI 2023