0
Skip to Content
EleutherAI
EleutherAI
About
Community
Staff
Research
Language Modeling
Interpretability
Alignment
Papers
Releases
Blog
EleutherAI
EleutherAI
About
Community
Staff
Research
Language Modeling
Interpretability
Alignment
Papers
Releases
Blog
Folder: About
Back
Community
Staff
Folder: Research
Back
Language Modeling
Interpretability
Alignment
Papers
Releases
Blog
Datasheet Stella Biderman 13/01/2022 Datasheet Stella Biderman 13/01/2022

Datasheet for the Pile

This datasheet describes the Pile, a 825 GiB dataset of human-authored text compiled by EleutherAI for use in large-scale language modeling.

Read More

About

Research

Language Modeling

Interpretability

Alignment

Other Modalities

Releases

Blog

contact@eleuther.ai

Copyright EleutherAI 2023