Common Pile v0.1
We released the Common Pile v0.1, 8TB of public domain and openly licensed text, for training large language models and the Comma v0.1 models trained on it.
We released the Common Pile v0.1, 8TB of public domain and openly licensed text, for training large language models and the Comma v0.1 models trained on it.