Polyglot

The anglocentric nature of Western AI research means that the overwhelming majority of resources for training LLMs have gone into monolingual English models, with monolingual Chinese and generically “massively multilingual” models taking up most of the remaining effort.

The Polyglot Project focuses on extending the benefits of large language models to cultures and contexts not well served by the current state of affairs, as well as studying best practices for doing so. This work includes training LLMs in languages other than English and Chinese, improving tools for non-English data documentation, curation, and analysis, culturally-aware research on ethics and bias in non-English LLMs, and more.

This project originated in the BigScience Research Workshop, an international collaboration to train multilingual language models, and in the volunteer efforts of many Korean NLP researchers interested in promoting access to NLP technologies. Many EleutherAI members participated in BigScience and played key roles in designing, developing, and evaluating models such as BLOOM and mT0.

Most recently, we released the Polyglot-Ko model series. These are monolingual Korean language models with 1.3B, 3.8B, and 5.8B parameters, the largest of which is the world's most powerful publicly available Korean language model. We are excited to continue to train and publicly release non-English language models.

Releases

Publications
