EMO: AllenAI pretrains a mixture of experts so modularity emerges from the data
AllenAI releases EMO, a 1B-active / 14B-total MoE whose experts self-organize into domain-level modules; activating just 12.5% of the experts retains near-full-model performance.