AI2’s new model aims to be open and powerful yet cost-effective
The Allen Institute for AI (AI2)'s new mixture-of-experts model outperforms other 1B-parameter models while remaining cost-effective.
Nathan Lambert, AI2 research scientist, posted on X (formerly Twitter) that OLMoE will “help policy…this can be a starting point as academic H100 clusters come online.” AI2 said that when designing OLMoE it chose fine-grained routing across 64 small experts, with only eight activated at a time.
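To make the routing idea concrete, here is a minimal sketch of top-k expert selection for a single token, using the 64-expert, eight-active figures AI2 cites. It is not OLMoE's actual code: the function names, the random router scores, and the choice to renormalize the selected weights so they sum to 1 are all assumptions made for illustration.

```python
import math
import random

NUM_EXPERTS = 64  # total small experts, per the article
TOP_K = 8         # experts activated at a time, per the article


def softmax(scores):
    """Turn raw router scores into probabilities."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_token(router_logits, top_k=TOP_K):
    """Return (expert_index, weight) pairs for the top_k highest-scoring experts.

    Renormalizing the kept weights is an assumption of this sketch,
    not a confirmed detail of OLMoE.
    """
    probs = softmax(router_logits)
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    chosen = ranked[:top_k]
    kept = sum(probs[i] for i in chosen)
    return [(i, probs[i] / kept) for i in chosen]


# Hypothetical router scores for one token; a real model would compute
# these from the token's hidden state with a learned linear layer.
rng = random.Random(0)
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_EXPERTS)]
selected = route_token(logits)
print(len(selected), "of", NUM_EXPERTS, "experts active for this token")
```

Only the eight selected experts would run for this token, which is why a model of this kind can hold many more parameters than it uses on any single forward pass.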
Or read this on VentureBeat