AI21 Labs’ new AI model can handle more context than most
Increasingly, the AI industry is moving toward generative AI models with longer contexts. But models with large context windows tend to be
Ori Goshen, the CEO of AI startup AI21 Labs, asserts that this doesn’t have to be the case, and his company is releasing a generative model to prove it. But some of the early incarnations, including Mamba, an open-source model from Princeton and Carnegie Mellon researchers, can handle larger inputs than their transformer-based equivalents while outperforming them on language generation tasks. AI21 Labs’ model doesn’t have safeguards to prevent it from generating toxic text or mitigations to address potential bias; a fine-tuned, ostensibly “safer” version will be made available in the coming weeks.
Or read this on TechCrunch