Get the latest tech news

Salesforce takes aim at ‘jagged intelligence’ in push for more reliable AI

Salesforce unveils groundbreaking AI research tackling "jagged intelligence," introducing new benchmarks, models, and guardrails to make enterprise AI agents more intelligent, trusted, and consistently reliable for business use.

“While LLMs may excel at standardized tests, plan intricate trips, and generate sophisticated poetry, their brilliance often stumbles when faced with the need for reliable and consistent task execution in dynamic, unpredictable enterprise environments,” said Silvio Savarese, Salesforce’s Chief Scientist and Head of AI Research, during a press conference preceding the announcement. “Recognizing that current AI models often fall short in reflecting the intricate demands of enterprise environments, we’ve introduced CRMArena: a novel benchmarking framework meticulously designed to simulate realistic, professionally grounded CRM scenarios,” Savarese said. While the entire tech industry pursues ever-larger models with impressive raw capabilities, Salesforce’s focus on the consistency gap highlights a more nuanced approach to AI development — one that prioritizes real-world business requirements over academic benchmarks.

Get the Android app

Or read this on Venture Beat