Get the latest tech news
Deep Cogito goes big, releasing 4 new open source hybrid reasoning models with self-improving ‘intuition’
Arora explains this as a difference between searching for a path versus already knowing roughly where the destination lies.
As Deep Cogito’s blog post notes, the researchers “disincentivize the model from ‘meandering more’ to be able to arrive at the answer, and instead develop a stronger intuition for the right search trajectory for the reasoning process.” Deep Cogito CEO and co-founder Drishan Arora — previously a lead LLM engineer at Google — described the company’s long-term goal as building models that can reason and improve with each iteration, much like how AlphaGo refined its strategy through self-play. While many models simulate the calculation step-by-step and occasionally make unit conversion errors, Cogito 671B reflects internally, determines that 240 ÷ 80 = 3 hours, and correctly concludes that the train cannot arrive in time.
Or read this on Venture Beat