Get the latest tech news

Does Reasoning Emerge? Probabilities of Causation in Large Language Models


Recent advances in AI have been significantly driven by the capabilities of large language models (LLMs) to solve complex problems in ways that resemble human thinking. However, there is an ongoing debate about the extent to which LLMs are capable of actual reasoning. Central to this debate are two key probabilistic concepts that are essential for connecting causes to their effects: the probability of necessity (PN) and the probability of sufficiency (PS). This paper introduces a framework that is both theoretical and practical, aimed at assessing how effectively LLMs are able to replicate real-world reasoning mechanisms using these probabilistic measures. By viewing LLMs as abstract machines that process information through a natural language interface, we examine the conditions under which it is possible to compute suitable approximations of PN and PS. Our research marks an important step towards gaining a deeper understanding of when LLMs are capable of reasoning, as illustrated by a series of math examples.

View PDFHTML (experimental) Abstract:Recent advances in AI have been significantly driven by the capabilities of large language models (LLMs) to solve complex problems in ways that resemble human thinking. This paper introduces a framework that is both theoretical and practical, aimed at assessing how effectively LLMs are able to replicate real-world reasoning mechanisms using these probabilistic measures. By viewing LLMs as abstract machines that process information through a natural language interface, we examine the conditions under which it is possible to compute suitable approximations of PN and PS.

Get the Android app

Or read this on Hacker News

Read more on:

Photo of reasoning

reasoning

Photo of Causation

Causation

Photo of probabilities

probabilities

Related news:

News photo

Baidu's Improving Retrieval Augmented Language Model with Self-Reasoning

News photo

OpenAI is reportedly working on more advanced AI models capable of reasoning and ‘deep research’

News photo

ChatGPT Fails to Understand Causation