Machine understanding and deep learning representation

Tamir, Michael; Shech, Elay

doi:10.1007/s11229-022-03999-y

Cited by 5 publications

(5 citation statements)

References 37 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Although our focus is on scientific understanding in the natural sciences (e.g., physics), we anticipate that our framework can be applied to other scientific disciplines as well. We break with the traditional view (Dellsén, 2020;Wilkenfeld, 2013;Searle, 1980;Johnson-Laird, 2010;Nersessian, 1992) by arguing that understanding should be conceptualized in terms of abilities rather than internal mechanics (Marcus, 2018;Chollet, 2017) or representations (Tamir & Shech, 2023;Wilkenfeld, 2013). Specifically, we contend that scientific understanding is a skill-based capability that relies on an agent's ability to perform specific actions, rather than a subjective mental state.…”

Section: Introductionmentioning

confidence: 85%

“…We maintain that the evaluation of understanding in any agent, including artificial ones like LLMs, should follow the same principles used to assess human scientific understanding -that is, it should be based on their abilities to perform relevant tasks. Incidentally, we are not the only ones who relate understanding to an ability (see Krenn et al, 2022;Tamir & Shech, 2023). For example, Tamir and Shech (2023) have argued that practical abilities (such as reliable and robust task performance) can be seen as key factors indicative of understanding in the context of deep learning.…”

Section: Scientific Understanding As An Ability: the Behavioral Conce...mentioning

confidence: 99%

“…Incidentally, we are not the only ones who relate understanding to an ability (see Krenn et al, 2022;Tamir & Shech, 2023). For example, Tamir and Shech (2023) have argued that practical abilities (such as reliable and robust task performance) can be seen as key factors indicative of understanding in the context of deep learning. While we think this is a good start, we argue that a more comprehensive and rigorous evaluation of understanding as an ability is needed.…”

Section: Scientific Understanding As An Ability: the Behavioral Conce...mentioning

confidence: 99%

See 2 more Smart Citations

Towards a Benchmark for Scientific Understanding in Humans and Machines

Barman,

Caron,

Claassen

et al. 2024

Minds & Machines

View full text Add to dashboard Cite

Scientific understanding is a fundamental goal of science. However, there is currently no good way to measure the scientific understanding of agents, whether these be humans or Artificial Intelligence systems. Without a clear benchmark, it is challenging to evaluate and compare different levels of scientific understanding. In this paper, we propose a framework to create a benchmark for scientific understanding, utilizing tools from philosophy of science. We adopt a behavioral conception of understanding, according to which genuine understanding should be recognized as an ability to perform certain tasks. We extend this notion of scientific understanding by considering a set of questions that gauge different levels of scientific understanding, covering information retrieval, the capability to arrange information to produce an explanation, and the ability to infer how things would be different under different circumstances. We suggest building a Scientific Understanding Benchmark (SUB), formed by a set of these tests, allowing for the evaluation and comparison of scientific understanding. Benchmarking plays a crucial role in establishing trust, ensuring quality control, and providing a basis for performance evaluation. By aligning machine and human scientific understanding we can improve their utility, ultimately advancing scientific understanding and helping to discover new insights within machines.

show abstract

Section: Introductionmentioning

confidence: 85%

Section: Scientific Understanding As An Ability: the Behavioral Conce...mentioning

confidence: 99%

Section: Scientific Understanding As An Ability: the Behavioral Conce...mentioning

confidence: 99%

See 1 more Smart Citation

Towards a Benchmark for Scientific Understanding in Humans and Machines

Barman,

Caron,

Claassen

et al. 2024

Minds & Machines

View full text Add to dashboard Cite

show abstract

“…based on well-defined assumptions). Hence, it is largely unrelated to the literature that investigates the sociological basis for the trust in ML models, or analogies between ML and human behaviours [see e.g., Clark and Khosrowi (2022), Duede (2022), Tamir and Shech (2023)]. Further comparisons with the literature are provided in the main text.…”

Section: Introductionmentioning

confidence: 99%

Reliability and Interpretability in Science and Deep Learning

Scorzato

2024

Minds & Machines

View full text Add to dashboard Cite

In recent years, the question of the reliability of Machine Learning (ML) methods has acquired significant importance, and the analysis of the associated uncertainties has motivated a growing amount of research. However, most of these studies have applied standard error analysis to ML models—and in particular Deep Neural Network (DNN) models—which represent a rather significant departure from standard scientific modelling. It is therefore necessary to integrate the standard error analysis with a deeper epistemological analysis of the possible differences between DNN models and standard scientific modelling and the possible implications of these differences in the assessment of reliability. This article offers several contributions. First, it emphasises the ubiquitous role of model assumptions (both in ML and traditional science) against the illusion of theory-free science. Secondly, model assumptions are analysed from the point of view of their (epistemic) complexity, which is shown to be language-independent. It is argued that the high epistemic complexity of DNN models hinders the estimate of their reliability and also their prospect of long term progress. Some potential ways forward are suggested. Thirdly, this article identifies the close relation between a model’s epistemic complexity and its interpretability, as introduced in the context of responsible AI. This clarifies in which sense—and to what extent—the lack of understanding of a model (black-box problem) impacts its interpretability in a way that is independent of individual skills. It also clarifies how interpretability is a precondition for a plausible assessment of the reliability of any model, which cannot be based on statistical analysis alone. This article focuses on the comparison between traditional scientific models and DNN models. However, Random Forest (RF) and Logistic Regression (LR) models are also briefly considered.

show abstract

“…Утверждая революционный характер влияния на общество и человека AI и Big Data, нельзя не отдавать себе отчета в том, что понятия искусственного интеллекта, глубокого обучения (deep learning), нейронных сетей и т. п. в значительной мере были и еще будут довольно долго оставаться метафорами. Хотя алгоритмы глубокого обучения являются едва ли не ключевыми в создании AI и использовании Big Data, но методы понимания текста (и тем более контекста) основаны на механическом переборе словаря, который должен постоянно расширяться (Tamir & Shech, 2023). В мозге человека, способном действительно понимать, 85 миллиардов нейронов, а на настоящий момент наиболее успешная программа нейронной сети, призванная смоделировать функции мозга, способна воссоздать мозг примитивного червя, который состоит из чуть более чем 300 нейронов.…”

unclassified

Искусственный интеллект и технологии Big Data (больших данных)

Бажанов

2023

Философия-ВШЭ

View full text Add to dashboard Cite

show abstract

Machine understanding and deep learning representation

Cited by 5 publications

References 37 publications

Towards a Benchmark for Scientific Understanding in Humans and Machines

Towards a Benchmark for Scientific Understanding in Humans and Machines

Reliability and Interpretability in Science and Deep Learning

Искусственный интеллект и технологии Big Data (больших данных)

Contact Info

Product

Resources

About