Article The Turing Tests of today are mistaken

**C C** · Mar 24, 2024 10:36 PM

https://iai.tv/articles/the-turing-tests..._auid=2020

INTRO: Companies like OpenAI try to show that AIs are intelligent by hyping their high scores in behavioural tests – an approach with roots in the Turing Test. But there are hard limits to what we can infer about intelligence by observing behaviour. To demonstrate intelligence, argues Raphaël Millière, we must stop chasing high scores and start uncovering the mechanisms underlying AI systems’ behaviour.

EXCERPTS: In theory, benchmarks should allow for rigorous and piecemeal evaluations of AI systems, helping foster broad consensus about their abilities. But in practice benchmarks face major challenges, which only get worse as AI systems progress. High scores on benchmarks do not always translate to good real-world performance in the target domain. This means benchmarks may fail to provide reliable evidence of what they are supposed to measure, which drives further division about how impressed we should be with current AI systems.

[...] This points to a broader concern about what benchmarks are really supposed to measure. A well-designed test should measure some particular skill or capacity, and good test performance should generalize to relevant real-world situations. However, common benchmarks used in AI research explicitly target nebulous capacities, such as “understanding” and “reasoning”. These constructs are abstract, multifaceted, and implicitly defined with reference to human psychology. But we cannot uncritically assume that a test designed for humans can be straightforwardly adapted to evaluate language models and remain valid as an assessment of the same capacity. Humans and machines may achieve similar performance on a task through very different means, and benchmark scores alone do not tell that story... (MORE - details)

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Research Modern AI systems have achieved Turing's vision, but not exactly how he hoped	C C	0	511	Dec 21, 2024 02:29 AM Last Post: C C
	Research Is the Turing Test dead?	C C	0	373	Dec 2, 2023 08:48 AM Last Post: C C
	Outing A.I.: Beyond the Turing Test	C C	0	829	Feb 27, 2015 04:26 AM Last Post: C C
	How Alan Turing played dumb to fool US intelligence	C C	0	928	Nov 30, 2014 09:30 PM Last Post: C C