Anthropocentric bias in language model evaluation

Rathkopf, Charles

Talk (non-conference) (Other)

FZJ-2025-05123

Anthropocentric bias in language model evaluation

Rathkopf, C. (Corresponding author)FZJ*

2025

Tübingen-Nancy Seminar on Philosophical Aspects of Computer Sciences, Berlin, Germany, 27 Nov 2025

Abstract: Evaluating the cognitive capacities of large language models (LLMs) requires overcoming not only anthropomorphic but also anthropocentric biases. This article identifies two types of anthropocentric bias that have been neglected: (i) overlooking how auxiliary factors can impede LLM performance despite competence, which we call auxiliary oversight, and (ii) dismissing LLM mechanistic strategies that differ from those of humans as not genuinely competent, which we call mechanistic chauvinism. Mitigating these biases necessitates an empirically-driven, iterative approach to mapping cognitive tasks to LLM-specific capacities and mechanisms, which can be done by supplementing carefully designed behavioral experiments with mechanistic studies.Paper coauthored with Raphaël Millière.

Contributing Institute(s):

Gehirn & Verhalten (INM-7)

Research Program(s):

5255 - Neuroethics and Ethics of Information (POF4-525) (POF4-525)

Appears in the scientific report 2025

Click to display QR Code for this record

The record appears in these collections:
Dokumenttypen > Präsentationen > Vorträge (nicht Konferenz)
Institutssammlungen > INM > INM-7
Workflowsammlungen > Öffentliche Einträge
Publikationsdatenbank

Datensatz erzeugt am 2025-12-09, letzte Änderung am 2026-02-20

Ähnliche Datensätze

Dieses Dokument bewerten:

(Bisher nicht rezensiert)

Zum persönlichen Korb hinzufügen
Export als Author List with IDs BibTeX (UTF-8), EndNote XML, EndNote Text, RIS, MARC, Print MARC, MARCXML, DC,
Request correction
Submit fulltext

Gast :: Anmelden JuSER
		Suchen		Absenden		Personalisieren Ihre Benachrichtigungen Ihre Körbe Ihre Suchanfragen		Hilfe