Introduction

Eyewitness identification research

Eyewitnesses to crimes may try to identify the perpetrator from an identification procedure. One commonly used procedure is a lineup (identity parade in the UK). A lineup comprises the police suspect and fillers, as shown in the illustration below. A lineup with an innocent suspect is a target-absent lineup, and a lineup with a guilty suspect is a target-present lineup. The eyewitness can make a filler ID from either lineup, identify the innocent suspect (from a target-absent lineup) or guilty suspect (from a target-present lineup), or reject the lineup (declare that the perpetrator is not in the lineup).

Receiver operating characteristic (ROC) analysis measures discriminability (the ability to distinguish innocent from guilty suspects), and confidence accuracy characteristic (CAC) and response-time accuracy characteristic (RAC) analysis measure positive predictive value (the likelihood the identified suspect is guilty). pyWitness allows researchers to run these analyses.

Theories about eyewitness identification have so far been signal detection-based. pyWitness allows researchers to compare these types of model fits.

Receiver operator characteristic (ROC)

There are pacakges in R and MATLAB that smooth ROCS and calculate and statistically compare pAUC values.

pROC: an open-source package for R and S+ to analyze and compare ROC curves

The ROC Toolbox: A MATLAB toolbox for analyzing ROCs derived from confidence ratings

Signal Detection Theory

sdtlu: an open-source package for R to run signal detection model fits on lineup data

Statistical tests

Uncertainties on ROCs and CACs

pAUC and associated statistical tests

Relative frequencies

Power analyses

Model fits

Independent observations

Integration

Ensemble

BEST-REST

Goals of pyWitness

Create a consistent set of analysis tools for any researchers to use for lineup data and other recognition memory data

Little as possible human interaction (to reduce data manipulation, transcribing values and other transformations that may create errors)

Place to store all relevant algorithms, fits, and methods in a reproducible and documented way

Bin and pivot raw data to rates (the processed data)

Plot and compare ROC, CAC, and RAC curves with relative frequencies per datapoint

Calculate pAUC and compute statistical comparisons

Perform statistical tests on CAC and RAC curves

Fit signal detection-based models to the processed data

Monte Carlo simulation of raw data

Create teaching aids to understand signal detection theory

Background to the pyWitness and the code

pyWitness has a flexible internal data format.

Internal data format

Data columns and allowed values
lineupSize	targetLineup	responseType	confidence
integer	“targetPresent”	“suspectId”	integer/float
	“targetAbsent”	“fillerId”
		“rejectId”
		“designateId”

Minimal example data file for an experiment

Example data file
participantId	lineupSize	targetLineup	responseType	confidence
1	6	targetPresent	suspectId	6
2	6	targetAbsent	rejectId	9
3	6	targetPresent	rejectId	1

Note

Other columns can be in the data file (e.g., labels for other experimental conditions or data, demographic data, or personal differenes).

Transforming data structures

Transforming data into pyWitness format can be time consuming and error prone. An example of transforming a data file is in the table below. We use the same structure as sdtlu.

Example data file transformation (sdtlu)
pyWitness	sdtlu
lineupSize	lineup_size
targetLineup	culprit_present
targetPresent	present
targetAbsent	absent
responseType	id_type
suspectId	suspect
fillerId	filler
rejectId	reject
confidence	conf_level

Data for showups

To perform an analysis of showup data, the data need to be organised differently compared to lineup data. rejectId for both targetAbsent and targetPresent showups need to given new confidence names. For example, if the data are measured on a three point confidence scale (1, 2 and 3) for previously experienced (“old”)items and for not previously experienced (“new”) items. The subjectId for both targetPresent and targetAbsent confidences need to be mapped to (4, 5, 6) and rejectId needs to be given (3, 2, 1).

Note

Confidence can be a verbal expression and needs to be mapped to a number. For example, confidence could be low (1), medium (2) or high (3).

Note

All plots and diagrams are made using pyWitness.