Early View
ARTICLE

From missing data to informative GPA predictions: Navigating selection process beliefs with the partial identifiability approach

Eduardo Alarcón-Bustamante

Corresponding Author

Eduardo Alarcón-Bustamante

Escuela de Psicología, Pontificia Universidad Católica de Chile, Santiago de Chile, Chile

Departamento de Evaluación Medición y Registro Educacional (DEMRE), Universidad de Chile, Santiago de Chile, Chile

Millennium Nucleus on Intergenerational Mobility: From Modelling to Policy (MOVI), Santiago de Chile, Chile

Interdisciplinary Laboratory of Social Statistics, Santiago de Chile, Chile

Correspondence

Eduardo Alarcón-Bustamante, Av. Vicuña Mackenna 4860, Macul, Santiago de Chile, Chile.

Email: [email protected]

Contribution: Conceptualization, ​Investigation, Funding acquisition, Writing - original draft, Writing - review & editing, Visualization, Validation, Methodology, Software, Formal analysis, Project administration, Resources, Supervision, Data curation

Search for more papers by this author
Jorge González

Jorge González

Millennium Nucleus on Intergenerational Mobility: From Modelling to Policy (MOVI), Santiago de Chile, Chile

Interdisciplinary Laboratory of Social Statistics, Santiago de Chile, Chile

Faculty of Mathematics, Pontificia Universidad Católica de Chile, Santiago de Chile, Chile

Contribution: Data curation, Supervision, Resources, Project administration, Software, Formal analysis, Methodology, Validation, Visualization, Writing - review & editing, Writing - original draft, Funding acquisition, ​Investigation, Conceptualization

Search for more papers by this author
David Torres Irribarra

David Torres Irribarra

Escuela de Psicología, Pontificia Universidad Católica de Chile, Santiago de Chile, Chile

Millennium Nucleus on Intergenerational Mobility: From Modelling to Policy (MOVI), Santiago de Chile, Chile

Interdisciplinary Laboratory of Social Statistics, Santiago de Chile, Chile

Contribution: Conceptualization, ​Investigation, Funding acquisition, Writing - original draft, Writing - review & editing, Visualization, Validation, Methodology, Software, Formal analysis, Project administration, Resources, Supervision, Data curation

Search for more papers by this author
Ernesto San Martín

Ernesto San Martín

Millennium Nucleus on Intergenerational Mobility: From Modelling to Policy (MOVI), Santiago de Chile, Chile

Interdisciplinary Laboratory of Social Statistics, Santiago de Chile, Chile

Faculty of Mathematics, Pontificia Universidad Católica de Chile, Santiago de Chile, Chile

The Economics School of Louvain, Université Catholique de Louvain, Brussels, Belgium

Contribution: Conceptualization, ​Investigation, Funding acquisition, Writing - original draft, Writing - review & editing, Visualization, Validation, Methodology, Software, Formal analysis, Project administration, Resources, Supervision, Data curation

Search for more papers by this author
First published: 24 December 2024

Abstract

The extent to which college admissions test scores can forecast college grade point average (GPA) is often evaluated in predictive validity studies using regression analyses. A problem in college admissions processes is that we observe test scores for all the applicants; however, we cannot observe the GPA of applicants who were not selected. The standard solution to tackle this problem has relied upon strong assumptions to identify the exact value of the regression function in the presence of missing data. In this paper, we present an alternative approach based on the theory of partial identifiability that considers a variety of milder assumptions to learn about the regression function. Using a university admissions dataset we illustrate how results can vary as a function of the assumptions that one is willing to make about the selection process.

CONFLICT OF INTEREST STATEMENT

The author declares that there is no conflict of interest.

DATA AVAILABILITY STATEMENT

The data that support the findings of this study are openly available in the GitHub repository of the first author at https://github.com/edalarconb/BJMSP2024.