My Account Log in

1 option

Machine Learning as a Tool for Hypothesis Generation / Jens Ludwig, Sendhil Mullainathan.

NBER Working papers Available online

View online
Format:
Book
Author/Creator:
Ludwig, Jens.
Contributor:
National Bureau of Economic Research.
Mullainathan, Sendhil.
Series:
Working Paper Series (National Bureau of Economic Research) no. w31017.
NBER working paper series no. w31017
Language:
English
Physical Description:
1 online resource: illustrations (black and white);
Place of Publication:
Cambridge, Mass. National Bureau of Economic Research 2023.
Summary:
While hypothesis testing is a highly formalized activity, hypothesis generation remains largely informal. We propose a systematic procedure to generate novel hypotheses about human behavior, which uses the capacity of machine learning algorithms to notice patterns people might not. We illustrate the procedure with a concrete application: judge decisions about who to jail. We begin with a striking fact: The defendant's face alone matters greatly for the judge's jailing decision. In fact, an algorithm given only the pixels in the defendant's mugshot accounts for up to half of the predictable variation. We develop a procedure that allows human subjects to interact with this black-box algorithm to produce hypotheses about what in the face influences judge decisions. The procedure generates hypotheses that are both interpretable and novel: They are not explained by demographics (e.g. race) or existing psychology research; nor are they already known (even if tacitly) to people or even experts. Though these results are specific, our procedure is general. It provides a way to produce novel, interpretable hypotheses from any high-dimensional dataset (e.g. cell phones, satellites, online behavior, news headlines, corporate filings, and high-frequency time series). A central tenet of our paper is that hypothesis generation is in and of itself a valuable activity, and hope this encourages future work in this largely "pre-scientific" stage of science.
Notes:
Print version record
March 2023.

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Library Catalog Using Articles+ Library Account