Case Study: Is There A Reproducibility Crisis In Finance?#

What is reproducibility?#

How the Turing Way defines reproducible research: https://the-turing-way.netlify.app/reproducible-research/overview/overview-definitions (Used under a CC-BY 4.0 license.) How the Turing Way defines reproducible research

Reproducibility Crisis#

Significance Testing#

Recently, various fields, especially in psychology and biomedical sciences, have faced a “reproducibility crisis.” John Ioannidis’ paper “Why Most Published Research Findings Are False” is highly influential paper published in 2005 that is credited with founding the field of “metascience”. Wikipedia summarizes it as follows:

In the paper, Ioannidis argued that a large number, if not the majority, of published medical research papers contain results that cannot be replicated. In simple terms, the essay states that scientists use hypothesis testing to determine whether scientific discoveries are significant. Statistical significance is formalized in terms of probability, with its p-value measure being reported in the scientific literature as a screening mechanism. Ioannidis posited assumptions about the way people perform and report these tests; then he constructed a statistical model which indicates that most published findings are likely false positive results.

The key argument of the paper is that this “screening mechanism” leads to erroneous significance testing. This is illustrated nicely by XKCD:

xkcd:Significant

Simple Typos#

However, there are many other ways which may lead a paper to fail to replicate. For example, this can happen with simple coding errors. And this does happen more often than you would think! One particularly famous incident involved famous economists Carmen Reinhart and Kenneth Rogoff. A coding error was discovered in their influential paper “Growth in a Time of Debt.” This error, a simple spreadsheet miscalculation, significantly impacted their results, which had previously suggested a clear negative correlation between high public debt and economic growth. This discovery was significant as it challenged the paper’s conclusions, which had been influential in shaping economic policy debates, particularly around the impacts of government debt on economic growth. At the time, this paper was held up as evidence in favor of austerity measures proposed at the time in 2013. The incident highlighted the importance of data accuracy and transparency in economic research.

Reinhart and Rogoff Coding Error

(Image from here)

Reinhart and Rogoff Coding Error Bloomberg

Is There a Replication Crisis in Finance?#

Discussion

Finance, unlike fields like psychology, do not rely on clinical experiments. It relies on data that is, to a large degree, open to anyone to use. Would we expect there to be a reproducibility crisis in finance, given that everybody is using the same data?

The following is actually a very recent example of a failure to replicate due to simple errors. Bloomberg: A Grad-School Number-Cruncher Shakes Up the World of Bond Quants—Professors retract seminal finance paper after detective work uncovers flaws Bloomberg Bonds Research

In this case, Alex Dickerson, a doctoral student, uncovered flaws in a seminal finance paper by Georgetown University professors, which had been influential in bond quantitative analysis. His work led to the retraction of this paper from the Journal of Financial Economics, one of the top five journals in finance and a first for the journal. Dickerson identified errors in data alignment and removal of extreme losses in bonds, which invalidated the original paper’s conclusions about predictive factors in the bond market. This discovery has significant implications for the field of finance, challenging the reliability of many accepted theories and methodologies.

The retracted paper was important because it provided a framework for identifying predictive factors in bond markets, a key aspect for quantitative investing. Its findings were seen as groundbreaking, influencing numerous strategies and research in finance. The retraction of such a seminal work signifies a major shift in understanding these factors, impacting both academic research and practical investment strategies in the bond market. This calls into question the reliability of established methodologies and theories in the field of finance.

However, there is an active, ongoing debate about other challenges to replication (such as data mining and screening). For example, see here:

How can we help?#

Learning about Reproducible Analytical Pipelines and adopting its principles can help. By adopting RAP, researchers and analysts in finance can ensure that their findings are not only based on accurate data handling but are also replicable by others using the same methodology. This approach enhances the credibility of research findings, promotes trust in financial models, and supports the development of sound economic policies and investment strategies. In essence, RAP serves as a cornerstone for reliability and integrity in financial research and practice.

See also

As an example in this direction, see the “Open Source Asset Pricing” project on GitHub: OpenSourceAP/CrossSection