List of Potential Final Projects#
The project that you are assigned will come from one of the following three categories: white papers, data series from âEvidence from Many Asset Classesâ, or replications of arbitrage spreads from the paper âSegmented Arbitrageâ.
If you canât access the papers below, you can find a link to them here: https://drive.google.com/drive/folders/10OvVRXb-G31PHV95smF06HkahRpnCh_d?usp=sharing
White Papers#
1. Lewellen (2015), Critical Finance Review. âThe Cross-section of Expected Stock Returnsâ#
This paper will teach you how to combine many firm characteristics into real-time forecasts of a stockâs expected returns. See the abstract: âThis paper studies the cross-sectional properties of return forecasts derived from Fama-MacBeth regressions. These forecasts mimic how an investor could, in real time, combine many firm characteristics to obtain a composite estimate of a stockâs expected return. Empirically, the forecasts vary substantially across stocks and have strong predictive power for actual returns. For example, using ten-year rolling estimates of Fama-MacBeth slopes and a cross-sectional model with 15 firm characteristics (all based on low-frequency data), the expected-return estimates have a cross-sectional standard deviation of 0.87% monthly and a predictive slope for future monthly returns of 0.74, with a standard error of 0.07.â
Task: Replicate Table 1, Table 2, and Figure 1
Data sources: CRSP and Compustat
Citation: Lewellen, Jonathan W., The Cross Section of Expected Stock Returns (August 22, 2014). Forthcoming in Critical Finance Review, Tuck School of Business Working Paper No. 2511246, Available at SSRN: https://ssrn.com/abstract=2511246 or http://dx.doi.org/10.2139/ssrn.2511246
Notes: I will provide some starter code that partially constructs some of these tables. In this project, youâll have to complete the code.
2. Monetary Tightening and US Bank Fragility in 2023#
This paper will teach you how to use bank call reports to construct measures of the financial condition of banks. This is especially useful if you have interest in specializing, for example, in specializing in equities covering the financial sector.
Task: Replicate Table 1, Table A1 and Figure A1
Citation: Jiang et al (2023). Monetary Tightening and US Bank Fragility in 2023: Mark-to-Market Losses and Uninsured Depositor Runs?. No. w31048. National Bureau of Economic Research, 2023. https://www.nber.org/papers/w31048
Data sources: Bank Call Reports. See https://wrds-www.wharton.upenn.edu/pages/get-data/bank-regulatory/
Notes: I will provide some starter code that will replicate most of Table 1.
3. Intermediary Asset Pricing: New Evidence from Many Asset Classes#
This project will give you experience in analyzing the balance sheet fundamentals of large broker-dealers, in this case the full set of primary dealers in the US, and how the financial condition of these primary dealers affect the US stock market as a whole.
Task: Replicate Table 2, and Table 3 (Panel A only, but Panel B for bonus)
Data used: CRSP, Compustat, Datastream
Citation: He, Zhiguo, Bryan Kelly, and Asaf Manela. âIntermediary asset pricing: New evidence from many asset classes.â Journal of Financial Economics 126, no. 1 (2017): 1-35.
Note: I will provide some code that will help you get started.
5. Forward Return Expectations#
This paper deals with extracting return expectations from option prices. More than simply measuring return expectations as a whole, this paper constructs expectations as different horizons. See the abstract: âWe measure investorsâ short- and long-term stock-return expectations using both options and survey data. These expectations at different horizons reveal what investors think their own short-term expectations will be in the future, or forward return expectations. While contemporaneous short-term expectations are not countercyclical across all data sources, we find that forward expectations are consistently countercyclical, and excessively so: in bad times, forward expectations are higher than justified by investorsâ own subsequent short-term return expectations. This excess volatility in forward expectations helps account for excess volatility in prices, inelastic demand for equities, and stylized facts about the equity term structure.â
Task: Replicate Table 1 and Figure 1
Data: OptionMetrics
Citation: Gandhi, Mihir and Gormsen, Niels Joachim and Lazarus, Eben, Forward Return Expectations (September 7, 2023). Available at SSRN: https://ssrn.com/abstract=4565351 or http://dx.doi.org/10.2139/ssrn.4565351
6. Market Expectations in the Cross-Section of Present Values#
This paper demonstrates that returns and cash flows for the aggregate US stock market are highly predictable. It demonstrates this by constructing a univariate predictor from the cross section of stocks. See the abstract:
Returns and cash flow growth for the aggregate U.S. stock market are highly and robustly predictable. Using a single factor extracted from the cross-section of book-to market ratios, we find an out-of-sample return forecasting R2 of 13% at the annual frequency (0.9% monthly). We document similar out-of-sample predictability for returns on value, size, momentum, and industry portfolios. We present a model linking aggregate market expectations to disaggregated valuation ratios in a latent factor system. Spreads in value portfoliosâ exposures to economic shocks are key to identifying predictability and are consistent with duration-based theories of the value premium.
Also, from their paper,
âOur approach attacks a challenging problem in empirical asset pricing: how does one exploit a wealth of predictors in relatively short time series? If the predictors number near or more than the number of observations, the standard ordinary least squares (OLS) forecaster will be poorly behaved or nonexistent (see Huber (1973)). Our solution is to use partial least squares (PLS; Wold (1975)), which is a simple regression-based procedure designed to parsimoniously forecast a single time series using a large panel of predictors. We use it to construct a univariate forecaster for market returns (or dividend growth) that is a linear combination of assetsâ valuation ratios.â
Task: Replicate Table 1
Data sources: CRSP and Compustat
Citation: KELLY, B. and PRUITT, S. (2013), Market Expectations in the Cross-Section of Present Values. THE JOURNAL OF FINANCE, 68: 1721-1756. https://doi.org/10.1111/jofi.12060
Replications of data series from âEvidence from Many Asset Classesâ#
He, Kelly, and Manela (2017) is a well known paper that examines the effects that intermediariesâ balance sheets have on asset prices. This paper happens to test this theory on a variety of asset classes, going beyond just debt and equity. The projects in this section will be to replicate the test asset returns across this variety of asset classes. Each project below focuses on a different asset class. The returns of these tests assets are provided here. The data provided in that file goes up until 2012. Your project will be to replicate that data up until 2012 and then provide updated returns going as close to the present as possible. The derivation of the returns is defined by a separate paper. These are listed below.
7. Corporate Bond Portfolio Returns#
This will give you experience in working with corporate bonds.
Task: Replicate the corporate bond columns from He_Kelly_Manela_Factors_And_Test_Assets_monthly.csv (https://asafmanela.github.io/papers/hkm/intermediarycapitalrisk/He_Kelly_Manela_Factors.zip )
Data used: Bond Returns by WRDS (https://wrds-www.wharton.upenn.edu/pages/get-data/wrds-bond-returns/), but maybe also FINRA TRACE (https://wrds-www.wharton.upenn.edu/pages/get-data/otc-corporate-bond-and-agency-debt-bond-transaction-data/) , Mergent FISD/NAIC, TRACE, and Lehman Brothers Fixed Income Database.
Citation: Nozawa, Yoshio. âWhat Drives the CrossâSection of Credit Spreads?: A Variance Decomposition Approach.â The Journal of Finance 72, no. 5 (2017): 2045-2072.
Note: You may use data from openbondassetpricing.com/ and here (Alexander-M-Dickerson/TRACE-corporate-bond-processing) if itâs helpful.
8. Credit Default Swap Returns#
This replication will give you experience using the Markit data source for Credit Default Swaps. If youâre interested in CDS, this is the canonical paper you should know for constructing returns on these instruments.
Task: Replicate the Credit Default Swap columns from He_Kelly_Manela_Factors_And_Test_Assets_monthly.csv (https://asafmanela.github.io/papers/hkm/intermediarycapitalrisk/He_Kelly_Manela_Factors.zip )
These columns are described in He, Kelly, and Manela. (2017), but the definition of returns on CDS contracts comes from Palhares (2013).
Data used: Markit
Citation: Palhares, Diogo. Cash-flow maturity and risk premia in CDS markets. The University of Chicago, 2013.
9. Commodity Futures Returns#
This replication will give you experience in working with commodity futures and constructing return series from them.
Task: Replicate Table 1. It may be difficult to replicate the commodities columns in He_Kelly_Manela_Factors_And_Test_Assets_monthly.csv (https://asafmanela.github.io/papers/hkm/intermediarycapitalrisk/He_Kelly_Manela_Factors.zip ), but you may try.
Data used: Futures Contract data from Datastream
Citation: Yang, Fan. âInvestment shocks and the commodity basis spread.â Journal of Financial Economics 110, no. 1 (2013): 164-184.
Replicate Arbitrage Spreads from âSegmented Arbitrageâ#
The projects in this section will have students replicate various arbitrage spreads studied in the paper âSegmented Arbitrageâ https://www.hbs.edu/ris/Publication Files/24-030_1506d32b-3190-4144-8c75-a2326b87f81e.pdf by Emil Siriwardane, Adi Sunderam, and Jonathan Wallen.
This paper examines various so-called arbitrage trades and makes the observation that the price dynamics of these trades challenges various asset pricing models that have been used in the past to explain deviations from no-arbitrage. The internet appendix gives more detail as to how each trade works: https://www.emilsiriwardane.com/s/Appendix.pdf
One of the authors, Emil Siriwardane https://www.emilsiriwardane.com/, makes code and data available for these trades online: esiriwardane/arbitrage-spreads-public However, much of the code is in Stata. Your goal will be to rewrite these in Python and automate the pulls of the data (to the extent possible).
You can also find some of my own code that attempts to replicate some of the figures in the paper here: jmbejara/arb_spreads . This replication merely uses the data from the Authorâs original code above, so it may or may not be helpful. You can find some notebooks that demo the code here: https://jeremybejarano.com/arb_spreads/
Tasks
For each of the following, you should replicate the associated panel in Figure A1 in the Appendix (See https://www.emilsiriwardane.com/s/Appendix.pdf)
Replicate the series shown in the paper, but also compute the series up to the present (so that the reporting of the size of the arbitrage spread can be automated)
In each case that Bloomberg is used, please try to use Datastream as an alternative if possible
10. FX Covered Interest Parity Spread#
This trade is based off of the covered interest parity (CIP). A CIP deviation is a spread between a cash riskless rate and a synthetic riskless rate. The synthetic rate is local currency borrowing swapped into a foreign denominated rate using cross-currency derivatives.
Data used: Bloomberg, maybe Datastream
11. Equity Spot Futures#
âWe construct arbitrage implied forward rates using the nearby and first deferred futures contracts for the S&P 500, Dow Jones, and Nasdaq 100 indices.â
Data used: Bloomberg
12. Treasury Spot-Futures#
Replicate the data pull and the construction of the Treasury-Spot Futures spread. The Treasury spot-futures arbitrage trade involves exploiting the price discrepancy between Treasury futures contracts and the underlying Treasury securities in the cash (spot) market. By simultaneously buying (or selling) the mispriced asset and selling (or buying) its equivalent in the other market, arbitrageurs aim to lock in a risk-free profit, accounting for carry costs, accrued interest, and delivery options.
Data used: Bloomberg
13. Treasury Swap#
In an interest rate swap, one counterparty agrees to pay a series of predetermined payments based on the so-called fixed rate (or swap rate) prevailing at the swapâs inception. In return, the counterparty receives a series of stochastic floating payments that is determined based on the future realization of a short-term reference rate.
Data used: Bloomberg
14. TIPS Treasury#
âIn the market for US sovereign debt, there is a no-arbitrage condition between inflation-swapped Treasury Inflation-Protected Securities (TIPS) and Treasuries (Fleckenstein et al., 2014). TIPS are US Treasury obligations for which the principal amount (and coupons) are adjusted for the Consumer Price Index (CPI). These inflation adjustments may be undone using an inflation swap, yielding fixed cash flows. The arbitrage spread is the difference in yield between this synthetic nominal Treasury constructed from TIPS and inflation swaps and a nominal Treasuryâ
Data sources: Bloomberg, and the Federal Reserveâs Yield Curve data (for zero coupon bond yields: https://www.federalreserve.gov/data/yield-curve-tables/feds200628_1.html )
15. Corporate CDS-Bond Spread#
Replicate the data pull and construction of the series using the Markit Credit Default Swap data in conjunction with the Corporate Bond data on wrds.
Replicate the final CDS-Bond basis spread given in final_cds_bases.xlsx here: esiriwardane/arbitrage-spreads-public Your numbers should match theirs nearly precisely.
Extra credit available if this is chosen.
Data used: Markit CDS, Corporate TRACE