OeNB Jubiläumsfonds, Project 18765

Inference with Bayesian nonparametric models in the presence of measurement errors and outliers

This page is dedicated to a collection of materials in the context of project 18765, funded by the Jubiläumsfonds of the Oesterreichische Nationalbank with EUR 168,000.00 for the period from 12/2022 to 11/2024.

Abstract: Data is often imperfect and incomplete when it is first released. A related problem are outlying observations such as those during the recent Covid-19 pandemic, which may severely affect parameter estimates in conventional linear models. On a related note, there may be changes in transmission channels of fiscal or monetary policies, which linear models are not equipped to detect. Particularly for structural and predictive inference, measurement errors and outliers may severely bias parameter estimates and thus yield misleading and suboptimal policy implications.

This project aims to develop new Bayesian nonparametric methods that are robust to nonlinear relationships in the data, measurement errors and outliers. In addition, the proposed framework is capable of handling highly nonstandard data that is potentially irregular and sampled at different frequencies than key economic series of interest. The proposed research extends state-of-the-art econometric methods for high-dimensional nonlinear time series analysis.

Publications and working papers

Forecasts with Bayesian vector autoregressions under real time conditions

Journal of Forecasting 43(3), 771-801, 2024; by M.Pfarrhofer [DOI]

This paper investigates the sensitivity of forecast performance metrics to taking a real time versus pseudo out-of-sample perspective. I use monthly vintages of two popular datasets for the United States and the euro area. Variants of vector autoregressions, varying the size of the information sets and the conditional mean and variance specification, are considered. The results suggest differences in the relative ordering of model performance for point and density forecasts depending on the forecast simulation design used to evaluate predictive accuracy. Differentials are more pronounced for the European dataset, and stochastic volatility is a particularly attractive model feature to obtain accurate forecasts in real time.

Forecasting euro area inflation using a huge panel of survey expectations

International Journal of Forecasting 40(3), 1042-1054, 2024; by F. Huber, L. Onorante and M. Pfarrhofer [DOI]

In this paper, we forecast euro area inflation and its main components using a massive number of time series on survey expectations obtained from the European Commission’s Business and Consumer Survey. To make the estimation of such a huge model tractable, we use recent advances in computational statistics to carry out posterior simulation and inference. Our findings suggest that including a wide range of firms’ and consumers’ opinions about future economic developments offers useful information to forecast prices and assess tail risks to inflation. These predictive improvements arise from surveys related to expected inflation and other questions related to the general economic environment. Finally, we find that firms’ expectations about the future seem to have more predictive content than consumer expectations.

Dynamic Shrinkage Priors for Large Time-Varying Parameter Regressions Using Scalable Markov Chain Monte Carlo Methods

Studies in Nonlinear Dynamics & Econometrics 28(2), 201-225, 2024; project member N. Hauzenberger co-authored this paper with F. Huber and G. Koop [DOI]

Time-varying parameter (TVP) regression models can involve a huge number of coefficients. Careful prior elicitation is required to yield sensible posterior and predictive inferences. In addition, the computational demands of Markov Chain Monte Carlo (MCMC) methods mean their use is limited to the case where the number of predictors is not too large. In light of these two concerns, this paper proposes a new dynamic shrinkage prior which reflects the empirical regularity that TVPs are typically sparse (i.e. time variation may occur only episodically and only for some of the coefficients). A scalable MCMC algorithm is developed which is capable of handling very high dimensional TVP regressions or TVP Vector Autoregressions. In an exercise using artificial data we demonstrate the accuracy and computational efficiency of our methods. In an application involving the term structure of interest rates in the eurozone, we find our dynamic shrinkage prior to effectively pick out small amounts of parameter change and our methods to forecast well.

Sparse time-varying parameter VECMs with an application to modeling electricity prices

International Journal of Forecasting 41(1), 361-376, 2025; by N. Hauzenberger, M. Pfarrhofer and L. Rossini [DOI]

In this paper we propose a time-varying parameter (TVP) vector error correction model (VECM) with heteroskedastic disturbances. We propose tools to carry out dynamic model specification in an automatic fashion. This involves using global–local priors and postprocessing the parameters to achieve truly sparse solutions. Depending on the respective set of coefficients, we achieve this by minimizing auxiliary loss functions. Our two-step approach limits overfitting and reduces parameter estimation uncertainty. We apply this framework to modeling European electricity prices. When considering daily electricity prices for different markets jointly, our model highlights the importance of explicitly addressing cointegration and nonlinearities. In a forecasting exercise focusing on hourly prices for Germany, our approach yields competitive metrics of predictive accuracy.

Belief Shocks and Implications of Expectations About Growth-at-Risk

Journal of Applied Econometrics forthcoming, by M. Böck and M. Pfarrhofer [DOI]

This paper revisits the question of how shocks to expectations of market participants can cause business cycle fluctuations. We use a vector autoregression to estimate dynamic causal effects of belief shocks which are extracted from nowcast errors about output growth. In a first step, we replicate and corroborate the findings of Enders, Kleemann, and Müller (2021). The second step computes nowcast errors about growth-at-risk at various quantiles. This involves both recovering the quantiles of the nowcast distribution of output growth from the Survey of Professional Forecasters, and, since the true quantiles of output growth are unobserved, estimating them with quantile regressions. We document a lack of distinct patterns in response to shocks arising from nowcasts misjudging macroeconomic risk. Although the differences are statistically insignificant, belief shocks about downside risk seem to produce somewhat sharper business cycle fluctuations.

Bayesian nonparametric methods for macroeconomic forecasting

Handbook of Macroeconomic Forecasting edited by M. Clements and A.B. Galvao, chapter 5, by M. Marcellino and M. Pfarrhofer [Chapter/SSRN]

We review specification and estimation of multivariate Bayesian nonparametric models for forecasting (possibly large sets of) macroeconomic and financial variables. The focus is on Bayesian Additive Regression Trees and Gaussian Processes. We then apply various versions of these models for point, density and tail forecasting using datasets for the euro area and the US. The performance is compared with that of several variants of Bayesian VARs to assess the relevance of accounting for general forms of nonlinearities. We find that medium-scale linear VARs with stochastic volatility are tough benchmarks to beat. Some gains in predictive accuracy arise for nonparametric approaches, most notably for short-run forecasts of unemployment and longer-run predictions of inflation, and during recessionary or otherwise non-standard economic episodes.

Bayesian neural networks for macroeconomic analysis

forthcoming Journal of Econometrics, project member N. Hauzenberger co-authored this paper with F. Huber, K. Klieber and M. Marcellino [DOI]

Macroeconomic data is characterized by a limited number of observations (small ), many time series (big K) but also by featuring temporal dependence. Neural networks, by contrast, are designed for datasets with millions of observations and covariates. In this paper, we develop Bayesian neural networks (BNNs) that are well-suited for handling datasets commonly used for macroeconomic analysis in policy institutions. Our approach avoids extensive specification searches through a novel mixture specification for the activation function that appropriately selects the form of nonlinearities. Shrinkage priors are used to prune the network and force irrelevant neurons to zero. To cope with heteroskedasticity, the BNN is augmented with a stochastic volatility model for the error term. We illustrate how the model can be used in a policy institution through simulations and by showing that BNNs produce more accurate point and density forecasts compared to other machine learning methods.

Nowcasting with Mixed Frequency Data Using Gaussian Processes

working paper, by N. Hauzenberger, M. Marcellino, M. Pfarrhofer and A. Stelzer [WP]

We develop Bayesian machine learning methods for mixed data sampling (MIDAS) regressions. This involves handling frequency mismatches and specifying functional relationships between many predictors and the dependent variable. We use Gaussian processes (GPs) and compress the input space with structured and unstructured MIDAS variants. This yields several versions of GP-MIDAS with distinct properties and implications, which we evaluate in short-horizon now- and forecasting exercises with both simulated data and data on quarterly US output growth and inflation in the GDP deflator. It turns out that our proposed framework leverages macroeconomic Big Data in a computationally efficient way and offers gains in predictive accuracy compared to other machine learning approaches along several dimensions.

Bayesian Modeling of TVP-VARs Using Regression Trees

project member N. Hauzenberger co-authored this paper with F. Huber, G. Koop and J. Mitchell [WP]

In light of widespread evidence of parameter instability in macroeconomic models, many time-varying parameter (TVP) models have been proposed. This paper proposes a nonparametric TVP-VAR model using Bayesian additive regression trees (BART) that models the TVPs as an unknown function of effect modifiers. The novelty of this model arises from the fact that the law of motion driving the parameters is treated nonparametrically. This leads to great flexibility in the nature and extent of parameter change, both in the conditional mean and in the conditional variance. Parsimony is achieved through adopting nonparametric factor structures and use of shrinkage priors. In an application to US macroeconomic data, we illustrate the use of our model in tracking both the evolving nature of the Phillips curve and how the effects of business cycle shocks on inflation measures vary nonlinearly with changes in the effect modifiers.

Presentations

Conference presentations by project members

Presentation of Gaussian Process VARs and Macroeconomic Uncertainty at 16th International Conference on Computational and Financial Econometrics (CFE) in London by project member N. Hauzenberger

Presentation of Nowcasting with mixed frequency data using Gaussian processes at IAAE Annual Conference 2024 in Rethymno, IHS Research Seminar and OeNB Freitagsseminar by M. Pfarrhofer
Presentation of Nowcasting with mixed frequency data using Gaussian processes at the Macroeconomic Modelling Frontiers for Research and Policy in Central Banks conference by the European Central Bank by co-author A. Stelzer
Presentation (keynote) of Bayesian nonparametric methods for macroeconomic forecasting at the 12th ECB Conference on Forecasting Techniques by co-author M. Marcellino

Other materials

Noteworthy achievements by project members

N. Hauzenberger was appointed Senior Lecturer at the University of Strathclyde, Scotland, UK in September 2023
M. Pfarrhofer successfully achieved a habilitation at the University of Salzburg in January 2023; further, he was appointed Assistant Professor (tenure-track) at WU Vienna in September 2023

Code repositories

Nonparametric mixed-frequency Bayesian Additive Regression Trees [mf-bavart]
Nonparametric quantile regression [qf-bart]
Gaussian Process MIDAS models [gp-midas] to be uploaded upon publication of the corresponding paper

Collection of real-time data sets

United States, Real-Time Data Set for Macroeconomists
United Kingdom, Gross Domestic Product Real-Time Database
Australia, Australian Real-Time Macroeconomic Database
Canada, CAN-MD-QD, Homepage of D. Stevanovic

Michael Pfarrhofer