One tool that has been helpful with dissecting ML model output (specifically those that rely on random forest methods) is tree interpreter (TI). TI is a package that works in tandem with Python’s scikit-learn library and allows for disaggregation of random forest-based probabilities by feature, which can offer insight on how individual model inputs can influence a final prediction. This method provides some important benefits beyond existing methods that only consider feature importances in an aggregate sense. In this work, TI is used to examine probabilistic forecasts for severe hazards (tornadoes, wind, and hail) from the Colorado State University Machine Learning Probabilities (CSU-MLP) system. Contributions from 15 Global Ensemble Forecast System (GEFS) environmental fields (which serve as model predictors) are analyzed in time and space for approximately two years of daily forecasts. While CSU-MLP produces probabilistic severe forecasts out to Day 8, contributions corresponding to forecasts for Days 2-4 will be emphasized. Results will cover diurnal, seasonal, and spatial trends that were identified amongst the contributions. This presentation will conclude with a discussion on the potential utility of viewing TI contributions alongside CSU-MLP severe probabilities in operational forecasting settings, as well as plans for developing forecaster-friendly visualizations for these data.

