To produce forecasts of hail, ML models are trained for HREFv2 and HRRRE using Maximum Estimated Size of Hail from the WSR-88D radar network as an observed hail proxy for training. As input, the ML models use a set of 2-dimensional variable fields from the specified ensemble forecast (HREFv2 or HRRRE), including both storm variables (those directly related to convective storms; e.g. convective available potential energy (CAPE), column maximum reflectivity) and environmental variables (those representative of the larger-scale environment; e.g. temperature and winds at different vertical levels). Using column-maximum updraft speed, potential storm objects are identified in the model data. A random forest model predicts whether a given storm object will produce hail, and, if it is predicted to produce hail, a second random forest model predicts the size distribution of that hail via a gamma distribution. The predicted hail size distributions are then used to generate probabilistic hail forecasts. Finally, the hail forecasts are calibrated using isotonic regression to improve the reliability and bias performance of the forecast and generate a range of probabilities closer to that of human-generated severe hail outlooks.
Overall, during the spring and summer of 2019, these ML-based hail forecasts exhibited good skill in predicting severe hail using both HREFv2 and HRRRE data. Calibration via isotonic regression using local storm reports as the target dataset yielded forecasts with excellent reliability; the objective skill of the ML-based forecasts in terms of equitable threat score and other skill metrics is comparable to or better than that of severe hail forecasts produced using common severe weather proxy variables (such as updraft helicity). Detailed verification will be presented along with results from individual cases and subjective evaluations from 2019 HWT SFE participants.