The storm-identification and tracking algorithm is comprised of multiple tunable parameters, including a smoothing filter, cell matching method, and minimum and maximum thresholds of reflectivity. These tunable algorithm parameters were altered, one at a time, for the purpose of creating more accurate tracks in time and space. Each change in the algorithm parameters led to an additional dataset of individual cell IDs and tracks; therefore, an ensemble of different tracking algorithms was examined. Tracking statistics were computed to qualitatively examine the cell tracks throughout CONUS for one convectively active month (April 2011), and make comparisons with tracks produced from different tracking algorithms. These tracking statistics, including median duration, linearity error, and mismatch error (e.g., max VIL discontinuity), were factors in choosing the “best” tracking algorithm. A subset of storms were evaluated to manually identify cells and tracks, and then comparisons were made with the corresponding cell IDs and tracks produced by the "best" automated storm-identification and tracking algorithm, which highlighted the data quality gap for machine learning applications.
MRMS products were available throughout CONUS for multiple years in the MYRORSS dataset, which allowed for a robust sample size of severe convective storm cells in various environments and storm modes. This "best" tracking algorithm was applied to multiple years of MYRORSS data to develop a cell track dataset for investigation. Evaluating the storm motion vectors of all cells in the dataset gave an estimation of the storm motion distributions for severe convective storms cells throughout CONUS. Future work includes using this multi-year cell track dataset to examine storm cell longevity and as a training dataset for machine learning algorithms.