The Developmental Testbed Center (DTC) has been tasked with evaluating a member of CLUE subsets to answer critical questions related to ensemble design. For example, should model uncertainty be addressed through multiple dynamic cores, multiple physics parameterizations, stochastic physics, or some combination of these? A number of ensemble-based verification measures will be evaluated, including widely-used traditional metrics (spread, skill, error, reliability, etc.) and newer methods that provide additional diagnostic information, especially at higher resolution. These latter approaches will include the Method for Object-based Diagnostic Evaluation (MODE) and neighborhood methods applied to deterministic and probabilistic output (e.g., Fractions Skill Score). Along with standard meteorological fields to highlight overall model performance, an evaluation of severe weather storm-attribute fields readily available in model output or analysis fields (e.g., updraft helicity, radar reflectivity, convective initiation indicators) will also be conducted through a collaboration with NSSL researchers.