This paper will describe core data categories and featured datasets, discuss how archiving tools are applied across the RDA, review the strategy to manage millions of files in conjunction with metadata databases to enable data discovery, and emphasize scale appropriate access capabilities of this heterogeneous collection.
The RDA has recently transitioned to a new supporting IT infrastructure. The current and future user benefits from this transition include much more data online, co-location of data and supercomputing that enable greater server-side data preparation from multi-terabyte datasets, data research opportunities at NCAR, and the potential to add new datasets that strengthen the relevance of the RDA to the research community and possibly address the NSF data management requirements for selected research efforts.