87th AMS Annual Meeting

Thursday, 18 January 2007: 2:15 PM
Deployable Suite of Data Mining Web Services for Online Science Data Repositories
216AB (Henry B. Gonzalez Convention Center)
Sara Graves, Univ. of Alabama, Huntsville, AL; and R. Ramachandran, K. Keiser, M. Maskey, and C. Lynnes
Poster PDF (111.5 kB)
NASA recently funded a project to create a suite of specialized deployable data mining web services designed specifically for science data. The project leverages the Algorithm Development and Mining (ADaM) toolkit as the basis. The ADaM toolkit is a robust, mature and freely available science data mining toolkit that is being used by different research organizations and educational institutions worldwide. These deployable services will give the scientific community a powerful and versatile data mining capability that can be used to create higher order products such as thematic maps from current and future NASA satellite data records with methods that are not currently available. Many of the specialized data mining, pattern recognition, image processing and data preparation algorithms in ADaM are specifically geared towards satellite imagery, making these tools a perfect fit for NASA satellite data. In addition to providing specialized deployable data mining services, the suite will also include the Earth Science Markup Language (ESML) giving users the power to handle a variety of heterogeneous data formats seamlessly. ESML is another proven technology that, like ADaM, is in use by organizations worldwide. The deployable package of mining and related services will be developed using web services standards so that community based measurement processing systems can access and interoperate with them. The maturation of web services standards and technology sets the stage for a distributed “Service-Oriented Architecture” (SOA) for NASA's next generation science data processing. This architecture will allow members of the scientific community to create and combine persistent distributed data processing services and make them available to other users over the internet. The Goddard Earth Sciences Data and Information Services Center (GES DISC) will be used as an operational site for demonstration. The details of this project will be presented in this paper.

Supplementary URL: