Thursday, 28 April 2005: 9:45 AM
International Room (Cathedral Hill Hotel)
ABSTRACT. Positive matrix factorization (PMF) techniques have been applied in many environmental studies. The commercial version of the PMF method has a relatively moderate practical limit for the size of the input data matrix, since the computer memory and time needed for the commercial method increases quadratically with the number of elements of solution matrices. To extend the applications of the PMF techniques to large datasets, such as concentrations of ozone and its precursors in a fine grid photochemical model that may have millions of elements, we exercised alternative methods that demand less computer memory and time. One such method, called non-negative matrix factorization (NMF) here, is extremely memory efficient, compared with the commercial PMF method. Both NMF and PMF methods are sensitive to the initialization of solution matrices, and the use of random numbers in the initialization usually starts with a large prediction error and requires a number of model runs with different random seeds. A novel, chemical mass balance method (ROC) is introduced here to provide a reasonable initialization for the NMF method for large datasets. Both NMF and ROC methods were validated with an ideal Cross example and the benchmark example of the commercial PMF method. The ROC-NMF method was further evaluated, in terms of computer time and the prediction error, in the preliminary application to a dataset that contains particle-phase polar organic compounds analyzed for a number of samples collected in Central California during the California Regional PM10/PM2.5 Air Quality Study (1999-2001).
- Indicates paper has been withdrawn from meeting
- Indicates an Award Winner