Using Machine Learning to Improve Offshore Wind Resource Assessment and Forecasting

Ma, Nicole; Ma, Nicole

New York's target of generating 9,000 megawatts from offshore wind turbines by 2035 has underscored the critical need for accurate offshore wind resource assessment and forecasting. Since there are few direct observations of winds at turbine hub height, achieving this goal necessitates using reliable atmospheric reanalyses and models, such as the fifth generation ECMWF atmospheric reanalysis (ERA5). This analysis provides wind speeds at several levels in the lower atmosphere; however, it exhibits biases. Because wind power production is proportional to the cube of wind speed, these biases can lead to substantial errors in power production forecasts. Improving ERA5 winds is especially important during the summer due to peak electricity usage.

This study investigates the effectiveness of various machine learning (ML) methods in improving ERA5 winds during the warm season (May-September). In order to quantify ERA5 errors, observed hourly winds were obtained every 20 m from 20-200 m above the ocean surface from two NYSERDA floating lidars in the New York Bight region (~142 km southeast of NYC) from May to September of 2019 to 2022. ERA5 data at several levels from 850-1000 hPa at the lidars and at an eastern Long Island location (KOKX NWS site) were used as inputs for the ML models. In addition, surface atmospheric data (2-meter temperature, boundary layer height, mean sea level pressure, and total cloud cover) and horizontal spatial data over the Northeastern United States at 950 hPa were used to relate the large-scale flow patterns and ERA5 errors in the ML models. Four ML models–the Support Vector Machine (SVM), Random Forest Regressor (RFR), Feed-forward Neural Network (FNN), and Convolutional Neural Network (CNN)–were employed to improve the ERA5 winds.

The RFR exhibited superior performance across various wind speed error metrics, followed by the CNN, suggesting that spatial patterns might not exert a substantial influence on biases in ERA5 data. The RFR indicated that the 1000 hPa u and v wind components at the lidar site, the 1000 hPa u component of wind at the KOKX site, and the boundary layer height at the lidar site contributed most to explaining the variance in the ERA5 wind bias. A similar methodology was followed for improving NOAA’s High Resolution Rapid Refresh model, an operational forecast, for wind power forecasting applications.

726 Using Machine Learning to Improve Offshore Wind Resource Assessment and Forecasting