The study area is the Gulf of Mexico, a region where many of the largest US ports are located. Several locations were selected including locations on the open coast, in embayments and ship channels, to ensure that the models were able to generalize to most of the Texas ports. The model was tested with high success for the tide gauges of Bob Hall Pier, Port Isabel, Rockport, Manchester, and North Jetty along or near the waterways of the ports of Corpus Christi, Brownsville, and Houston/Galveston. All these ports would highly benefit from more accurate operational real-time models. The goal to eventually transition these models to operation was accounted for for in the model design. For example sensors data can become unavailable, particularly during extreme events or due to maintenance or accidents hence we limited the number of inputs and length of past time series to increase the robustness of the predictions when implemented. The final models only include past wind and water level measurements and wind predictions.
This research compared the performance of multiple architectures such as Seq2Seq, and different types of transformers. From that comparison, we found that the best architecture for the described problem was achieved using Seq2Seq. Using this architecture, it was possible to maintain the Central Frequency of 15 cm (CF(15cm)) above 90% for up to 72 hours for most of the tested locations. This is a significant improvement compared to the present state-of-the-art AI models, which were only able to maintain the CF(15 cm) above 90% for 48 hours predictions or more for very few locations. Also, we found that for some locations, such as Port Isabel and Rockport, we were able to maintain a CF(15 cm) above 90% for up to 108 hours. This is a major improvement compared to the previous published work. This improvement will be highly valuable to stakeholders who will be able to benefit from these longer lead-time predictions.

