1. Introduction
2. Data Collection and Analysis
2.1 Ocean Buoy Data from the Korea Meteorological Administration
2.2 ECMWF Reanalysis Data
3. Design of ANN Models and Machine Learning
3.1 Data Preprocessing
3.2 Artificial Neural Network Models
3.3 Model Training and Validation
3.4 Hyperparameters and Hyperparameter Optimization
4. Results
4.1 Hyperparameter Optimization
4.2 Significant Wave Height
4.3 Peak Wave Period
4.4 Wave Direction
5. Conclusions
(1) The estimation results of all the optimized ANN models showed a lower Test MAE than the ocean wave data of ECMWF ERA5, which confirmed that the optimized ANN models were better. Next, this study compared the Test MAE of five optimized ANN models for each of the ocean wave parameters and suggested the best ANN model for each parameter. Accordingly, Bi-GRUN, GRUN, and FNN were considered to be the ANN models most suitable for filling in missing data for significant wave height, peak wave period, and wave direction.
(2) The increased Test MAE of the optimized ANN models for a higher sea state code suggests a drop in performance in filling in missing data for significant wave height. This study conjectured that it may be due to the issue of imbalanced data caused by the difference in the number of samples depending on the sea state code of the ocean wave data applied in the training and validation process. Future research may apply data sampling and data augmentation techniques to improve performance. As the eye of Typhoon Chaba in 2016 passed directly over the Geomundo ocean buoy, wind data were missing at the Geomundo ocean buoy, and the significant wave height in the ANN model, estimated with the sharply reduced wind speed in ECMWF ERA5, may have been underestimated. The results of the peak wave period estimated by the ANN model optimized for a peak wave period wave of greater than 10 s showed increased errors, confirming a trend of decreased accuracy. These ocean waves were generated and transferred by high-speed winds from spatially distant waters. Therefore, developing an ANN model based on wind data from waters, considering a typhoon’s track, is expected to improve the accuracy of long-term ocean wave estimation.
(3) This study suggested ANN models for filling in missing ocean wave data, and trained, validated, and tested the models based on the measurement results of the Geomundo ocean buoy installed in the waters around Korea. This study employed the measurement results of a single ocean buoy to evaluate the performance of the ANN models optimized for each of the ocean wave parameters. Nonetheless, the study’s finding, that the optimized ANN models were more accurate than the data of ECMWF ERA5, widely applied as reliable ocean wave data worldwide, suggests that AI models are expected to fill in the missing ocean wave data of an ocean buoy.