In the following study, we use data collected during HAOS (Hailuoto Atmospheric Observations over Sea ice) (Wenta et. al., 2021) and HAOS2 (22 - 27 March 2021) campaigns carried out near Hailuoto island (Bothnia Bay, Baltic Sea) and SUMO (Small Unmanned Meteorological Observer) UAS measurements from the Ross Sea (Cassano, 2020) to evaluate regional NWP models predictions. In the case of Baltic Sea campaigns the following models forecasts are verified: AROME (Applications of Research to Operations at Mesoscale) - Arctic, HIRLAM (High-Resolution Limited Area Model), and WRF (Weather Research and Forecasting), whereas for the Ross Sea we use Antarctic Mesoscale Prediction System (polar WRF) data. Models ability to simulate vertical changes of humidity, temperature, wind speed, and air pressure is analyzed during various states of the atmosphere including clear sky and cloudy conditions, anticyclone passing, and throughout the periods of very cold and relatively mild temperatures. Our goal is to assess which ABL state, stable, neutral, or unstable, poses the biggest challenge for the models and why? It is found that generally, models struggle with an accurate representation of surface-based temperature inversions, often overestimating its strength, and vertical variations of humidity during the day, as they tend to overestimate the amount of water vapor/liquid water in the air. Furthermore, the low-level jets (LLJ) are also misrepresented in the models results, together with temporal changes of wind speed. In many cases the properties of the upper layers of the atmosphere are better resolved, than the lower layers, adjacent to the surface, thus indicating (as many other studies) that surface-atmosphere interactions in the models are not represented correctly. In order to determine whether different models configurations might improve models ability to predict certain ABL conditions we focus on the WRF model, a connecting factor between Bothnia Bay and Ross Sea analysis, and using WRF Single-Column-Model version evaluate how different planetary boundary layer parameterizations (MYJ, YSU, MYNN, QNSE), vertical resolution, and more accurate representation of surface conditions affect the results. Experiments with WRF SCM are also used to determine the possible reasons behind model biases. Preliminary results show that accurate representation of sea ice conditions, including thickness, surface temperature, albedo, and snow coverage is crucial for increasing the quality of NWP model forecasts. We emphasize the importance of further development of parametrizations focusing on the processes at the sea ice-atmosphere interface.