Google DeepMind Open-Sources GenCast: A Machine Learning-based Weather Model that can Predict Different Weather Conditions up to 15 Days Ahead

Accurately forecasting weather remains a complex challenge due to the inherent uncertainty in atmospheric dynamics and the nonlinear nature of weather systems. As such, methodologies developed ought to reflect the most probable and potential outcomes, especially in high-stakes decision-making over disasters, energy management, and public safety. While numerical weather prediction (NWP) models offer probabilistic insights through ensemble forecasting, they are computationally expensive and prone to errors. Although ML models have been very promising in giving faster and more accurate predictions, they fail to represent forecast uncertainty, especially in extreme events. This makes ML-based models less useful in actual real-world applications.

The physics-based ensemble models, for example, the ENS from the European Centre for Medium-Range Weather Forecasts (ECMWF), rely on these simulations to produce probabilistic forecasts. These models properly represent the forecast distributions and joint spatiotemporal dependencies and require high computational resources and manual engineering. Conversely, the ML-based method, like GraphCast or FourCastNet, focuses only on deterministic forecasts and will minimize the errors in the mean outcome without considering any uncertainty. None of the attempts to generate probabilistic ensembles by MLWP produced realistic samples or competed with the accuracy of operational ensemble forecasts. Hybrid approaches like NeuralGCM embed ML-based parameterizations within traditional frameworks but have poor resolution and limited performance.

Researchers from Google DeepMind released GenCast, a probabilistic weather forecasting model that generates accurate and efficient ensemble forecasts. This machine learning model applies conditional diffusion models to produce stochastic trajectories of weather, such that the ensembles consist of the entire probability distribution of atmospheric conditions. In systematic ways, it creates forecast trajectories by using the prior states through autoregressive sampling and uses a denoising neural network, which is integrated with a graph-transformer processor on a refined icosahedral mesh. Utilizing 40 years of ERA5 reanalysis data, GenCast captures a rich set of weather patterns and provides high performance. This feature allows it to generate a 15-day global forecast at 0.25° resolution within 8 minutes, which is state-of-the-art ENS in terms of both skill and speed. The innovation has transformed operational weather prediction by enhancing both the accuracy and efficiency of forecasts.

GenCast models the conditional probability distribution of future atmospheric states through a diffusion-based approach. It iteratively refines noisy initial states using a denoiser neural network comprising three core components: an encoder that converts atmospheric data into refined representations on a mesh grid, a processor that implements a graph-transformer to capture neighborhood dependencies, and a decoder that maps refined mesh representations back to grid-based atmospheric variables. The model runs at 0.25° latitude-longitude resolution, producing forecasts at 12-hour intervals over a 15-day horizon. The training with ERA5 data from 1979 to 2018 was two-stage scaling from 1° to 0.25° resolution. It is efficient in generating probabilistic ensembles that make it different from the traditional and ML-based approaches.

GenCast demonstrated superior performance across a wide range of evaluation metrics, consistently outperforming the state-of-the-art ENS model. It achieved in 97.2% of the targeted fields a substantially improved probabilistic accuracy on key atmospheric variables like temperature and humidity, by up to 30%.GenCast provided better reliable predictions for extreme atmospheric events, including heatwaves and cyclones; it decreased the spatial uncertainty of tropical cyclone movement by around 12 hours at critical lead times. In addition, with spatiotemporal association, the model resulted in better regional wind energy predictability, with strong development in predictive skill over very short and medium-length lead times. These findings justify the capability of revolutionizing operational weather forecasting by offering a faster, more precise, and more resilient alternative to conventional techniques. 

GenCast stands to be a revolution in probabilistic weather forecasting; thus, it uses machine learning and generative modeling to ensure good quality, efficient, and realistic ensemble forecasts. Forecast uncertainty and spatiotemporal dependencies better fit into its novel diffusion-based approach than traditional as well as existing ML-based ones. Its ability to forecast extreme events and, eventually, support renewable energy management has opened new prospects of possibilities in operational forecasting that it points out the significant influence of generative AI. 


Check out the Paper and Codes. All credit for this research goes to the researchers of this project. Also, don’t forget to follow us on Twitter and join our Telegram Channel and LinkedIn Group. If you like our work, you will love our newsletter.. Don’t Forget to join our 60k+ ML SubReddit.

🚨 [Must Attend Webinar]: ‘Transform proofs-of-concept into production-ready AI applications and agents’ (Promoted)

Aswin AK is a consulting intern at MarkTechPost. He is pursuing his Dual Degree at the Indian Institute of Technology, Kharagpur. He is passionate about data science and machine learning, bringing a strong academic background and hands-on experience in solving real-life cross-domain challenges.

🧵🧵 [Download] Evaluation of Large Language Model Vulnerabilities Report (Promoted)