Publications | Joel Oskarsson

Papers

Building Machine Learning Limited Area Models: Kilometer-Scale Weather Forecasting in Realistic Settings

Simon Adamov^*, Joel Oskarsson^*, Leif Denby, Tomas Landelius, Kasper Hintz, Simon Christiansen, Irene Schicker, Carlos Osuna, Fredrik Lindsten, Oliver Fuhrer, and Sebastian Schemm

arXiv preprint, 2025

Abs Bib PDF Code

Machine learning is revolutionizing global weather forecasting, with models that efficiently produce highly accurate forecasts. Apart from global forecasting there is also a large value in high-resolution regional weather forecasts, focusing on accurate simulations of the atmosphere for a limited area. Initial attempts have been made to use machine learning for such limited area scenarios, but these experiments do not consider realistic forecasting settings and do not investigate the many design choices involved. We present a framework for building kilometer-scale machine learning limited area models with boundary conditions imposed through a flexible boundary forcing method. This enables boundary conditions defined either from reanalysis or operational forecast data. Our approach employs specialized graph constructions with rectangular and triangular meshes, along with multi-step rollout training strategies to improve temporal consistency. We perform systematic evaluation of different design choices, including the boundary width, graph construction and boundary forcing integration. Models are evaluated across both a Danish and a Swiss domain, two regions that exhibit different orographical characteristics. Verification is performed against both gridded analysis data and in-situ observations, including a case study for the storm Ciara in February 2020. Both models achieve skillful predictions across a wide range of variables, with our Swiss model outperforming the numerical weather prediction baseline for key surface variables. With their substantially lower computational cost, our findings demonstrate great potential for machine learning limited area models in the future of regional weather forecasting.
@article{adamov2025building, title = {Building Machine Learning Limited Area Models: Kilometer-Scale Weather Forecasting in Realistic Settings}, author = {Adamov, Simon and Oskarsson, Joel and Denby, Leif and Landelius, Tomas and Hintz, Kasper and Christiansen, Simon and Schicker, Irene and Osuna, Carlos and Lindsten, Fredrik and Fuhrer, Oliver and Schemm, Sebastian}, journal = {arXiv preprint}, year = {2025}, }
Continuous Ensemble Weather Forecasting with Diffusion models

Martin Andrae, Tomas Landelius, Joel Oskarsson, and Fredrik Lindsten

In International Conference on Learning Representations, 2025

Abs Bib PDF Code

Weather forecasting has seen a shift in methods from numerical simulations to data-driven systems. While initial research in the area focused on deterministic forecasting, recent works have used diffusion models to produce skillful ensemble forecasts. These models are trained on a single forecasting step and rolled out autoregressively. However, they are computationally expensive and accumulate errors for high temporal resolution due to the many rollout steps. We address these limitations with Continuous Ensemble Forecasting, a novel and flexible method for sampling ensemble forecasts in diffusion models. The method can generate temporally consistent ensemble trajectories completely in parallel, with no autoregressive steps. Continuous Ensemble Forecasting can also be combined with autoregressive rollouts to yield forecasts at an arbitrary fine temporal resolution without sacrificing accuracy. We demonstrate that the method achieves competitive results for global weather forecasting with good probabilistic properties.
@inproceedings{andrae2024continuousensembleweatherforecasting, title = {Continuous Ensemble Weather Forecasting with Diffusion models}, author = {Andrae, Martin and Landelius, Tomas and Oskarsson, Joel and Lindsten, Fredrik}, booktitle = {International Conference on Learning Representations}, year = {2025}, }
Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks

Joel Oskarsson, Tomas Landelius, Marc Peter Deisenroth, and Fredrik Lindsten

In Advances in Neural Information Processing Systems, 2024

Spotlight

Abs Bib PDF Code

In recent years, machine learning has established itself as a powerful tool for high-resolution weather forecasting. While most current machine learning models focus on deterministic forecasts, accurately capturing the uncertainty in the chaotic weather system calls for probabilistic modeling. We propose a probabilistic weather forecasting model called Graph-EFM, combining a flexible latent-variable formulation with the successful graph-based forecasting framework. The use of a hierarchical graph construction allows for efficient sampling of spatially coherent forecasts. Requiring only a single forward pass per time step, Graph-EFM allows for fast generation of arbitrarily large ensembles. We experiment with the model on both global and limited area forecasting. Ensemble forecasts from Graph-EFM achieve equivalent or lower errors than comparable deterministic models, with the added benefit of accurately capturing forecast uncertainty.
@inproceedings{oskarsson2024probabilistic, title = {Probabilistic Weather Forecasting with Hierarchical Graph Neural Networks}, author = {Oskarsson, Joel and Landelius, Tomas and Deisenroth, Marc Peter and Lindsten, Fredrik}, year = {2024}, booktitle = {Advances in Neural Information Processing Systems}, volume = {37}, }
Uncertainty Quantification of Pre-Trained and Fine-Tuned Surrogate Models using Conformal Prediction

Vignesh Gopakumar, Ander Gray, Joel Oskarsson, Lorenzo Zanisi, Stanislas Pamela, Daniel Giles, Matt Kusner, and Marc Peter Deisenroth

arXiv preprint, 2024

Abs Bib PDF Code

Data-driven surrogate models have shown immense potential as quick, inexpensive approximations to complex numerical and experimental modelling tasks. However, most surrogate models characterising physical systems do not quantify their uncertainty, rendering their predictions unreliable, and needing further validation. Though Bayesian approximations offer some solace in estimating the error associated with these models, they cannot provide they cannot provide guarantees, and the quality of their inferences depends on the availability of prior information and good approximations to posteriors for complex problems. This is particularly pertinent to multi-variable or spatio-temporal problems. Our work constructs and formalises a conformal prediction framework that satisfies marginal coverage for spatio-temporal predictions in a model-agnostic manner, requiring near-zero computational costs. The paper provides an extensive empirical study of the application of the framework to ascertain valid error bars that provide guaranteed coverage across the surrogate model’s domain of operation. The application scope of our work extends across a large range of spatio-temporal models, ranging from solving partial differential equations to weather forecasting. Through the applications, the paper looks at providing statistically valid error bars for deterministic models, as well as crafting guarantees to the error bars of probabilistic models. The paper concludes with a viable conformal prediction formalisation that provides guaranteed coverage of the surrogate model, regardless of model architecture, and its training regime and is unbothered by the curse of dimensionality.
@article{gopakumar2024uncertainty, title = {Uncertainty Quantification of Pre-Trained and Fine-Tuned Surrogate Models using Conformal Prediction}, author = {Gopakumar, Vignesh and Gray, Ander and Oskarsson, Joel and Zanisi, Lorenzo and Pamela, Stanislas and Giles, Daniel and Kusner, Matt and Deisenroth, Marc Peter}, journal = {arXiv preprint}, year = {2024}, }
MTP-GO: Graph-Based Probabilistic Multi-Agent Trajectory Prediction with Neural ODEs

Theodor Westny, Joel Oskarsson, Björn Olofsson, and Erik Frisk

IEEE Transactions on Intelligent Vehicles, 2023

Abs Bib PDF Code

Enabling resilient autonomous motion planning requires robust predictions of surrounding road users’ future behavior. In response to this need and the associated challenges, we introduce our model, titled MTP-GO. The model encodes the scene using temporal graph neural networks to produce the inputs to an underlying motion model. The motion model is implemented using neural ordinary differential equations where the state-transition functions are learned with the rest of the model. Multi-modal probabilistic predictions are provided by combining the concept of mixture density networks and Kalman filtering. The results illustrate the predictive capabilities of the proposed model across various data sets, outperforming several state-of-the-art methods on a number of metrics.
@article{westny2023graph, journal = {IEEE Transactions on Intelligent Vehicles}, title = {MTP-GO: Graph-Based Probabilistic Multi-Agent Trajectory Prediction with Neural ODEs}, year = {2023}, author = {Westny, Theodor and Oskarsson, Joel and Olofsson, Bj{\"o}rn and Frisk, Erik}, }
Evaluation of Differentially Constrained Motion Models for Graph-Based Trajectory Prediction

Theodor Westny, Joel Oskarsson, Björn Olofsson, and Erik Frisk

In 2023 IEEE Intelligent Vehicles Symposium (IV), 2023

Abs Bib PDF Code

Given their flexibility and encouraging performance, deep-learning models are becoming standard for motion prediction in autonomous driving. However, with great flexibility comes a lack of interpretability and possible violations of physical constraints. Accompanying these data-driven methods with differentially-constrained motion models to provide physically feasible trajectories is a promising future direction. The foundation for this work is a previously introduced graph-neural-network-based model, MTP-GO. The neural network learns to compute the inputs to an underlying motion model to provide physically feasible trajectories. This research investigates the performance of various motion models in combination with numerical solvers for the prediction task. The study shows that simpler models, such as low-order integrator models, are preferred over more complex, e.g., kinematic models, to achieve accurate predictions. Further, the numerical solver can have a substantial impact on performance, advising against commonly used first-order methods like Euler forward. Instead, a second-order method like Heun’s can greatly improve predictions.
@inproceedings{westny2023evaluation, title = {Evaluation of Differentially Constrained Motion Models for Graph-Based Trajectory Prediction}, author = {Westny, Theodor and Oskarsson, Joel and Olofsson, Bj{\"o}rn and Frisk, Erik}, booktitle = {2023 IEEE Intelligent Vehicles Symposium (IV)}, year = {2023}, }
Temporal Graph Neural Networks for Irregular Data

Joel Oskarsson, Per Sidén, and Fredrik Lindsten

In Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, 2023

Abs Bib PDF Video Code Poster Slides

This paper proposes a temporal graph neural network model for forecasting of graph-structured irregularly observed time series. Our TGNN4I model is designed to handle both irregular time steps and partial observations of the graph. This is achieved by introducing a time-continuous latent state in each node, following a linear Ordinary Differential Equation (ODE) defined by the output of a Gated Recurrent Unit (GRU). The ODE has an explicit solution as a combination of exponential decay and periodic dynamics. Observations in the graph neighborhood are taken into account by integrating graph neural network layers in both the GRU state update and predictive model. The time-continuous dynamics additionally enable the model to make predictions at arbitrary time steps. We propose a loss function that leverages this and allows for training the model for forecasting over different time horizons. Experiments on simulated data and real-world data from traffic and climate modeling validate the usefulness of both the graph structure and time-continuous dynamics in settings with irregular observations.
@inproceedings{tgnn4i, title = {Temporal Graph Neural Networks for Irregular Data}, author = {Oskarsson, Joel and Sid\'en, Per and Lindsten, Fredrik}, booktitle = {Proceedings of The 26th International Conference on Artificial Intelligence and Statistics}, pages = {4515--4531}, year = {2023}, editor = {Ruiz, Francisco and Dy, Jennifer and van de Meent, Jan-Willem}, volume = {206}, series = {Proceedings of Machine Learning Research}, publisher = {PMLR}, }
Scalable Deep Gaussian Markov Random Fields for General Graphs

Joel Oskarsson, Per Sidén, and Fredrik Lindsten

In Proceedings of the 39th International Conference on Machine Learning, 2022

Abs Bib PDF Video Code Poster Slides

Machine learning methods on graphs have proven useful in many applications due to their ability to handle generally structured data. The framework of Gaussian Markov Random Fields (GMRFs) provides a principled way to define Gaussian models on graphs by utilizing their sparsity structure. We propose a flexible GMRF model for general graphs built on the multi-layer structure of Deep GMRFs, originally proposed for lattice graphs only. By designing a new type of layer we enable the model to scale to large graphs. The layer is constructed to allow for efficient training using variational inference and existing software frameworks for Graph Neural Networks. For a Gaussian likelihood, close to exact Bayesian inference is available for the latent field. This allows for making predictions with accompanying uncertainty estimates. The usefulness of the proposed model is verified by experiments on a number of synthetic and real world datasets, where it compares favorably to other both Bayesian and deep learning methods.
@inproceedings{pmlr-v162-oskarsson22a, title = {Scalable Deep {G}aussian {M}arkov Random Fields for General Graphs}, author = {Oskarsson, Joel and Sid{\'e}n, Per and Lindsten, Fredrik}, booktitle = {Proceedings of the 39th International Conference on Machine Learning}, pages = {17117--17137}, year = {2022}, editor = {Chaudhuri, Kamalika and Jegelka, Stefanie and Song, Le and Szepesvari, Csaba and Niu, Gang and Sabato, Sivan}, volume = {162}, series = {Proceedings of Machine Learning Research}, publisher = {PMLR}, url = {https://proceedings.mlr.press/v162/oskarsson22a.html}, }

Workshop papers

Diffusion-LAM: Probabilistic Limited Area Weather Forecasting with Diffusion

Erik Larsson, Joel Oskarsson, Tomas Landelius, and Fredrik Lindsten

In ICLR 2025 Workshop on Tackling Climate Change with Machine Learning, 2025

Abs Bib PDF

Machine learning methods have been shown to be effective for weather forecasting, based on the speed and accuracy compared to traditional numerical models. While early efforts primarily concentrated on deterministic predictions, the field has increasingly shifted toward probabilistic forecasting to better capture the forecast uncertainty. Most machine learning-based models have been designed for global-scale predictions, with only limited work targeting regional or limited area forecasting, which allows more specialized and flexible modeling for specific locations. This work introduces Diffusion-LAM, a probabilistic limited area weather model leveraging conditional diffusion. By conditioning on boundary data from surrounding regions, our approach generates forecasts within a defined area. Experimental results on the MEPS limited area dataset demonstrate the potential of Diffusion-LAM to deliver accurate probabilistic forecasts, highlighting its promise for limited-area weather prediction.
@inproceedings{larsson2025diffusion, title = {Diffusion-LAM: Probabilistic Limited Area Weather Forecasting with Diffusion}, author = {Larsson, Erik and Oskarsson, Joel and Landelius, Tomas and Lindsten, Fredrik}, booktitle = {ICLR 2025 Workshop on Tackling Climate Change with Machine Learning}, year = {2025}, }
Valid Error Bars for Neural Weather Models using Conformal Prediction

Vignesh Gopakumar, Joel Oskarsson, Ander Gray, Lorenzo Zanisi, Stanislas Pamela, Daniel Giles, Matt Kusner, and Marc Deisenroth

In ICML Workshop on Machine Learning for Earth System Modeling, 2024

Note: A substantial extension of this research is presented in our paper "Uncertainty Quantification of Pre-Trained and Fine-Tuned Surrogate Models using Conformal Prediction"

Abs Bib PDF Code

Neural weather models have shown immense potential as inexpensive and accurate alternatives to physics-based models. However, most models trained to perform weather forecasting do not quantify the uncertainty associated with their forecasts. This limits the trust in the model and the usefulness of the forecasts. In this work we construct and formalise a conformal prediction framework as a post-processing method for estimating this uncertainty. The method is model-agnostic and gives calibrated error bounds for all variables, lead times and spatial locations. No modifications are required to the model and the computational cost is negligible compared to model training. We demonstrate the usefulness of the conformal prediction framework on a limited area neural weather model for the Nordic region. We further explore the advantages of the framework for deterministic and probabilistic models.
@inproceedings{gopakumar2024valid, title = {Valid Error Bars for Neural Weather Models using Conformal Prediction}, booktitle = {ICML Workshop on Machine Learning for Earth System Modeling}, author = {Gopakumar, Vignesh and Oskarsson, Joel and Gray, Ander and Zanisi, Lorenzo and Pamela, Stanislas and Giles, Daniel and Kusner, Matt and Deisenroth, Marc}, year = {2024}, }
Graph-based Neural Weather Prediction for Limited Area Modeling

Joel Oskarsson, Tomas Landelius, and Fredrik Lindsten

In NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning, 2023

Abs Bib PDF Code Poster

The rise of accurate machine learning methods for weather forecasting is creating radical new possibilities for modeling the atmosphere. In the time of climate change, having access to high-resolution forecasts from models like these is also becoming increasingly vital. While most existing Neural Weather Prediction (NeurWP) methods focus on global forecasting, an important question is how these techniques can be applied to limited area modeling. In this work we adapt the graph-based NeurWP approach to the limited area setting and propose a multi-scale hierarchical model extension. Our approach is validated by experiments with a local model for the Nordic region.
@inproceedings{oskarsson2023graphbased, title = {Graph-based Neural Weather Prediction for Limited Area Modeling}, author = {Oskarsson, Joel and Landelius, Tomas and Lindsten, Fredrik}, booktitle = {NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning}, year = {2023}, }
Temporal Graph Neural Networks with Time-Continuous Latent States

Joel Oskarsson, Per Sidén, and Fredrik Lindsten

In ICML Workshop on Continuous Time Methods for Machine Learning, 2022

Note: A substantial extension of this research is presented in our paper "Temporal Graph Neural Networks for Irregular Data"

Abs Bib PDF Video Code Poster Slides

We propose a temporal graph neural network model for graph-structured irregular time series. The model is designed to handle both irregular time steps and partial graph observations. This is achieved by introducing a time-continuous latent state in each node of the graph. The latent dynamics are defined using a state-dependent decay-mechanism. Observations in the graph neighborhood are taken into account by integrating graph neural network layers in both the state update and predictive model. Experiments on a traffic forecasting task validate the usefulness of both the graph structure and time-continuous dynamics in this setting.
@inproceedings{temporal_continuous_gnns, author = {Oskarsson, Joel and Sid{\'e}n, Per and Lindsten, Fredrik}, title = {Temporal Graph Neural Networks with Time-Continuous Latent States}, booktitle = {ICML Workshop on Continuous Time Methods for Machine Learning}, year = {2022}, }

Theses

Probabilistic Regression using Conditional Generative Adversarial Networks

Joel Oskarsson

Linköping University, The Division of Statistics and Machine Learning, 2020

MSc Thesis

Abs Bib PDF Code

Regression is a central problem in statistics and machine learning with applications everywhere in science and technology. In probabilistic regression the relationship between a set of features and a real-valued target variable is modelled as a conditional probability distribution. There are cases where this distribution is very complex and not properly captured by simple approximations, such as assuming a normal distribution. This thesis investigates how conditional Generative Adversarial Networks (GANs) can be used to properly capture more complex conditional distributions. GANs have seen great success in generating complex high-dimensional data, but less work has been done on their use for regression problems. This thesis presents experiments to better understand how conditional GANs can be used in probabilistic regression. Different versions of GANs are extended to the conditional case and evaluated on synthetic and real datasets. It is shown that conditional GANs can learn to estimate a wide range of different distributions and be competitive with existing probabilistic regression models.
@mastersthesis{Oskarsson1442847, author = {Oskarsson, Joel}, institution = {Linköping University, The Division of Statistics and Machine Learning}, pages = {111}, school = {Linköping University, The Division of Statistics and Machine Learning}, title = {Probabilistic Regression using Conditional Generative Adversarial Networks}, keywords = {machine learning, ml, regression, probabilistic, distribution, cgan, gan, conditional gan, adversarial networks, neural network, deep learning, f-gan, f-cgan, f-divergence, adversarial training, bimodal, heteroskedastic, mmd, maximum mean discrepancy, gmmn, generative moment matching network, conditional gmmn, ipm, kde, cgan evaluation, cgan regression, gan regression, cgan-regression, regression using gan, deep, nn, implicit, generative, conditional, model, complex noise, aleatoric, uncertainty, dctd, mdn, heteroskedastic regression, gp}, year = {2020}, }