Upcoming seminars
Click this button to automatically add the events of the seminar into your calendar: iCal url.
| When | Who | Where |
|---|---|---|
| 12 June | Richard Sowers | B211 |
| 25 June | Florentin Guth | B211 |
| 17 September | Alessandro Barp | B211 |
| 24 September | Xinping Zhu | B211 |
-
Richard Sowers (University of Illinois Urbana-Champaign), Friday 12 June (TBC), Room B211
Lateral Boundary Conditions for a Kolmogorov-type PDE
We consider a Kolmogorov-type PDE corresponding to a particle under white noise force. We are interested in stopping the process at a fixed position i.e. imposing Dirichlet conditions at a side boundary. We construct a simple Gaussian heat kernel inside the domain and investigate a boundary-layer kernel connected to some work by McKean. We show that this boundary layer heat kernel has a novel jump condition. We outline a polynomial expansion of for the heat kernels and then construct a Volterra equation for solving the original problem. The novel jump leads to a periodic structure of the Volterra equation.
-
Florentin Guth (NYU, Flatiron Institute), Thursday 25 June at 10.30 am, Room B211
Towards a science of deep learning: the structure of data and weights (deep learning)
With deep learning, we now have access to strikingly powerful generative models of images and text, in apparent spite of the curse of dimensionality. This success rests on two facts: natural data is highly structured, and our choices of network architectures and training algorithms implicitly encode strong assumptions about this structure, so that only a small number of “effective” parameters ultimately need to be learned. However, we lack a principled understanding of the mechanisms behind this process, leaving us with limited guidance for improving data efficiency or mitigating failures such as memorization and hallucination. In this talk, I will present experiments on deep networks to uncover the structure of their training data and its encoding in the learned weights. I will show that with sufficient data, diffusion models learn a genuine probability distribution over natural images rather than a collection of memorized samples. Exploring this learned probability model reveals that the usual picture of images concentrating near a low-dimensional “typical manifold” needs to be revised. Local low-dimensionality alone is therefore insufficient to lift the curse of dimensionality. To capture the more global structure that deep networks exploit, I will present a framework for extracting “effective parameters” from trained weights, quantifying how many such parameters are learned, and comparing them across networks. These results provide insights into what and how neural networks learn, paving the way towards the principled design of architectures and algorithms whose inductive biases explicitly match available prior information about the data.
-
Alessandro Barp (University College London), Thursday 17 September at 10.30 am, Room B211
From Bernoulli to deRham: a historical perspective on distributions and random variables, from a geometric viewpointGeometric ideas have played a tangential role in statistics since the origins of mathematical statistics, and their role has grown increasingly central in modern methodologies— from sampling to model inference. In this talk, we describe how geometry not only motivates key tools in statistics, from Hamiltonian-based samplers to kernel distances, but also underpins the very foundations of probability. The talk will focus on drawing connection between fields and discuss the historical motivations, from a modern viewpoint.Abstract TBD
-
Xinping Zhu (Navier, École nationale des ponts et chaussée), Thursday 24 September at 10:30, Room B211
Multiscale simulation of cementitious materials
Concrete is the most widely used manufactured material in the world and forms the foundation of modern infrastructure. However, the production of Portland cement, the key binding component of concrete, is responsible for approximately 8% of global anthropogenic CO2 emissions. Improving the performance and durability of concrete is therefore an important strategy for reducing its environmental footprint by extending service life and minimizing repair and replacement needs. Achieving this goal requires a fundamental understanding of how hydration products form, how microstructures evolve, and how these microscopic features control macroscopic mechanical properties. Concrete is also a highly porous and heterogeneous material that can deteriorate under harsh environmental conditions, such as freeze-thaw cycles and chemical attack. Understanding the mechanisms responsible for such degradation is essential for developing more durable and resilient cementitious materials. At the same time, cement-based materials can act as carbon sinks through the carbonation of hydration products, particularly portlandite and calcium silicate hydrate (C-S-H). The efficiency of CO2 sequestration depends on a complex interplay of dissolution, transport, nucleation, and precipitation processes occurring across multiple length scales.
All these phenomena are rooted in molecular interactions among water, ions, hydrates, and mineral surfaces. Molecular simulation provides a powerful tool for probing these mechanisms and bridging microscopic processes with macroscopic behavior. In this seminar, I will present a multiscale perspective on cementitious materials, spanning molecular to mesoscale phenomena and covering both physical and chemical processes. I will begin with a brief introduction to molecular simulation methods. Then I will discuss several computational approaches, including classical molecular dynamics with non-reactive and reactive force fields, enhanced sampling techniques such as metadynamics, and coarse-grained modeling. Selected applications in C-S-H gelation, ice crystallization in confined space, and carbon mineralization will be presented to demonstrate how molecular simulations can provide fundamental insights and support the development of more sustainable and durable cement-based materials.
Past seminars (2025-2026)
-
Jesús De Loera (UC Davis),
Friday 5 September at 10:00, Room B211What is the Best Way to Slice a Polyhedron?
For hundreds of years mathematicians have been fascinated with slicing high-dimensional mathematical objects as a way to get knowledge and intuition of higher dimensions. There are many classical results and conjectures (e.g., Bourgain’s conjecture, recently a theorem, on the relation of volume and area of slices). My talk is computational contribution to this subject.
Given a d-dimensional convex polytope P, what is the “best’’ slice of P by a hyperplane? Here “best’’ can mean many possible things, analytics e.g., a slice with the largest volume? Or combinatorial, e.g., a slice with the largest number of vertices? etc. This touches on classical work by Lagrange, Bourgain, Ball, Koldobsky, Milman, Pournin, and many other mathematicians. Not only we investigate the above optimization theorem but also, as we slice P with hyperplanes, we create many combinatorially different (d-1)-slices, which are also polytopes of course. E.g., for a 3-dimensional regular cube there are 4 combinatorial types of slices (triangles, quadrilaterals, pentagons, hexagons). We investigate: How many different slices are there for a polytope P? How can we count them all? Can we give lower/upper bounds on their number? What are extremal cases?
I will explain a powerful new geometric model (a moduli space of slices) and algorithmic framework that answers these problems (and others) in polynomial time when dim(P) is fixed. Moreover, we show the problems have hard complexity otherwise.
This is joint work with Marie-Charlotte Brandenburg (U Bochum) and Chiara Meroni (ETH) and Antonio Torres and Gyivan López (UC Davis)
-
Marios Andreou (University of Wisconsin-Madison),
Tuesday 09 September at 10:30, Room B211Harnessing the Conditional Gaussian Nonlinear System Framework for Efficient Adaptive-Lag Online Smoothing and Causal Inference
This talk presents recent advances in data assimilation and causal inference through the Conditional Gaussian Nonlinear System (CGNS) framework. This is a class of nonlinear stochastic models in which, conditioned on a subset of state variables, the unobserved components follow a Gaussian posterior distribution. This structure enables efficient Bayesian state estimation and sampling via closed-form solutions, making CGNS especially suitable for high-dimensional, multiscale systems with regime shifts and intermittent extreme events. CGNSs have wide applications in uncertainty quantification, modelling complex geophysical phenomena, dealing with high-dimensional systems, and facilitating machine learning.
We first introduce an adaptive-lag online smoother that leverages the analytical tractability of CGNSs to effectively reduce the computational storage requirements of standard smoothing procedures. Adaptively adjusting the lag by exploiting information-theoretic criteria, this strategy is applicable to turbulent systems with time-varying temporal correlations and performs well in challenging real-world problems such as Lagrangian data assimilation and online parameter estimation.
Building on this, we present a paradigm-shifting causal inference framework, called Assimilative Causal Inference (ACI), which leverages Bayesian data assimilation to trace causes backward from observed effects by solving an inverse problem rather than quantifying forward influence. It uniquely identifies dynamic causal interactions without requiring observations of candidate causes, accommodates short datasets, and scales efficiently to high dimensions. Crucially, it provides online tracking of causal roles, which may reverse intermittently, and facilitates a mathematically rigorous criterion for the causal influence range, revealing how far effects propagate. ACI opens new avenues for studying complex systems, where transient causal structures are critical. Despite its general setting, within the CGNS framework ACI enables explicit nil-causality principles and analytical characterisations of the associated causal influence ranges for objective temporal attribution and prediction.
Applications to systems showcasing turbulent dynamics and nonlinear geophysical flows illustrate how CGNSs provide scalable, real-time, and theoretically grounded solutions to online smoothing and causal inference.
-
Laure Saint-Raymond (IHES),
Wednesday 10 September at 14:30, Room B211Autour de l’équation de Boltzmann.
Je présenterai quatre résultats mathématiques marquants de ces cinquante dernières années qui permettent de mieux appréhender ce modèle fondamental de la théorie cinétique des gaz.
-
Jonathan Eckstein (Rutgers University),
Thursday 2 October at 10:30, Room B211Monotone Operator Splitting, Projection, and Stochastic Programming
We begin by describing the basics of monotone set-valued operators, proximal, methods, and operator splitting, with a particular focus on solving stochastic programming problems defined on finite multistage trees. A classic form of operator splitting is (perhaps misleadingly) called “Douglas Rachford” and underlies the now-popular ADMM algorithm. We will survey the basics of that algorithm and how it leads to the progressive hedging (PH) method of Rockafellar and Wets. Projective splitting is a newer approach to operator splitting, and allows derivation of algorithms allowing forms of asynchronous implementation. We consider a specialization of this approach to stochastic programming, producing a method that needs to re-solve only a subset of the scenarios at each iteration. We will also touch on an ongoing project to develop a more strongly asynchronous implementation and apply it to stochastic problems with integer variables.
-
Charles Bertucci (École polytechnique),
Thursday 2 October at 10:30, Room B211Analysis on the space of probability measures and application to optimal control and large deviations
I will motivate the study of non-linear partial differential equations on the space of probability measures by presenting some general questions of optimal control and large deviations of mean field problems. I will then explain some general tools of analysis of functions of a measure argument such as derivatives or super-differentials. I will then explain how those tools can be used in the motivating problems.
-
Andrew McRae (EPFL/ CERMICS),
Thursday 13 November at 10:30, Room B211Nonconvex optimization landscapes in statistics: benignness, relaxation, and tightness
Nonconvex (continuous) optimization is everywhere. Classically, nonconvexity is problematic due to the possibility of spurious local optima. However, in practice, for many applications, naïve algorithms that find a local optimum give satisfactory solutions. Why is this? In my work, I try to explain theoretically how problem structure coming from statistical applications has a surprising impact on the difficulty of an optimization problem. One possible explanation is that the landscape is benign: that is, all local optima are, in fact, global optima (or at least close enough for the task at hand). When this fails, we can try to make the problem easier by relaxing or overparametrizing it. However, this then raises the questions of (a) how much do we relax (there is then often a tradeoff between the number of variables and how close the problem is to a convex one), and (b) is the relaxation tight? I will illustrate these issues and present some of my recent work in the area of low-rank matrix estimation, specifically low-rank matrix sensing and orthogonal group synchronization. These problems have applications in many areas such as computational microscopy, sensor network localization, robotics, and dynamical systems.
-
Bernardo Cockburn (University of Minnesota), Tuesday 18 November at 10:30, Room B211
Transforming stabilizations into Spaces
The idea of transforming what could be called “stabilization term” of a numerical scheme into an accuracy-enhancing space is presented and explored in detail in the framework of the discontinuous Galerkin (DG) and continuous Galerkin (CG) for a simple ODE. To do that, the weak formulation of the DG method is transformed into its strong form in order to establish a simple relation between the DG and CG approximations. Using it, a post-processed DG solution is defined which turns out to converge with one additional order than the DG approximation. We then apply the idea to the one-dimensional transport equation. It is known that, when polynomials of degree k are used, the DG approximation converges with order k+1. Nevertheless, we argue that an efficient postprocessing of the DG approximation can be defined as to converge with order 2k+1, even in unstructured meshes. We present many computational results that illustrate the behaviour of the postprocessing. We end by describing our ongoing work and the difficulties we are trying to overcome to extend this idea to multidimensional PDEs.
-
Alain Durmus (CMAP, Ecole Polytechnique), Thursday 11 December at 10:30, Room B211
Online Decision-Focused Learning
The widespread deployment of Machine Learning systems everywhere raises challenges, such as dealing with interactions or competition between multiple learners. In that goal, we study multi-agent sequential decision-making by considering principal-agent interactions in a tree structure. In this problem, the reward of a player is influenced by the actions of her children, who are all self-interested and non-cooperative, hence the complexity of making good decisions. Our main finding is that it is possible to steer all the players towards the globally optimal set of actions by simply allowing single-step transfers between them. A transfer is established between a principal and one of her agents: the principal actually offers the proposed payment if the agent picks the recommended action. The analysis poses specific challenges due to the intricate interactions between the nodes of the tree and the propagation of the regret within this tree. Considering a bandit setup, we propose algorithmic solutions for the players to end up being no-regret with respect to the optimal pair of actions and incentives. In the long run, allowing transfers between players makes them act as if they were collaborating together, although they remain self-interested non-cooperative: transfers restore efficiency.
-
Victor Chardès (Flatiron Institure, New York), Thursday 8 January at 10:30, Room B211
Flow-based generative modeling of gene regulation with single-cell omics
Single-cell omics methods provide high-throughput, molecular-scale measurements of cellular processes, offering insights into the machinery of cell fate decision-making. However, the high dimensionality, stochasticity, and cross-sectional nature of these measurements hinder their integration with biophysical models of gene regulation. My research aims to bridge this gap by combining generative AI with the biophysics of gene regulation to infer actionable and interpretable models of cell fate decision-making from single-cell omics data. In this direction, I recently developed probability flow inference (PFI), a computational approach to infer models specified by arbitrary stochastic differential equations from time-resolved single-cell RNA-seq data. I will discuss the mathematical underpinnings of the method, its relationship with the flow matching paradigm and the open mathematical questions it raises. I will show that such models not only infer better biological representations, but also generalize better to out-of-distribution data, outperforming state-of-the-art purely data-driven approaches.
-
Austin J.Stromme (ENSAE/CREST, Paris – Saclay), Thursday 19 February at 10:30, Room B211
On the implicit regularization of Langevin dynamics with projected noise
Motivated by the study of stochastic gradient descent for over-parameterized models, we consider Langevin dynamics with noise projected onto directions which are orthogonal to an isometric group action. Our main result identifies a novel form of implicit regularization: when the initial and target density are both invariant under the group action, projected Langevin is equivalent in law to Langevin with identity diffusion but with an additional drift term proportional to the negative log volume of the group orbit. The proof is based on constructing a certain stochastic process on the group itself, and identifies the extra drift term as the negative mean curvature of the group orbit. Based on ongoing joint work with Govind Menon and Adrien Vacher.
-
Max Zimmer (Zuse Institute Berlin), Monday 23 March at 10:30, Room B211
Continuous Optimization for Discrete Problems: Two Stories from LLM Pruning and Discrete GeometryAbstract: This talk presents two settings where continuous optimization unlocks progress on fundamentally combinatorial problems. In the first part, we consider pruning of large language models, where state-of-the-art methods operate layer-wise, minimizing a per-layer objective on a small calibration dataset. The underlying NP-hard problem is that of finding a binary mask that determines which weights to keep — the so-called mask selection problem. We propose to relax these combinatorial constraints to their convex hull and solve the resulting convex program using the Frank-Wolfe algorithm, reducing per-layer error by up to 80% with consistent improvements in perplexity and downstream accuracy. In the second part, we turn to the Hadwiger-Nelson problem, a long-standing open problem from discrete geometry about coloring the Euclidean plane while avoiding monochromatic unit-distance pairs. Using neural networks as approximators, we reformulate this geometric coloring problem as an optimization task with a probabilistic, differentiable loss function, enabling gradient-based exploration of admissible configurations. This leads to the discovery of two novel six-colorings providing the first improvements in thirty years to the off-diagonal variant of the original problem. Despite their different origins, both lines of work share a common denominator: relax a discrete problem into a continuous one, solve it with first-order methods, and extract combinatorial structure from the solution. -
Wen-Tai Hsu (University of Maryland), Thursday 26 March at 10:30, Room B211
Narrow escape problems in domains with bottlenecks
Narrow escape problems study the behavior of Brownian motion in bounded domains with reflecting boundaries, except for small escaping regions. Domains with bottlenecks arise naturally in applications such as the transmission of ions or molecules in dendritic spines. In this talk, I will discuss the expected escape time, the distribution of exit locations, and the (un)conditional distributions of escape times. A central technical difficulty is to prove convergence to a suitable quasi-stationary distribution for an associated discretized Markov process, which plays a key role in the analysis of these quantities.
-
Haitian Yang (Tsinghua University), Thursday 9 April at 10:30, Room B211
Boundary control of multi-dimensional first-order hyperbolic systems
Boundary control of first-order hyperbolic systems has been extensively studied in the spatially one-dimensional setting. In multi-dimensions, however, progress has been limited, largely due to the loss of Riemann invariant structure. This talk reports recent progress in my doctoral research on boundary stabilization, boundary controllability, and boundary optimal control for multi-dimensional first-order hyperbolic systems. I will also highlight how energy propagation and internal interaction may provide a unifying perspective across these three problems.
-
Ines Vati (Queensland University of Technology), Thursday 16 April at 10:30, Room tbd
Fast Geometric Machine Learning Methods for Brain Image Analysis
Alzheimer’s Disease (AD), the most common form of dementia, is characterised by the accumulation of misfolded proteins in the brain, ultimately leading to cortical atrophy and cognitive decline. Accurate measurement of the atrophy patterns is essential not only for diagnosis but also for early detection and potential prevention. However, due to the high variability in cortical geometry across individuals, population- based analyses typically rely on long, sophisticated neuroimaging pipelines for cortical feature computation and inter-subject alignment. These traditional approaches often require several days to weeks of computation for large cohorts, creating a major bottleneck for large-scale neuroimaging studies. Deep learning–based approaches have the potential to dramatically accelerate this process.
In this talk, I will present the recent work conducted in our team to streamline the cortical surface-based pipeline. I will review optimization-based approaches and end-to-end deep learning methods for cortical mesh reconstruction and surface registration. Some insights into the Spherical Demons algorithm, a classical diffeomorphic registration method on the sphere, will be provided. I will then introduce NC-Reg, our novel approach for rigid registration of spherical surfaces using a neural representation. Subsequently, I will outline our ongoing and future work, with a particular focus on CorticalFlow++, a deep learning model for fast cortical mesh reconstruction directly from MRI images.
We believe that this work lays a strong foundation for more eXicient, scalable, and robust tools for cortical surface analysis. These developments have the potential to enable large- cohort neuroimaging studies and ultimately contribute to both computational neuroscience and clinical applications.
-
Alice Marveggio (Institute for Applied Math, Bonn), Tuesday 19 April at 10:30, Room B211
The Verigin problem with phase transition as Wasserstein flow
We study the modeling of a compressible two-phase flow in a porous medium. The governing PDE system is known as the Verigin problem with phase transition, which is the compressible analogue to the Muskat problem. We prove the convergence of an implicit time discretization scheme using the Wasserstein distance, obtaining distributional solutions in the limit that satisfy an optimal energy-dissipation rate. This talk is based on a joint work with Anna Kubin and Tim Laux.
