Powered by OpenAIRE graph
Found an issue? Give us feedback

Laboratoire de mathématiques d'Orsay

Laboratoire de mathématiques d'Orsay

16 Projects, page 1 of 4
  • Funder: French National Research Agency (ANR) Project Code: ANR-23-CE40-0018
    Funder Contribution: 516,941 EUR

    Modern statistics and machine learning problems often feature data sitting in an ambient space of high dimension. Yet, methods such as random forests or deep neural networks have recently enabled remarkable performance even in such complex settings. One main reason is that the data can often be explained through a low dimensional structure, hidden to the statistician. In such settings, Bayesian methods such as spike-and-slab variable selection priors, Bayesian additive regression trees (BART), Bayesian deep neural networks or deep Gaussian processes are routinely used by statisticians as well as in physics, astronomy and genomics applications. Among the reasons for the popularity of Bayesian algorithms, one can mention: their flexibility, in that it is relatively easy to model the unknown structure underlying the data through the prior distribution; the broad range of computational methods available, including variational approximations; their ability to quantify uncertainty through so-called credible sets. While there are many empirical successes, there is an important need for understanding and validation for such methods. From the mathematical perspective, one would like to be able to understand and demonstrate under which conditions such algorithms effectively work. The BACKUP project aims at providing theoretical backup for such modern statistical algorithms, around three research avenues. First, new results will be obtained for high-dimensional models and latent variable settings using Bayesian posterior distributions, tackling important recent questions of multiple testing and variable selection. Second, foundational results will be obtained for complex methods such as random forests and Bayesian deep neural networks, both for posteriors and their variational approximations. Third, we will address the fundamental question of uncertainty quantification, by deriving optimal efficient confidence sets from well-chosen Bayesian credible regions.

    more_vert
  • Funder: French National Research Agency (ANR) Project Code: ANR-21-CE23-0035
    Funder Contribution: 272,496 EUR

    Unsupervised Learning is one of the most fundamental problem of machine learning, and more generally, of artificial intelligence. In a broad sense, it amounts to learning some unobserved latent structure over data. This structure may be of interest per se, or may serve as an important stepping stone integrated in a complex data analysis pipe-line - since large amounts of unlabeled data are more common than costly labeled data. Arguably, one the cornerstones of unsupervised learning is clustering, where the aim is to recover a partition of the data into homogeneous groups. Beside vanilla clustering, unsupervised learning encompasses a large variety of related other problems such as hierarchical clustering, where the group structure is more complex and reveals both the backbone and fine-grain organization of the data, segmentation where the shape of the clusters is constrained by side information, or ranking or seriation problems where where no actual cluster structure exists, but where there is some implicit ordering between the data. All these problems have already found countless applications and interest in these methods is even strengthening due to the amount of available unlabelled data. We can for instance cite crowdsourcing - where individuals answer to a subset of questions, and where, depending on the context, one might want to e.g. cluster them depending on their field of expertise, rank them depending on their performances, or seriate them depending on their affinities. Such problems are extremely relevant for recommender systems - where individuals are users, and questions are items - and for social network analyses. The analysis of unsupervised learning procedures has a long history that takes its roots both in the computer science and in mathematical communities. In response to recent bridges between these two communities, groundbreaking advances have been made in the theoretical foundations of vanilla clustering. We believe that these recent advances hold the key for deep impacts on the broader field of unsupervised learning because of the pervasive nature of clustering. In this proposal, we first aim at propagating these recent ground-breaking advances in vanilla clustering to problems where the latent structure is either more complex or more constrained. We will consider problems of increasing latent structure complexity - starting from hierarchical clustering and heading toward ranking, seriation, and segmentation - and propose new algorithms that will build on each other, focusing on the interfaces between these problems. As a result, we expect to provide new methods that are valid under weaker assumptions in comparison to what is usually done - e.g. parametric assumptions - while being also able to adapt to the unknown intrinsic difficulty of the problem. Moreover, many modern unsupervised learning applications are essentially of an online nature - and sometimes decisions have to be made sequentially on top of that. For instance, consider a recommender systems that sequentially recommends items to users. In this context where sequential, active recommendations are made, it is important to leverage the latent structure underlying the individuals. While both the fields of unsupervised learning, and sequential, active learning, are thriving, research at the crossroad has been conducted mostly separately by each community - leading to procedures that can be improved. A second aim of this proposal will then be to bring together the fields of unsupervised learning and active learning, in order to propose new algorithms that are more efficient at leveraging sequentially the unknown latent structure. We will consider the same unsupervised learning problems as in the batch learning side of the proposal. We will focus on developing algorithms that fully take advantage of new advances in clustering, and of our own future work in batch learning.

    more_vert
  • Funder: French National Research Agency (ANR) Project Code: ANR-21-CE40-0002
    Funder Contribution: 475,000 EUR

    This project will combine the expertise of many specialists in contact and symplectic topology. Our research objectives will concentrate around two specific themes: Lefschetz fibrations and open book decompositions on the one hand and persistent homology on the other hand. Lefschetz fibrations and open book decompositions are central tools to understand the structure of symplectic and contact manifolds. We plan to use these notions in order to derive new constraints on the topology of Lagrangian submanifolds. We also plan to gain a better understanding of their holomorphic curves invariants and of their properties, using the very recent theory of convex hypersurfaces. Exciting new results in C^0 symplectic topology were obtained using persistent homology. We plan to use it in order to extract richer information from homological invariants constructed using holomorphic curves, generating families or sheaves.

    more_vert
  • Funder: French National Research Agency (ANR) Project Code: ANR-21-CE40-0017
    Funder Contribution: 391,642 EUR

    Following Berger's holonomy classification and Atiyah and Donaldson's achievements in Yang-Mills theory, differential geometers have studied the interactions between variational and algebraic perspectives. Our project combines these traditions in the study of special geometric structures, such as extremal Kähler/Sasaki, special holonomy, generalized geometry and the interplay of all these concepts in Ströminger systems. In practice, these problems belong to gauge theory : a space of connections, a curvature equation to solve, a group of symmetries to control. In each case, the expected outcome is a correspondence between a special geometry and an algebraic condition, as provided by Kobayashi-Hitchin-D-U-Y, which allows to describe the local structure of the moduli space in terms of stability. In addition to constructing new families of examples, our objective is to understand their global topology, deformations and algebraic obstructions.

    more_vert
  • Funder: French National Research Agency (ANR) Project Code: ANR-16-CE40-0014
    Funder Contribution: 268,464 EUR

    The (real) Monge-Ampère equation is a fully nonlinear elliptic equation with a strong geometric nature. It can indeed be used, following Minkowski, to recover a convex hypersurface from the knowledge of its Gaussian curvature. This equation also plays a central role in the theory of optimal transport, which has a wide range of applications in physics, economics and geometry. However, the singular and highly non-linear nature of the equation is a serious obstruction to the development of efficient solvers. We propose to attack the problem from an original side, that combines the point of views of analysts and computational geometers, both categories being represented by members of this project. First, we will develop and study solvers for the standard Monge-Ampère equation in dimension two and three. Our objective is that it becomes possible to deal with very large discretizations without parameter tweaking. This will allow their use for the numerical resolution of difficult non-linear minimization problems involving Monge-Ampère operators. This includes minimization problems over the space of convex bodies or of convex functions but also minimization problems formulated in term of optimal transport. For instance, we will consider the problem of approximating minimizing geodesics over the space of measure-preserving maps, which can be used to construct non-deterministic solutions to Euler's equation for incompressible fluids. In all these cases, we hope that numerical experiments will lead to a better mathematical understanding of the underlying problems. Second, we will extend the computational geometric approach to generalized Monge-Ampère equations arising in particular from non-imaging optics. This includes the problem of designing reflectors and refractors that transport the light energy emitted from a point source into a target whose shape and intensity distribution is prescribed. Some of the equations occuring from non-imaging optics belong to a more general class of equations called ``prescribed generated Jacobian equations'', whose discretization will require new theoretical tools and sophisticated geometric algorithms.

    more_vert
  • chevron_left
  • 1
  • 2
  • 3
  • 4
  • chevron_right

Do the share buttons not appear? Please make sure, any blocking addon is disabled, and then reload the page.

Content report
No reports available
Funder report
No option selected
arrow_drop_down

Do you wish to download a CSV file? Note that this process may take a while.

There was an error in csv downloading. Please try again later.