Frugal Generative Modeling for Tabular Data - Algorithmique, Recherche Opérationnelle, Bioinformatique et Apprentissage Statistique
Communication Dans Un Congrès Année : 2024

Frugal Generative Modeling for Tabular Data

Résumé

This paper presents a generative modeling approach called Gmda designed for tabular data, adapted to its arbitrary feature correlation structure. The generative model is trained so that sampled regions in the feature space contain the same fraction of true and synthetic samples, allowing true and synthetic data distributions to be aligned using a frugal and sound learning criterion. The merits of Gmda in terms of the usual performance indicators (pairwise correlation errors, precision, recall, predictive performance) are on par with or better than the state-of-the-art approaches for tabular data based on VAEs, GANs, or diffusion models. The key point is that it provides generative models with one or more orders of magnitude that are more frugal than baseline approaches.
Fichier principal
Vignette du fichier
ECML2024.pdf (6.21 Mo) Télécharger le fichier
Origine Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04705131 , version 1 (15-10-2024)

Identifiants

Citer

Alice Lacan, Blaise Hanczar, Michele Sebag. Frugal Generative Modeling for Tabular Data. ECML PKDD 2024 - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Sep 2024, Vilnius, Lithuania. pp.55--72, ⟨10.1007/978-3-031-70371-3_4⟩. ⟨hal-04705131⟩
184 Consultations
32 Téléchargements

Altmetric

Partager

More