Synthesizing Tabular Data Using Conditional GAN

Synthesizing Tabular Data Using Conditional GAN
Author :
Publisher :
Total Pages : 93
Release :
ISBN-10 : OCLC:1202001437
ISBN-13 :
Rating : 4/5 ( Downloads)

Book Synopsis Synthesizing Tabular Data Using Conditional GAN by : Lei Xu (S.M.)

Download or read book Synthesizing Tabular Data Using Conditional GAN written by Lei Xu (S.M.) and published by . This book was released on 2020 with total page 93 pages. Available in PDF, EPUB and Kindle. Book excerpt: In data science, the ability to model the distribution of rows in tabular data and generate realistic synthetic data enables various important applications including data compression, data disclosure, and privacy-preserving machine learning. However, because tabular data usually contains a mix of discrete and continuous columns, building such a model is a non-trivial task. Continuous columns may have multiple modes, while discrete columns are sometimes imbalanced, making modeling difficult. To address this problem, I took two major steps. (1) I designed SDGym, a thorough benchmark, to compare existing models, identify different properties of tabular data and analyze how these properties challenge different models. Our experimental results show that statistical models, such as Bayesian networks, that are constrained to a fixed family of available distributions cannot model tabular data effectively, especially when both continuous and discrete columns are included. Recently proposed deep generative models are capable of modeling more sophisticated distributions, but cannot outperform Bayesian network models in practice, because the network structure and learning procedure are not optimized for tabular data which may contain non-Gaussian continuous columns and imbalanced discrete columns. (2) To address these problems, I designed CTGAN, which uses a conditional generative adversarial network to address the challenges in modeling tabular data. Because CTGAN uses reversible data transformations and is trained by re-sampling the data, it can address common challenges in synthetic data generation. I evaluated CTGAN on the benchmark and showed that it consistently and significantly outperforms existing statistical and deep learning models.


Synthesizing Tabular Data Using Conditional GAN Related Books

Synthesizing Tabular Data Using Conditional GAN
Language: en
Pages: 93
Authors: Lei Xu (S.M.)
Categories:
Type: BOOK - Published: 2020 - Publisher:

DOWNLOAD EBOOK

In data science, the ability to model the distribution of rows in tabular data and generate realistic synthetic data enables various important applications incl
Engineering Applications of Neural Networks
Language: en
Pages: 544
Authors: Lazaros Iliadis
Categories: Computers
Type: BOOK - Published: 2022-06-14 - Publisher: Springer Nature

DOWNLOAD EBOOK

This book constitutes the refereed proceedings of the 23rd International Conference on Engineering Applications of Neural Networks, EANN 2022, held in Chersonis
Data Envelopment Analysis (DEA) Methods for Maximizing Efficiency
Language: en
Pages: 413
Authors: Ajibesin, Adeyemi Abel
Categories: Computers
Type: BOOK - Published: 2024-01-16 - Publisher: IGI Global

DOWNLOAD EBOOK

In today's highly competitive and rapidly evolving global landscape, the quest for efficiency has become a crucial factor in determining the success of organiza
Hybrid Artificial Intelligent Systems
Language: en
Pages: 789
Authors: Pablo GarcĂ­a Bringas
Categories: Computers
Type: BOOK - Published: 2023-08-28 - Publisher: Springer Nature

DOWNLOAD EBOOK

This book constitutes the refereed proceedings of the 18th International Conference on Hybrid Artificial Intelligent Systems, HAIS 2023, held in Salamanca, Spai
Artificial Intelligence in Medicine
Language: en
Pages: 505
Authors: Martin Michalowski
Categories: Computers
Type: BOOK - Published: 2020-09-25 - Publisher: Springer Nature

DOWNLOAD EBOOK

The LNAI 12299 constitutes the papers of the 18th International Conference on Artificial Intelligence in Medicine, AIME 2020, which will be held online in Augus