Handling imbalanced 5G and beyond network tabular data using conditional generative models

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Data-driven machine learning based approaches have been playing increasingly important roles in 5G and beyond (B5G) network management and optimisation. One of the major challenges is that the existence of class imbalance in networking datasets can greatly bias the classifier towards a majority classification. This discrepancy can pose a serious problem for AI-based models, which require large and diverse amounts of data to learn patterns and generate classifications. The generation of synthetic data is a process that has evolved over time from the application of statistical models to a more machine learning-centric approaches. In this research work we present the development, implementation, evaluation and comparison of four generative models for tabular data on a B5G network management dataset. These models have been previously optimised according to certain evaluation metrics of the generated synthetic data. The dataset presents a problem of imbalance between its classes, which is improved by using generative models to enrich it with synthetic data. The results show that machine learning based generative models obtain more accurate data than traditional statistical models, and are much faster in terms of conditional data sampling.
Original languageEnglish
Title of host publicationProceedings of the 20th International Wireless Communications & Mobile Computing Conference
PublisherIEEE
Number of pages7
Publication statusAccepted/In press - 28 Mar 2024
EventThe 20th International Wireless Communications & Mobile Computing Conference: Green and Intelligent Communications - Adams Beach Hotel, Ayia Napa, Cyprus
Duration: 27 May 202431 May 2024
https://iwcmc.org/2024/index.php

Conference

ConferenceThe 20th International Wireless Communications & Mobile Computing Conference
Abbreviated titleIWCMC 2024
Country/TerritoryCyprus
CityAyia Napa
Period27/05/2431/05/24
Internet address

Keywords

  • generative model
  • synthetic tabular data
  • networking dataset
  • B5G network
  • network optimisation

Fingerprint

Dive into the research topics of 'Handling imbalanced 5G and beyond network tabular data using conditional generative models'. Together they form a unique fingerprint.

Cite this