Abstract
Data-driven machine learning based approaches have been playing increasingly important roles in 5G and beyond (B5G) network management and optimisation. One of the major challenges is that the existence of class imbalance in networking datasets can greatly bias the classifier towards a majority classification. This discrepancy can pose a serious problem for AI-based models, which require large and diverse amounts of data to learn patterns and generate classifications. The generation of synthetic data is a process that has evolved over time from the application of statistical models to a more machine learning-centric approaches. In this research work we present the development, implementation, evaluation and comparison of four generative models for tabular data on a B5G network management dataset. These models have been previously optimised according to certain evaluation metrics of the generated synthetic data. The dataset presents a problem of imbalance between its classes, which is improved by using generative models to enrich it with synthetic data. The results show that machine learning based generative models obtain more accurate data than traditional statistical models, and are much faster in terms of conditional data sampling.
Original language | English |
---|---|
Title of host publication | Proceedings of the 20th International Wireless Communications & Mobile Computing Conference |
Publisher | IEEE |
Number of pages | 7 |
DOIs | |
Publication status | Published - 17 Jul 2024 |
Event | The 20th International Wireless Communications & Mobile Computing Conference: Green and Intelligent Communications - Adams Beach Hotel, Ayia Napa, Cyprus Duration: 27 May 2024 → 31 May 2024 https://iwcmc.org/2024/index.php |
Conference
Conference | The 20th International Wireless Communications & Mobile Computing Conference |
---|---|
Abbreviated title | IWCMC 2024 |
Country/Territory | Cyprus |
City | Ayia Napa |
Period | 27/05/24 → 31/05/24 |
Internet address |
Keywords
- generative model
- synthetic tabular data
- networking dataset
- B5G network
- network optimisation