Abstract

Is it possible to find hidden relationships among variables in WiFi network using machine learning (ML)? Can we use ML to find a variable that significantly affects the TCP throughput in WiFi? In this work, we employ a publicly available WiFi dataset to investigate these questions. We use ML techniques, including principal component analysis (PCA), linear regression (LR), and random forest (RF), to study the effect of link speed, received signal strength, round-trip time (RTT), and number of available access points on TCP throughput in WiFi. More specifically, we are interested in employing ML to find the variable that most accurately predicts and thereby most significantly affects the throughput. Simple correlation analysis indicates that a combination of multiple variables is more likely to act as a reasonable predictor of the throughput, whereas a single variable, such as RTT, alone is not likely to predict the throughput with reasonable accuracy. From PCA, the first principal component (PC1) is seen as highly correlated to RTT. During predictive analysis, it is observed that the LR model is unable to find any hidden relationship between throughput and other variables. However, the RF model discovers that RTT explains the variation in throughput more closely and as such it predicts the throughput more accurately compared to other variables. PC1 captures nearly all of the variation in throughput with the RF model and predicts throughput with very high accuracy, which indirectly confirms RTT as the variable that most significantly affects the TCP throughput in WiFi. Consequently, we discover a very close relationship between RTT and TCP throughput using appropriate ML techniques, and these results can be helpful in developing a better understanding of the relationship between latency and throughput for designing future low-latency networks.

Highlights

  • WiFi is widely deployed and will continue to play a significant role in the presence of generation cellular networks like 5G

  • 5 Results for analysis The results for the three different types of analyses on the WiFi dataset, namely correlation analysis, principal component analysis, and predictive analysis, to investigate any hidden relationship between other variables and TCP throughput are presented here followed by an in-depth discussion at the end of this section

  • 6 Conclusions In this work, we studied the relationship between TCP throughput and other variables in the WiFi network such as link speed, received signal strength, round-trip time (RTT), and number of available WiFi access points

Read more

Summary

Introduction

WiFi is widely deployed and will continue to play a significant role in the presence of generation cellular networks like 5G. The round-trip delay in TCP—i.e., the delay encountered by the packet to reach the destination plus the delay experienced by the acknowledgement of that packet to arrive at the source—includes the WiFi-induced delay as well the delay incurred over the Internet, and we refer to it as the round-trip time (RTT) Considering these upcoming ULL applications, the effect of RTT on TCP throughput in WiFi networks is of significant interest. Principal component analysis is generally used to reduce the dimensionality of a dataset having a large number of interrelated variables while retaining as much as possible of the variation present in the dataset. This is done by generating a new set of uncorrelated variables referred to as the principal components. We first study the correlations between the original variables and PCs as well as proportions of variance for PCs, and we use principal components in generating RF models to predict the TCP throughput to indirectly confirm any hidden relationship between other variables and throughput

Objectives
Methods
Results
Discussion
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.