Latency-critical Applications Research Articles

Function-as-service (FaaS) platforms promise a simpler programming model for cloud computing, given that providers take care of the overall resource management while the developers can concentrate only on writing their applications in the scope of a function, not having to care about installing or scaling resources. As FaaS users are billed based on the execution of the functions, platform providers have a natural incentive not to keep idle resources running at the platform’s expense. However, this strategy may lead to the cold start issue, in which the execution of a function is delayed because there are no ready resources to host the execution. Despite the time to provision the computational resources, starting the runtime environment that manages the function execution can take hundreds of milliseconds to seconds, a prohibitive non-deterministic overhead to latency-critical applications. This work describes a practical technique, Prebaking, to reduce the function start-up time based on restoring snapshots of previously executed processes. The foundation of the Prebaking technique is that deciding when to create a snapshot of a function is essential. For example, based on a prototype we developed using the CRIU checkpoint/restore Linux tool, the performance improvement on the start-up of a properly snapshotted function is up to 25 times. Fortunately, generating a good function snapshot is easy: One only needs to checkpoint a warm function, that is, a function that has processed a request. To show the feasibility of using the Prebaking technique, we discussed integrating it into OpenFaaS, an open-source FaaS platform, including how to generate and restore function snapshots. We also evaluated our prototype by running a comprehensive set of experiments that compare the function start-up duration and cold start latency against the standard fork-exec procedure. We analyze the JVM, CPython, and Node.js runtimes. The results indicate that the technique can improve the function replica start-up time even for no-warmed functions: we improved the function replica start-up time from 3.5 times for a “do-nothing” function running in Node.js up to 12 times when considering an Image Resizer function running in CPython. Finally, we reinforce the practicability of the proposed technique by comparing our proposal to another snapshot-based technique called SEUSS. That evaluation shows that Prebaking provides better cold start latency even for slightly complex functions like the Markdown Renderer.

Read full abstract

Unlicensed cellular networks are being deployed worldwide by cellular operators to meet the rising data demands. However, the unlicensed band has existing incumbents such as Wi-Fi and radar systems. This creates a highly dynamic environment, making harmonious unlicensed coexistence difficult. Consequently, conventional optimization techniques are not sufficient to offer latency-critical applications and services. A data-driven hybrid optimization approach is necessary for optimal network performance with low convergence times. However, a largely unexplored problem in dense unlicensed network optimization is the accuracy-speed trade-off, that is, achieving high accuracy in optimization objectives with minimal time costs. This work seeks to address this problem through a hybrid optimization approach that combines machine learning and network optimization. It investigates the use of more precise higher-order network feature relationships (NFRs) in optimization formulations and the consequent trade-off that arises between the increase in convergence time (Speed) and the nearness to optimal results (Accuracy). In addition, it demonstrates the relevance of context awareness of network conditions and the traffic environment to mitigate the trade-off. To that end, a context-aware network feature relationship-based optimization (CANEFRO) approach is proposed and validated through decision matrix analysis. The experiments were carried out on a coexistence testbed consisting of both unlicensed LTE standards (LTE-U & LAA) and two Wi-Fi standards (802.11n/ac) on multiple channel bandwidths. In addition, LTE-U & LAA are contrasted on signaling and user data traffic data models and resource block allocation performance. More importantly, CANEFRO demonstrates the impact of the network context on the degree of feature relationship ( <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$2^{nd}$ </tex-math></inline-formula> & <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> <tex-math notation="LaTeX">$3^{rd}$ </tex-math></inline-formula> degree polynomials), objective of optimization (SINR and Capacity), and the network use case (Accuracy vs. Speed). CANEFRO is also used to contrast LTE-U & LAA optimization performance. In particular, the decision matrix analysis demonstrates a higher decision score for LAA by as much as 42% compared to LTE-U.

Read full abstract

Latency-critical Applications Research Articles

Related Topics

Articles published on Latency-critical Applications

Agile C-states: A Core C-state Architecture for Latency Critical Applications Optimizing both Transition and Cold-Start Latency

ISwap: A New Memory Page Swap Mechanism for Reducing Ineffective I/O Operations in Cloud Environments

Prebaking runtime environments to improve the FaaS cold start latency

Metaheuristics Method for Computation Offloading In Mobile Edge Computing: Survey

Efficient Mobility Management in Mobile Edge Computing Networks: Joint Handover and Service Migration

Cooperative Service Placement and Request Routing in Mobile Edge Networks for Latency-Sensitive Applications

A Markov Decision Process Solution for Energy-Saving Network Selection and Computation Offloading in Vehicular Networks

Aerial-Aided Multiaccess Edge Computing: Dynamic and Joint Optimization of Task and Service Placement and Routing in Multilayer Networks

Resource Management and Reflection Optimization for Intelligent Reflecting Surface Assisted Multi-Access Edge Computing Using Deep Reinforcement Learning

CoreNap: Energy Efficient Core Allocation for Latency-Critical Workloads

Resource Scheduling in Edge Computing: Architecture, Taxonomy, Open Issues and Future Research Directions

Mitigating Trade-Off in Unlicensed Network Optimization Through Machine Learning and Context Awareness

A Survey on Nongeostationary Satellite Systems: The Communication Perspective

A Two-Timescale Approach to Mobility Management for Multicell Mobile Edge Computing

BLOCKCHAIN-ENABLED FOG RESOURCE ACCESS AND GRANTING

Cloud-Edge Orchestration for Smart Cities: A Review of Kubernetes-based Orchestration Architectures

Effective Task Scheduling in Critical Fog Applications

Cost and Latency Optimized Edge Computing Platform

Effect of Hyper-Threading in Latency-Critical Multithreaded Cloud Applications and Utilization Analysis of the Major System Resources

Image and Video Coding Techniques for Ultra-low Latency

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Latency-critical Applications Research Articles

Related Topics

Articles published on Latency-critical Applications

Agile C-states: A Core C-state Architecture for Latency Critical Applications Optimizing both Transition and Cold-Start Latency

ISwap: A New Memory Page Swap Mechanism for Reducing Ineffective I/O Operations in Cloud Environments

Prebaking runtime environments to improve the FaaS cold start latency

Metaheuristics Method for Computation Offloading In Mobile Edge Computing: Survey

Efficient Mobility Management in Mobile Edge Computing Networks: Joint Handover and Service Migration

Cooperative Service Placement and Request Routing in Mobile Edge Networks for Latency-Sensitive Applications

A Markov Decision Process Solution for Energy-Saving Network Selection and Computation Offloading in Vehicular Networks

Aerial-Aided Multiaccess Edge Computing: Dynamic and Joint Optimization of Task and Service Placement and Routing in Multilayer Networks

Resource Management and Reflection Optimization for Intelligent Reflecting Surface Assisted Multi-Access Edge Computing Using Deep Reinforcement Learning

CoreNap: Energy Efficient Core Allocation for Latency-Critical Workloads

Resource Scheduling in Edge Computing: Architecture, Taxonomy, Open Issues and Future Research Directions

Mitigating Trade-Off in Unlicensed Network Optimization Through Machine Learning and Context Awareness

A Survey on Nongeostationary Satellite Systems: The Communication Perspective

A Two-Timescale Approach to Mobility Management for Multicell Mobile Edge Computing

BLOCKCHAIN-ENABLED FOG RESOURCE ACCESS AND GRANTING

Cloud-Edge Orchestration for Smart Cities: A Review of Kubernetes-based Orchestration Architectures

Effective Task Scheduling in Critical Fog Applications

Cost and Latency Optimized Edge Computing Platform

Effect of Hyper-Threading in Latency-Critical Multithreaded Cloud Applications and Utilization Analysis of the Major System Resources

Image and Video Coding Techniques for Ultra-low Latency