2022 IEEE Real-Time Systems Symposium (RTSS) / IEEE Computer Society.
- Format:
- Book
- Author/Creator:
- IEEE Computer Society, author, issuing body.
- Language:
- English
- Subjects (All):
- Real-time data processing--Congresses.
- Real-time data processing.
- Physical Description:
- 1 online resource
- Other Title:
- 2022 IEEE Real-Time Systems Symposium
- Place of Publication:
- Piscataway : IEEE Computer Society, 2022.
- Summary:
- While high accuracy is of paramount importance for deep learning (DL) inference, serving inference requests on time is equally critical but has not been carefully studied, especially when the request has to be served over a dynamic wireless network at the edge. In this paper, we propose Jellyfish, a novel edge DL inference serving system that achieves soft guarantees on end-to-end inference latency, often specified as a service-level objective (SLO). To handle the network variability, Jellyfish exploits both data and deep neural network (DNN) adaptation to conduct tradeoffs between accuracy and latency. Jellyfish features a new design that enables collective adaptation policies where the decisions for data and DNN adaptations are aligned and coordinated among multiple users with varying network conditions. We propose efficient algorithms to dynamically adapt DNNs and map users, so that we fulfill latency SLOs while maximizing the overall inference accuracy. Our experiments based on a prototype implementation and real-world WiFi and LTE network traces show that Jellyfish can meet latency SLOs at around the 99th percentile while maintaining high accuracy.
- Notes:
- Description based on publisher supplied metadata and other sources.
- ISBN:
- 9781665453462
- 166545346X