IEEE Transactions on Network Science and Engineering

Cloud Versus Edge Deployment Strategies of Real-Time Face Recognition Inference


Abstract

Choosing the appropriate deployment strategy for any Deep Learning (DL) project in a production environment has always been the most challenging problem for industrial practitioners. There are several conflicting constraints and controversial approaches when it comes to deployment. Among these problems, deployment on the cloud versus deployment on the edge represents a common dilemma: in a nutshell, each approach provides benefits where the other has limitations. This paper presents a real-world case study on deploying a face recognition application built on the MTCNN detector and the FaceNet recognizer. We report the challenges we faced in deciding on the best deployment strategy. We propose three inference architectures for the deployment: cloud-based, edge-based, and hybrid. Furthermore, we evaluate the performance of face recognition inference on different cloud-based and edge-based GPU platforms. We consider different models of Jetson boards for the edge (Nano, TX2, Xavier NX, Xavier AGX) and various GPUs for the cloud (GTX 1080, RTX 2080 Ti, RTX 2070, and RTX 8000). We also investigate the effect of deep learning model optimization using TensorRT and TFLite, compared to a standard TensorFlow GPU model, as well as the effect of input resolution. We provide a benchmarking study for all these devices in terms of frames per second, execution time, energy consumption, and memory usage. After conducting a total of 294 experiments, the results demonstrate that TensorRT optimization provides the fastest execution on all cloud and edge devices, at the expense of significantly higher energy consumption (up to +40 and +35 for edge and cloud devices, respectively, compared to TensorFlow), whereas TFLite is the most efficient framework in terms of memory and power consumption, while providing significantly less processing acceleration (-4 to -62) than TensorRT.

Practitioners Note: The study reported in this paper presents the real challenges that we faced during our development and deployment of a face recognition application both on the edge and on the cloud, and the solutions we developed to solve these problems. The code, results, and interactive analytic dashboards of this paper will be made public upon publication.
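As a concrete illustration of the pipeline the abstract describes (MTCNN for face detection feeding FaceNet for recognition, benchmarked in frames per second), the sketch below shows how such an inference loop could look. It assumes the `mtcnn` pip package and a FaceNet Keras model stored at the hypothetical path `facenet_keras.h5`; it is an illustrative sketch, not the authors' exact implementation.

```python
# Minimal sketch of an MTCNN + FaceNet inference loop with a rough FPS
# measurement. The `mtcnn` package and the "facenet_keras.h5" weights path
# are assumptions for illustration only.
import time

import cv2
import numpy as np
import tensorflow as tf
from mtcnn import MTCNN

detector = MTCNN()                                          # detection stage
embedder = tf.keras.models.load_model("facenet_keras.h5")   # recognition stage (hypothetical path)

def embed_faces(frame_bgr):
    """Detect faces in a BGR frame and return one embedding per detected face."""
    rgb = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2RGB)
    embeddings = []
    for det in detector.detect_faces(rgb):
        x, y, w, h = det["box"]
        face = rgb[max(y, 0):y + h, max(x, 0):x + w]
        face = cv2.resize(face, (160, 160)).astype("float32")
        face = (face - face.mean()) / face.std()            # per-image standardization
        embeddings.append(embedder.predict(face[np.newaxis], verbose=0)[0])
    return embeddings

# Rough throughput measurement over 100 frames from the default camera.
cap = cv2.VideoCapture(0)
frames, start = 0, time.time()
while frames < 100:
    ok, frame = cap.read()
    if not ok:
        break
    embed_faces(frame)
    frames += 1
cap.release()
print(f"~{frames / (time.time() - start):.1f} FPS")
```

On an edge board, the same loop would typically read from the on-board camera and run an optimized embedder rather than the plain Keras model.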
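The abstract also compares a standard TensorFlow GPU model against TensorRT- and TFLite-optimized variants. The sketch below shows one way such variants could be produced from a SavedModel export; the `facenet_saved_model/` directory is a hypothetical path, and the exact TF-TRT converter arguments vary across TensorFlow versions.

```python
# Hedged sketch of producing the two optimized model variants compared in the
# paper: a TFLite model (lowest memory/power footprint) and a TF-TRT model
# (fastest execution, at higher energy cost). "facenet_saved_model" is a
# hypothetical SavedModel export path.
import tensorflow as tf
from tensorflow.python.compiler.tensorrt import trt_convert as trt

SAVED_MODEL_DIR = "facenet_saved_model"

# --- TFLite conversion with default (weight) quantization ---
converter = tf.lite.TFLiteConverter.from_saved_model(SAVED_MODEL_DIR)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
with open("facenet.tflite", "wb") as f:
    f.write(converter.convert())

# --- TF-TRT conversion to an FP16 TensorRT-optimized graph ---
# (The converter's keyword arguments differ slightly between TF releases.)
trt_converter = trt.TrtGraphConverterV2(
    input_saved_model_dir=SAVED_MODEL_DIR,
    precision_mode=trt.TrtPrecisionMode.FP16,
)
trt_converter.convert()
trt_converter.save("facenet_trt")
```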
