Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing
Tóm tắt
Từ khóa
Tài liệu tham khảo
hadidi, 2018, Musical chair: Efficient real-time recognition using collaborative IoT devices, arXiv 1802 02138
islam, 2019, Zygarde: Time-sensitive on-device deep intelligence on intermittently-powered systems, arXiv 1905 03854
xu, 2017, Enabling cooperative inference of deep learning on wearables and smartphones, arXiv 1712 03073
eshratifar, 2018, JointDNN: An efficient training and inference engine for intelligent mobile cloud computing services, arXiv 1801 08618
2017, Augmented and Virtual Reality The First Wave of 5G Killer Apps
iandola, 2016, SqueezeNet: AlexNet-level accuracy with 50 $\times$ fewer parameters and <0.5 MB model size, arXiv 1602 07360
kim, 2019, $\mu$ Layer: Low latency on-device inference using cooperative single-layer acceleration and processor-friendly quantization, 14th European Conf Proc, 45
pham, 2018, Efficient neural architecture search via parameter sharing, arXiv 1802 03268
van den oord, 2016, WaveNet: A generative model for raw audio, arXiv 1609 03499
mulhollon, 2004, Wondershaper
kulick, 2013, Bayesian Changepoint Detection
han, 2015, Deep compression: Compressing deep neural networks with pruning, trained quantization and Huffman coding, arXiv 1510 00149 [cs]
krizhevsky, 2009, Learning multiple layers of features from tiny images
howard, 2017, MobileNets: Efficient convolutional neural networks for mobile vision applications, arXiv 1704 04861
krizhevsky, 2012, ImageNet classification with deep convolutional neural networks, Proc Adv Neural Inf Process Syst, 1097
tokui, 2015, Chainer: A next-generation open source framework for deep learning, Proc 29th Annu Conf Neural Inf Process Syst, 1
kim, 2015, Compression of deep convolutional neural networks for fast and low power mobile applications, arXiv 1511 06530
adams, 2007, Bayesian online changepoint detection, arXiv 0710 3742
liu, 2018, Progressive neural architecture search, Proc Eur Conf Comput Vis (ECCV), 19
cai, 2018, Efficient architecture search by network transformation, Proc 32nd AAAI Conf Artif Intell, 2787