Machine Learning Algorithm and System Co-design for Hardware Efficiency

Author : Cheng Fu
Release : 2023
OCLC : 1405221788

Book Synopsis: Machine Learning Algorithm and System Co-design for Hardware Efficiency, by Cheng Fu

Written by Cheng Fu and released in 2023. Book excerpt:

Deep Neural Networks (DNNs) are increasingly adopted in various fields due to their unprecedented performance. Yet the computation overhead of DNN evaluation and training continues to grow exponentially. Recent system-level advances have improved the efficiency of DNN computation, and many efficient DNN algorithms have been proposed to reduce its cost. However, the result is still far from optimal: most current DNN computation systems do not fully exploit efficient ML algorithms, while ML algorithms fail to consider novel systems for deployment.

This thesis focuses on designing efficient DNN algorithms and DNN computation systems to enable fast DNN training and evaluation. Instead of focusing solely on either new DNN algorithms or new systems, we propose a co-design approach that explores DNN models that are system-aware and DNN computation systems that are algorithm-aware. This co-design approach advances the Pareto frontier between task accuracy and efficiency of DNN execution in various application domains.

We present a set of works that explore the co-design method. First, we present an algorithm-aware DNN compiler for quantized DNNs. By leveraging the weight repetition found in these models, we greatly reduce the computation overhead of DNN inference on both CPU and GPU. This work illustrates how algorithm-aware system design can help push the Pareto frontier.

Second, we discuss a hardware-aware DNN algorithm with enhanced model parallelism. We observe that previous works design efficient DNNs for single-device platforms, and that such models can hardly be parallelized across multiple devices. Customizing the DNN design for a multi-device system reduces DNN inference latency by a large margin.

Third, we present a hardware-friendly transfer learning framework for natural language processing tasks. Existing transfer learning frameworks perform a lot of redundant computation when deployed on existing systems. By reusing computation across different transfer learning models, we greatly reduce this overhead.

Lastly, we introduce a novel training method to reduce the computation cost of DNN training and of the DNN design process. The key idea is to initialize large models using the weights of small pretrained models. The implicit knowledge in the pretrained models facilitates faster convergence of the large models. Because only the initialization phase of training changes, no extra computation overhead is introduced to existing training systems. This training method can also be applied to accelerate the design process of system-aware DNN models.

As Moore's Law slows down, the computational capacity of current DNN systems is plateauing. This thesis sheds light on how to overcome this limitation by designing domain-specific DNN algorithms and computation systems.
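To make the weight-repetition idea in the excerpt concrete, here is a minimal sketch of how repeated weight values can cut the number of multiplications in a dot product. It is not the thesis's compiler: the function names, the 2-bit weight set, and the sample data are all assumptions chosen for illustration. The only property it relies on is that a quantized layer has few distinct weight values, so activations sharing a weight can be summed before a single multiply.

```python
from collections import defaultdict


def dot_naive(weights, activations):
    """Reference dot product: one multiplication per weight."""
    return sum(w * a for w, a in zip(weights, activations))


def dot_weight_repetition(weights, activations):
    """Dot product that groups activations by repeated weight value.

    Activations that share a weight value are summed first, so the number
    of multiplications equals the number of distinct weight values.
    """
    partial_sums = defaultdict(int)
    for w, a in zip(weights, activations):
        partial_sums[w] += a
    return sum(w * s for w, s in partial_sums.items())


# Hypothetical 2-bit quantized weights: only four distinct values occur.
weights = [-1, 0, 1, 1, -1, 2, 0, 1]
activations = [3, 5, 2, 7, 1, 4, 6, 8]

result = dot_weight_repetition(weights, activations)
multiplies = len(set(weights))  # multiplications used by the grouped version
```

With these sample values the grouped version performs four multiplications instead of eight and returns the same result as the naive loop. How much this saves in a real kernel depends on memory layout and vectorization, which this sketch ignores.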


Machine Learning Algorithm and System Co-design for Hardware Efficiency Related Books

Embedded Machine Learning for Cyber-Physical, IoT, and Edge Computing
Language: en
Pages: 481
Authors: Sudeep Pasricha
Categories: Technology & Engineering
Type: BOOK - Published: 2023-10-09 - Publisher: Springer Nature

This book presents recent advances towards the goal of enabling efficient implementation of machine learning models on resource-constrained systems, covering di
Embedded Machine Learning for Cyber-Physical, IoT, and Edge Computing
Language: en
Pages: 418
Authors: Sudeep Pasricha
Categories: Technology & Engineering
Type: BOOK - Published: 2023-11-01 - Publisher: Springer Nature

This book presents recent advances towards the goal of enabling efficient implementation of machine learning models on resource-constrained systems, covering di
Hardware-aware Algorithms for Efficient Machine Learning
Language: en
Pages: 0
Authors: Tri Dao Phuc Quang
Categories:
Type: BOOK - Published: 2023 - Publisher:

Machine learning (ML) training will continue to grow to consume more cycles, their inference will proliferate on more kinds of devices, and their capabilities w
Efficient Processing of Deep Neural Networks
Language: en
Pages: 254
Authors: Vivienne Sze
Categories: Technology & Engineering
Type: BOOK - Published: 2022-05-31 - Publisher: Springer Nature

This book provides a structured treatment of the key principles and techniques for enabling efficient processing of deep neural networks (DNNs). DNNs are curren