Algorithm-accelerator Co-design for High-performance and Secure Deep Learning

Algorithm-accelerator Co-design for High-performance and Secure Deep Learning
Author :
Publisher :
Total Pages : 0
Release :
ISBN-10 : OCLC:1404076818
ISBN-13 :
Rating : 4/5 ( Downloads)

Book Synopsis Algorithm-accelerator Co-design for High-performance and Secure Deep Learning by : Weizhe Hua

Download or read book Algorithm-accelerator Co-design for High-performance and Secure Deep Learning written by Weizhe Hua and published by . This book was released on 2022 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: Deep learning has emerged as a new engine for many of today's artificial intelligence/machine learning systems, leading to several recent breakthroughs in vision and natural language processing tasks.However, as we move into the era of deep learning with billions and even trillions of parameters, meeting the computational and memory requirements to train and serve state-of-the-art models has become extremely challenging. Optimizing the computational cost and memory footprint of deep learning models for better system performance is critical to the widespread deployment of deep learning. Moreover, a massive amount of sensitive and private user data is exposed to the deep learning system during the training or serving process. Therefore, it is essential to investigate potential vulnerabilities in existing deep learning hardware, and then design secure deep learning systems that provide strong privacy guarantees for user data and the models that learn from the data. In this dissertation, we propose to co-design the deep learning algorithms and hardware architectural techniques to improve both the performance and security/privacy of deep learning systems. On high-performance deep learning, we first introduce channel gating neural network (CGNet), which exploits the dynamic sparsity of specific inputs to reduce computation of convolutional neural networks. We also co-develop an ASIC accelerator for CGNet that can turn theoretical FLOP reduction into wall-clock speedup. Secondly, we present Fast Linear Attention with a Single Head (FLASH), a state-of-the-art language model specifically designed for Google's TPU that can achieve transformer-level quality with linear complexity with respect to the sequence length. Through our empirical studies on masked language modeling, auto-regressive language modeling, and fine-tuning for question answering, FLASH achieves at least similar if not better quality compared to the augmented transformer, while being significantly faster (e.g., up to 12 times faster). On the security of deep learning, we study the side-channel vulnerabilities of existing deep learning accelerators. We then introduce a secure accelerator architecture for privacy-preserving deep learning, named GuardNN. GuardNN provides a trusted execution environment (TEE) with specialized protection for deep learning, and achieves a small trusted computing base and low protection overhead at the same time. The FPGA prototype of GuardNN achieves a maximum performance overhead of 2.4\% across four different modern DNNs models for ImageNet.


Algorithm-accelerator Co-design for High-performance and Secure Deep Learning Related Books

Algorithm-accelerator Co-design for High-performance and Secure Deep Learning
Language: en
Pages: 0
Authors: Weizhe Hua
Categories:
Type: BOOK - Published: 2022 - Publisher:

DOWNLOAD EBOOK

Deep learning has emerged as a new engine for many of today's artificial intelligence/machine learning systems, leading to several recent breakthroughs in visio
Accelerator Architecture for Secure and Energy Efficient Machine Learning
Language: en
Pages: 0
Authors: Mohammad Hossein Samavatian
Categories: Computer architecture
Type: BOOK - Published: 2022 - Publisher:

DOWNLOAD EBOOK

ML applications are driving the next computing revolution. In this context both performance and security are crucial. We propose hardware/software co-design sol
Deep Learning for Computer Architects
Language: en
Pages: 125
Authors: Brandon Reagen
Categories: Computers
Type: BOOK - Published: 2017-08-22 - Publisher: Morgan & Claypool Publishers

DOWNLOAD EBOOK

This is a primer written for computer architects in the new and rapidly evolving field of deep learning. It reviews how machine learning has evolved since its i
Data Orchestration in Deep Learning Accelerators
Language: en
Pages: 158
Authors: Tushar Krishna
Categories: Technology & Engineering
Type: BOOK - Published: 2022-05-31 - Publisher: Springer Nature

DOWNLOAD EBOOK

This Synthesis Lecture focuses on techniques for efficient data orchestration within DNN accelerators. The End of Moore's Law, coupled with the increasing growt
Algorithm-Centric Design of Reliable and Efficient Deep Learning Processing Systems
Language: en
Pages: 0
Authors: Elbruz Ozen
Categories:
Type: BOOK - Published: 2023 - Publisher:

DOWNLOAD EBOOK

Artificial intelligence techniques driven by deep learning have experienced significant advancements in the past decade. The usage of deep learning methods has