Search Results

Hardware-aware Algorithms for Efficient Machine Learning

Download or Read eBook Hardware-aware Algorithms for Efficient Machine Learning PDF written by Tri Dao Phuc Quang and published by . This book was released on 2023 with total page 0 pages. Available in PDF, EPUB and Kindle.
Hardware-aware Algorithms for Efficient Machine Learning
Author :
Publisher :
Total Pages : 0
Release :
ISBN-10 : OCLC:1383660999
ISBN-13 :
Rating : 4/5 (99 Downloads)

Book Synopsis Hardware-aware Algorithms for Efficient Machine Learning by : Tri Dao Phuc Quang

Book excerpt: Machine learning (ML) training will continue to grow to consume more cycles, their inference will proliferate on more kinds of devices, and their capabilities will be used in more domains. Some goals central to this future are to make ML models efficient so they remain practical to train and deploy, and to unlock new application domains with new capabilities. We describe some recent developments in hardware-aware algorithms to improve the efficiency-quality tradeoff of ML models and equip them with long context. In Chapter 2, we focus on structured sparsity, a natural approach to mitigate the extensive compute and memory cost of large ML models. We describe a line of work on learnable fast transforms that, thanks to their expressiveness and efficiency, yields some of the first sparse training methods to speed up large models in wall-clock time (2x) without compromising their quality. In Chapter 3, we focus on efficient Transformer training and inference for long sequences. We describe FlashAttention, a fast and memory-efficient algorithm to compute attention with no approximation. By careful accounting of reads/writes between different levels of memory hierarchy, FlashAttention is 2-4x faster and uses 10-20x less memory compared to the best existing attention implementations, allowing us to train higher-quality Transformers with 8x longer context. FlashAttention is now widely used in some of the largest research labs and companies. In Chapter 4, we examine state-space models, a promising architecture designed for long-range memory. As we seek to understand why early state-space models did not perform well on language modeling tasks, we propose simple multiplicative interaction that expands their expressiveness. We also design hardware-friendly algorithms to train them. As a result, we are able to train state-space models to multi-billion parameter scale, demonstrating a new kind of model competitive with the dominant Transformers in language modeling. We conclude with some exciting directions in ML and systems, such as software-hardware co-design, structured sparsity for scientific AI, and long context for new AI workflows and modalities.


Hardware-aware Algorithms for Efficient Machine Learning Related Books

Hardware-aware Algorithms for Efficient Machine Learning
Language: en
Pages: 0
Authors: Tri Dao Phuc Quang
Categories:
Type: BOOK - Published: 2023 - Publisher:

DOWNLOAD EBOOK

Machine learning (ML) training will continue to grow to consume more cycles, their inference will proliferate on more kinds of devices, and their capabilities w
Hardware-Aware Probabilistic Machine Learning Models
Language: en
Pages: 163
Authors: Laura Isabel Galindez Olascoaga
Categories: Technology & Engineering
Type: BOOK - Published: 2021-05-19 - Publisher: Springer Nature

DOWNLOAD EBOOK

This book proposes probabilistic machine learning models that represent the hardware properties of the device hosting them. These models can be used to evaluate
Efficient Machine Learning Software Stack from Algorithms to Compilation
Language: en
Pages: 0
Authors: Zixuan Jiang
Categories:
Type: BOOK - Published: 2023 - Publisher:

DOWNLOAD EBOOK

Machine learning enables the extraction of knowledge from data and decision-making without explicit programming, achieving great success and revolutionizing man
Efficient Processing of Deep Neural Networks
Language: en
Pages: 254
Authors: Vivienne Sze
Categories: Technology & Engineering
Type: BOOK - Published: 2022-05-31 - Publisher: Springer Nature

DOWNLOAD EBOOK

This book provides a structured treatment of the key principles and techniques for enabling efficient processing of deep neural networks (DNNs). DNNs are curren
Machine Learning Algorithm and System Co-design for Hardware Efficiency
Language: en
Pages: 0
Authors: Cheng Fu
Categories:
Type: BOOK - Published: 2023 - Publisher:

DOWNLOAD EBOOK

Deep Neural Networks (DNNs) are increasingly adopted in various fields due to their unprecedented performance. Yet, the computation overhead of DNN evaluation a
Scroll to top