Efficient and Scalable Transfer Learning for Natural Language Processing
Author: Kevin Stefan Clark
Release: 2021
ISBN-10: OCLC:1243112741
Book excerpt: Neural networks work best when trained on large amounts of data, but most labeled datasets in natural language processing (NLP) are small. As a result, neural NLP models often overfit to idiosyncrasies and artifacts in their training data rather than learning generalizable patterns. Transfer learning offers a solution: instead of learning a single task from scratch and in isolation, the model can benefit from the wealth of text on the web or from other tasks with rich annotations. This additional data enables the training of bigger, more expressive networks. However, it also dramatically increases the computational cost of training, with recent models taking up to hundreds of GPU-years to train. To alleviate this cost, I develop transfer learning methods that learn much more efficiently than previous approaches while remaining highly scalable.

First, I present a multi-task learning algorithm based on knowledge distillation that consistently improves over single-task training, even when learning many diverse tasks. Next, I develop Cross-View Training, which revitalizes semi-supervised learning methods from the statistical era of NLP (self-training and co-training) while taking advantage of neural methods. The resulting models outperform pre-trained LSTM language models such as ELMo while training 10x faster. Lastly, I present ELECTRA, a self-supervised pre-training method for transformer networks based on energy-based models. ELECTRA learns 4x--10x faster than previous approaches such as BERT, achieving excellent performance on natural language understanding tasks both when trained at large scale and when trained on a single GPU.
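The knowledge-distillation objective behind the multi-task method mentioned above can be illustrated with a minimal sketch: the student is trained against a blend of the teacher's softened output distribution and the gold label. The function names, the temperature of 2.0, and the 0.5 mixing weight here are illustrative assumptions, not details taken from the thesis.

```python
import math

def softmax(logits, temperature=1.0):
    # Numerically stable softmax with optional temperature scaling.
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(student_logits, teacher_logits, label,
                      temperature=2.0, alpha=0.5):
    # Soft loss: cross-entropy between the teacher's softened
    # distribution and the student's, scaled by T^2 as is conventional.
    teacher_probs = softmax(teacher_logits, temperature)
    student_probs = softmax(student_logits, temperature)
    soft = -sum(t * math.log(s) for t, s in zip(teacher_probs, student_probs))
    # Hard loss: standard cross-entropy against the gold label.
    hard = -math.log(softmax(student_logits)[label])
    return alpha * temperature ** 2 * soft + (1 - alpha) * hard
```

A higher temperature spreads the teacher's probability mass over more classes, exposing the "dark knowledge" in its relative preferences among wrong answers.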
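Cross-View Training's core idea, auxiliary prediction modules that see only restricted views of the input (for example, only the left context) and are trained to match the full model's prediction on unlabeled data, can be sketched as a simple consistency loss. The function name and setup below are illustrative, not from the thesis.

```python
import math

def cvt_consistency_loss(full_view_probs, partial_view_probs):
    # Average cross-entropy of each restricted-view module's prediction
    # against the full model's distribution, which is treated as a
    # fixed target (no gradient flows through it during training).
    loss = 0.0
    for probs in partial_view_probs:
        loss += -sum(p * math.log(q) for p, q in zip(full_view_probs, probs))
    return loss / len(partial_view_probs)
```

Minimizing this loss pushes the restricted-view modules toward the full model's outputs, which in turn improves the shared representation they are all built on, a neural analogue of self-training.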
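ELECTRA's publicly described pre-training task is replaced token detection: some input tokens are swapped for plausible alternatives, and a discriminator predicts, for every position, whether the token is original or replaced. A minimal data-construction sketch follows; it uses uniform random replacements for simplicity, whereas ELECTRA proper samples replacements from a small masked-LM generator, and `corrupt_tokens` and its parameters are hypothetical names, not the thesis's API.

```python
import random

def corrupt_tokens(tokens, vocab, replace_prob=0.15, seed=0):
    # Replace a fraction of tokens and record binary "was replaced"
    # labels: the discriminator's targets in replaced token detection.
    # (Assumption: uniform sampling stands in for ELECTRA's generator.)
    rng = random.Random(seed)
    corrupted, labels = [], []
    for tok in tokens:
        if rng.random() < replace_prob:
            replacement = rng.choice([v for v in vocab if v != tok])
            corrupted.append(replacement)
            labels.append(1)  # token was replaced
        else:
            corrupted.append(tok)
            labels.append(0)  # token is original
    return corrupted, labels
```

Because the discriminator receives a training signal at every position rather than only at the ~15% of masked positions, this objective extracts more learning per example, which is one intuition for the 4x--10x speedup over BERT-style masked language modeling claimed above.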