Search Results

Efficient and Scalable Transfer Learning for Natural Language Processing

Download or Read eBook Efficient and Scalable Transfer Learning for Natural Language Processing PDF written by Kevin Stefan Clark and published by . This book was released on 2021 with total page pages. Available in PDF, EPUB and Kindle.
Efficient and Scalable Transfer Learning for Natural Language Processing
Author :
Publisher :
Total Pages :
Release :
ISBN-10 : OCLC:1243112741
ISBN-13 :
Rating : 4/5 (41 Downloads)

Book Synopsis Efficient and Scalable Transfer Learning for Natural Language Processing by : Kevin Stefan Clark

Book excerpt: Neural networks work best when trained on large amounts of data, but most labeled datasets in natural language processing (NLP) are small. As a result, neural NLP models often overfit to idiosyncrasies and artifacts in their training data rather than learning generalizable patterns. Transfer learning offers a solution: instead of learning a single task from scratch and in isolation, the model can benefit from the wealth of text on the web or other tasks with rich annotations. This additional data enables the training of bigger, more expressive networks. However, it also dramatically increases the computational cost of training, with recent models taking up to hundreds of GPU years to train. To alleviate this cost, I develop transfer learning methods that learn much more efficiently than previous approaches while remaining highly scalable. First, I present a multi-task learning algorithm based on knowledge distillation that consistently improves over single-task training even when learning many diverse tasks. I next develop Cross-View Training, which revitalizes semi-supervised learning methods from the statistical era of NLP (self-training and co-training) while taking advantage of neural methods. The resulting models outperform pre-trained LSTM language models such as ELMo while training 10x faster. Lastly, I present ELECTRA, a self-supervised pre-training method for transformer networks based on energy-based models. ELECTRA learns 4x--10x faster than previous approaches such as BERT, resulting in excellent performance on natural language understanding tasks both when trained at large scale or even when it is trained on a single GPU.


Efficient and Scalable Transfer Learning for Natural Language Processing Related Books

Efficient and Scalable Transfer Learning for Natural Language Processing
Language: en
Pages:
Authors: Kevin Stefan Clark
Categories:
Type: BOOK - Published: 2021 - Publisher:

DOWNLOAD EBOOK

Neural networks work best when trained on large amounts of data, but most labeled datasets in natural language processing (NLP) are small. As a result, neural N
Transfer Learning for Natural Language Processing
Language: en
Pages: 262
Authors: Paul Azunre
Categories: Computers
Type: BOOK - Published: 2021-08-31 - Publisher: Simon and Schuster

DOWNLOAD EBOOK

Build custom NLP models in record time by adapting pre-trained machine learning models to solve specialized problems. Summary In Transfer Learning for Natural L
Advanced Natural Language Processing with TensorFlow 2
Language: en
Pages: 381
Authors: Ashish Bansal
Categories: Computers
Type: BOOK - Published: 2021-02-04 - Publisher: Packt Publishing Ltd

DOWNLOAD EBOOK

One-stop solution for NLP practitioners, ML developers, and data scientists to build effective NLP systems that can perform real-world complicated tasks Key Fea
Introduction to Transfer Learning
Language: en
Pages: 333
Authors: Jindong Wang
Categories: Computers
Type: BOOK - Published: 2023-03-30 - Publisher: Springer Nature

DOWNLOAD EBOOK

Transfer learning is one of the most important technologies in the era of artificial intelligence and deep learning. It seeks to leverage existing knowledge by
Natural Language Processing in Action
Language: en
Pages: 798
Authors: Hannes Hapke
Categories: Computers
Type: BOOK - Published: 2019-03-16 - Publisher: Simon and Schuster

DOWNLOAD EBOOK

Summary Natural Language Processing in Action is your guide to creating machines that understand human language using the power of Python with its ecosystem of
Scroll to top