My Account Log in

1 option

Deep Learning for Natural Language Processing : A Gentle Introduction / Mihai Surdeanu and Marco Antonio Valenzuela-Escárcega.

Cambridge eBooks: Frontlist 2024 Available online

View online
Format:
Book
Author/Creator:
Surdeanu, Mihai, author.
Valenzuela-Escárcega, Marco Antonio, author.
Language:
English
Subjects (All):
Natural language processing (Computer science).
Deep learning (Machine learning).
Physical Description:
1 online resource (xviii, 325 pages) : digital, PDF file(s).
Edition:
First edition.
Place of Publication:
Cambridge : Cambridge University Press, [2024]
Summary:
Deep Learning is becoming increasingly important in a technology-dominated world. However, the building of computational models that accurately represent linguistic structures is complex, as it involves an in-depth knowledge of neural networks, and the understanding of advanced mathematical concepts such as calculus and statistics. This book makes these complexities accessible to those from a humanities and social sciences background, by providing a clear introduction to deep learning for natural language processing. It covers both theoretical and practical aspects, and assumes minimal knowledge of machine learning, explaining the theory behind natural language in an easy-to-read way. It includes pseudo code for the simpler algorithms discussed, and actual Python code for the more complicated architectures, using modern deep learning libraries such as PyTorch and Hugging Face. Providing the necessary theoretical foundation and practical tools, this book will enable readers to immediately begin building real-world, practical natural language processing systems
Contents:
Cover
Half-title page
Title page
Imprints page
Contents
List of Figures
List of Tables
Preface
1 Introduction
1.1 What This Book Covers
1.2 What This Book Does Not Cover
1.3 Deep Learning Is Not Perfect
1.4 Mathematical Notations
2 The Perceptron
2.1 Machine Learning Is Easy
2.2 Use Case: Text Classification
2.3 Evaluation Measures for Text Classification
2.4 The Perceptron
2.5 Voting Perceptron
2.6 Average Perceptron
2.7 Drawbacks of the Perceptron
2.8 Historical Background
2.9 References and Further Readings
2.10 Summary
3 Logistic Regression
3.1 The Logistic Regression Decision Function and Learning Algorithm
3.2 The Logistic Regression Cost Function
3.3 Gradient Descent
3.4 Deriving the Logistic Regression Update Rule
3.5 From Binary to Multiclass Classification
3.6 Evaluation Measures for Multiclass Text Classification
3.7 Drawbacks of Logistic Regression
3.8 Historical Background
3.9 References and Further Readings
3.10 Summary
4 Implementing Text Classification Using Perceptron and Logistic Regression
4.1 Binary Classification
4.2 Multiclass Classification
4.3 Summary
5 Feed-Forward Neural Networks
5.1 Architecture of Feed-Forward Neural Networks
5.2 Learning Algorithm for Neural Networks
5.3 The Equations of Backpropagation
5.4 Drawbacks of Neural Networks (So Far)
5.5 Historical Background
5.6 References and Further Readings
5.7 Summary
6 Best Practices in Deep Learning
6.1 Minibatching
6.2 Other Optimization Algorithms
6.3 Other Activation Functions
6.4 Cost Functions
6.5 Regularization
6.6 Dropout
6.7 Temporal Averaging
6.8 Parameter Initialization and Normalization
6.9 References and Further Readings
6.10 Summary
7 Implementing Text Classification with Feed-Forward Networks
7.1 Data
7.2 Fully Connected Neural Network
7.3 Training
7.4 Summary
8 Distributional Hypothesis and Representation Learning
8.1 Traditional Distributional Representations
8.2 Matrix Decompositions and Low-Rank Approximations
8.3 Drawbacks of Representation Learning Using Low-Rank Approximation
8.4 The Word2vec Algorithm
8.5 Drawbacks of the Word2vec Algorithm
8.6 Historical Background
8.7 References and Further Readings
8.8 Summary
9 Implementing Text Classification Using Word Embeddings
9.1 PretrainedWord Embeddings
9.2 Text Classification with PretrainedWord Embeddings
9.3 Summary
10 Recurrent Neural Networks
10.1 Vanilla Recurrent Neural Networks
10.2 Deep Recurrent Neural Networks
10.3 The Problem with Simple Recurrent Neural Networks: Vanishing Gradient
10.4 Long Short-Term Memory Networks
10.5 Conditional Random Fields
10.6 Drawbacks of Recurrent Neural Networks
10.7 Historical Background
10.8 References and Further Readings
10.9 Summary
Notes:
Title from publisher's bibliographic system (viewed on 02 Feb 2024).
Description based on print version record.
Includes bibliographical references and index.
ISBN:
9781009027809
1009027808
9781009027991
1009027999
9781009026222
1009026224

The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.

Find

Home Release notes

My Account

Shelf Request an item Bookmarks Fines and fees Settings

Guides

Using the Find catalog Using Articles+ Using your account