Introduction to Natural Language Processing

Sampul Depan
MIT Press, 1 Okt 2019 - 536 halaman
A survey of computational methods for understanding, generating, and manipulating human language, which offers a synthesis of classical representations and algorithms with contemporary machine learning techniques.

This textbook provides a technical perspective on natural language processing—methods for building computer software that understands, generates, and manipulates human language. It emphasizes contemporary data-driven approaches, focusing on techniques from supervised and unsupervised machine learning. The first section establishes a foundation in machine learning by building a set of tools that will be used throughout the book and applying them to word-based textual analysis. The second section introduces structured representations of language, including sequences, trees, and graphs. The third section explores different approaches to the representation and analysis of linguistic meaning, ranging from formal logic to neural word embeddings. The final section offers chapter-length treatments of three transformative applications of natural language processing: information extraction, machine translation, and text generation. End-of-chapter exercises include both paper-and-pencil analysis and software implementation.

The text synthesizes and distills a broad and diverse research literature, linking contemporary machine learning techniques with the field's linguistic and computational foundations. It is suitable for use in advanced undergraduate and graduate-level courses and as a reference for software engineers and data scientists. Readers should have a background in computer programming and college-level mathematics. After mastering the material presented, students will have the technical skill to build and analyze novel natural language processing systems and to understand the latest research in the field.

 

Halaman terpilih

Isi

I
8
3
47
4
67
5
91
6
119
7
137
8
9
Reference Resolution 333
Discourse 357
APPLICATIONS 377
Information Extraction 379
18
19
Probability 447
Modeling and Estimation 453

10
PredicateArgument Semantics 289
Distributional and Distributed Semantics 309
Bibliography 459
Index
Hak Cipta

Edisi yang lain - Lihat semua

Istilah dan frasa umum

Tentang pengarang (2019)

Jacob Eisenstein works at Google as a research scientist. He was previously on the faculty in the School of Interactive Computing at Georgia Institute of Technology.

Informasi bibliografi