Building and Using Comparable Corpora for Multilingual Natural Language Processing

Author : Serge Sharoff
Publisher : Springer Nature
Total Pages : 138
Release : 2023-08-23
ISBN-10 : 3031313844
ISBN-13 : 9783031313844

Book Synopsis: Building and Using Comparable Corpora for Multilingual Natural Language Processing by Serge Sharoff

Building and Using Comparable Corpora for Multilingual Natural Language Processing, written by Serge Sharoff and published by Springer Nature, was released on 2023-08-23 and runs to 138 pages. Book excerpt: This book provides a comprehensive overview of methods to build comparable corpora and of their applications, including machine translation, cross-lingual transfer, and various kinds of multilingual natural language processing. The authors begin with a brief history of the topic, followed by a comparison with parallel resources and an explanation of why comparable corpora have become more widely used. In particular, comparable corpora provide the basis for the multilingual capabilities of pre-trained models such as BERT or GPT. The book then focuses on building comparable corpora, aligning their sentences to create a database of suitable translations, and using these sentence translations to produce dictionaries and term banks. Finally, it explains how comparable corpora can be used to build machine translation engines and to develop a wide variety of multilingual applications.


Related Books

Building and Using Comparable Corpora for Multilingual Natural Language Processing
Language: en
Pages: 138
Authors: Serge Sharoff
Categories: Computers
Type: BOOK - Published: 2023-08-23 - Publisher: Springer Nature

This book provides a comprehensive overview of methods to build comparable corpora and of their applications, including machine translation, cross-lingual transfer, and various kinds of multilingual natural language processing.
Building and Using Comparable Corpora
Language: en
Pages: 333
Authors: Serge Sharoff
Categories: Computers
Type: BOOK - Published: 2013-12-13 - Publisher: Springer Science & Business Media

The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining
Corpus Analysis for Language Studies at the University Level
Language: en
Pages: 176
Authors: Giedrė Valūnaitė Oleškevičienė
Categories: Language Arts & Disciplines
Type: BOOK - Published: 2021-02-08 - Publisher: Cambridge Scholars Publishing

This book highlights corpora use in teaching foreign languages in university education. It will appeal to both academics and practitioners interested in the pro
Data Analytics and Management in Data Intensive Domains
Language: en
Pages: 231
Authors: Alexander Sychev
Categories: Computers
Type: BOOK - Published: 2021-07-15 - Publisher: Springer Nature

This book constitutes the post-conference proceedings of the 22nd International Conference on Data Analytics and Management in Data Intensive Domains, DAMDID/RC
Computational Phraseology
Language: en
Pages: 341
Authors: Gloria Corpas Pastor
Categories: Language Arts & Disciplines
Type: BOOK - Published: 2020-05-15 - Publisher: John Benjamins Publishing Company

Whether you wish to deliver on a promise, take a walk down memory lane or even on the wild side, phraseological units (also often referred to as phrasemes or mu