
Web Corpus Construction (Synthesis Lectures on Human Language Technologies)




Product Description

The World Wide Web constitutes the largest existing source of texts written in a great variety of languages. A feasible and sound way of exploiting this data for linguistic research is to compile a static corpus for a given language. This approach has several advantages: (i) Working with such corpora obviates the problems encountered when using Internet search engines in quantitative linguistic research (such as non-transparent ranking algorithms). (ii) Creating a corpus from web data is virtually free. (iii) The size of corpora compiled from the WWW may exceed by several orders of magnitude the size of language resources offered elsewhere. (iv) The data is locally available to the user, who can post-process and query it linguistically with the tools of their choice. This book addresses the main practical tasks in the creation of web corpora up to giga-token size. Among these tasks are the sampling process (i.e., web crawling) and the usual cleanups, including boilerplate removal and removal of duplicated content. Linguistic processing, and the problems that the different kinds of noise in web corpora cause for it, are also covered. Finally, the authors show how web corpora can be evaluated and compared to other corpora (such as traditionally compiled corpora).

For additional material please visit the companion website: sites.morganclaypool.com/wcc

Table of Contents: Preface / Acknowledgments / Web Corpora / Data Collection / Post-Processing / Linguistic Processing / Corpus Evaluation and Comparison / Bibliography / Authors' Biographies

Additional Information

Manufacturer: Morgan & Claypool Publishers
Part Number: black & white illustrations
Publisher: Morgan & Claypool Publishers
Studio: Morgan & Claypool Publishers
MPN: black & white illustrations
EAN: 9781608459834
Item Weight: 0.58 pounds
Item Size: 0.33 x 9.25 x 9.25 inches
Package Weight: 0.75 pounds
Package Size: 7.5 x 0.33 x 0.33 inches


Brand: Morgan & Claypool Publishers
Condition: New
Lead Time: 1 - 2 Business Days
Availability: In Stock
$40.00
$39.99






© 2018 - translateth.is. All Rights Reserved.