Effectively Access, Transform, Manipulate, Visualize, and Reason about Data and Computation
Data Science in R: A Case Studies Approach to Computational Reasoning and Problem Solving illustrates the details involved in solving real computational problems encountered in data analysis. It reveals the dynamic and iterative process by which data analysts approach a problem and reason about different ways of implementing solutions.
The book’s collection of projects, comprehensive sample solutions, and follow-up exercises encompass practical topics pertaining to data processing, including:
- Non-standard, complex data formats, such as robot logs and email messages
- Text processing and regular expressions
- Newer technologies, such as Web scraping, Web services, Keyhole Markup Language (KML), and Google Earth
- Statistical methods, such as classification trees, k-nearest neighbors, and naïve Bayes
- Visualization and exploratory data analysis
- Relational databases and Structured Query Language (SQL)
- Algorithm implementation
- Large data and efficiency
Suitable for self-study or as supplementary reading in a statistical computing course, the book enables instructors to incorporate interesting problems into their courses so that students gain valuable experience and data science skills. Students learn how to acquire and work with unstructured or semistructured data as well as how to narrow down and carefully frame the questions of interest about the data.
Blending computational details with statistical and data analysis concepts, this book provides readers with an understanding of how professional data scientists think about daily computational tasks. It will improve readers’ computational reasoning of real-world data analyses.
|Manufacturer:||Chapman and Hall/CRC|
|Part Number:||79 black & white illustrations|
|Publisher:||Chapman and Hall/CRC|
|Studio:||Chapman and Hall/CRC|
|MPN:||79 black & white illustrations|
|Item Weight:||0 pounds|
|Item Size:||1 x 9 x 9 inches|
|Package Weight:||2.95 pounds|
|Package Size:||6.9 x 1.2 x 1.2 inches|
Have questions about this item, or would like to inquire about a custom or bulk order?
If you have any questions about this product by CRC Press, contact us by completing and submitting the form below. If you are looking for a specif part number, please include it with your message.
By O'Reilly Media
mpn: 48032261, ean: 9781491912218, isbn: 1491912219,
Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei...
By Brand: Chapman and Hall/CRC
ean: 9781420085921, isbn: 9781420085921,
The Handbook of Natural Language Processing, Second Edition presents practical tools and techniques for implementing natural language processing in computer systems. Along with removing outdated material, this edition updates every chapter and expand...