Veridical Data Science

Veridical Data Science
Author :
Publisher : MIT Press
Total Pages : 527
Release :
ISBN-10 : 9780262379700
ISBN-13 : 0262379708
Rating : 4/5 (708 Downloads)

Book Synopsis Veridical Data Science by : Bin Yu

Download or read book Veridical Data Science written by Bin Yu and published by MIT Press. This book was released on 2024-10-15 with total page 527 pages. Available in PDF, EPUB and Kindle. Book excerpt: Using real-world data case studies, this innovative and accessible textbook introduces an actionable framework for conducting trustworthy data science. Most textbooks present data science as a linear analytic process involving a set of statistical and computational techniques without accounting for the challenges intrinsic to real-world applications. Veridical Data Science, by contrast, embraces the reality that most projects begin with an ambiguous domain question and messy data; it acknowledges that datasets are mere approximations of reality while analyses are mental constructs. Bin Yu and Rebecca Barter employ the innovative Predictability, Computability, and Stability (PCS) framework to assess the trustworthiness and relevance of data-driven results relative to three sources of uncertainty that arise throughout the data science life cycle: the human decisions and judgment calls made during data collection, cleaning, and modeling. By providing real-world data case studies, intuitive explanations of common statistical and machine learning techniques, and supplementary R and Python code, Veridical Data Science offers a clear and actionable guide for conducting responsible data science. Requiring little background knowledge, this lucid, self-contained textbook provides a solid foundation and principled framework for future study of advanced methods in machine learning, statistics, and data science. Presents the Predictability, Computability, and Stability (PCS) methodology for producing trustworthy data-driven results Teaches how a data science project should be conducted from beginning to end, including extensive discussion of the data scientist's decision-making process Cultivates critical thinking throughout the entire data science life cycle Provides practical examples and illuminating case studies of real-world data analysis problems with associated code, exercises, and solutions Suitable for advanced undergraduate and graduate students, domain scientists, and practitioners


Veridical Data Science Related Books

Veridical Data Science
Language: en
Pages: 527
Authors: Bin Yu
Categories: Computers
Type: BOOK - Published: 2024-10-15 - Publisher: MIT Press

GET EBOOK

Using real-world data case studies, this innovative and accessible textbook introduces an actionable framework for conducting trustworthy data science. Most tex
Machine Learning and Data Science
Language: en
Pages: 276
Authors: Prateek Agrawal
Categories: Computers
Type: BOOK - Published: 2022-07-25 - Publisher: John Wiley & Sons

GET EBOOK

MACHINE LEARNING AND DATA SCIENCE Written and edited by a team of experts in the field, this collection of papers reflects the most up-to-date and comprehensive
Principles of Managerial Statistics and Data Science
Language: en
Pages: 688
Authors: Roberto Rivera
Categories: Mathematics
Type: BOOK - Published: 2020-02-05 - Publisher: John Wiley & Sons

GET EBOOK

Introduces readers to the principles of managerial statistics and data science, with an emphasis on statistical literacy of business students Through a statisti
Data Science and Machine Learning
Language: en
Pages: 538
Authors: Dirk P. Kroese
Categories: Business & Economics
Type: BOOK - Published: 2019-11-20 - Publisher: CRC Press

GET EBOOK

Focuses on mathematical understanding Presentation is self-contained, accessible, and comprehensive Full color throughout Extensive list of exercises and worked
Game Theory for Data Science
Language: en
Pages: 135
Authors: Boi Mirsky
Categories: Computers
Type: BOOK - Published: 2022-05-31 - Publisher: Springer Nature

GET EBOOK

Intelligent systems often depend on data provided by information agents, for example, sensor data or crowdsourced human computation. Providing accurate and rele