The most recommended data processing books

Who picked these books? Meet our 31 experts.

31 authors created a book list connected to data processing, and here are their favorite data processing books.
Shepherd is reader supported. When you buy books, we may earn an affiliate commission.

What type of data processing book?

Loading...
Loading...
Shuffle

Book cover of A First Course in Statistical Programming with R

Tilman M. Davies Author Of The Book of R: A First Course in Programming and Statistics

From my list on intro to programming and data science with R.

Why am I passionate about this?

Iā€™m an applied statistician and academic researcher/lecturer at New Zealandā€™s oldest university ā€“ the University of Otago. R facilitates everything I do ā€“ research, academic publication, and teaching. Itā€™s the latter part of my job that motivated my own book on R. From first-year statistics students who have never seen R to my own Ph.D. students using R to implement novel and highly complex statistical methods and models, my experience is that all ultimately love the ease with which the R language permits exploration, visualisation, analysis, and inference of oneā€™s data. The ever-growing need in todayā€™s society for skilled statisticians and data scientists means there's never been a better time to learn this essential language.

Tilman's book list on intro to programming and data science with R

Tilman M. Davies Why did Tilman love this book?

From well-known authorities in the R-sphere (including a former R Core Team member), this is a long-standing text whose first edition was one of the early books intended to teach R to beginners. It provides concise instructions and examples on how R is used as a programming language before focusing on 'number-crunching' statistical methods that are typically seen as computationally intensive. One of the notable features of this book is the statistical methods at hand are not just illustrated using 'black-box' code--the reader is provided with the necessary mathematical detail to understand what's going on behind the scenes for those that are so inclined.

By W. John Braun, Duncan J. Murdoch,

Why should I read it?

1 author picked A First Course in Statistical Programming with R as one of their favorite books, and they share why you should read it.

What is this book about?

This third edition of Braun and Murdoch's bestselling textbook now includes discussion of the use and design principles of the tidyverse packages in R, including expanded coverage of ggplot2, and R Markdown. The expanded simulation chapter introduces the Box-Muller and Metropolis-Hastings algorithms. New examples and exercises have been added throughout. This is the only introduction you'll need to start programming in R, the computing standard for analyzing data. This book comes with real R code that teaches the standards of the language. Unlike other introductory books on the R system, this book emphasizes portable programming skills that apply to mostā€¦


Book cover of Deep Medicine: How Artificial Intelligence Can Make Healthcare Human Again

Kerrie Holley Author Of AI-First Healthcare: AI Applications in the Business and Clinical Management of Health

From my list on artificial intelligence in health care.

Why am I passionate about this?

I fell in love with technology when I wrote my first computer program at age 14 when there was no public Internet, no personal computers, no iPhone, no cloud. I have made technical contributions to every era of computing from mainframes, to PCs, Internet, Cloud, and now AI. I was recently elected to the National Academy of Engineering. AI currently surpasses my wildest imagination on the art of whatā€™s possible. I'm still passionately working in technology at Google focused on how to live healthier lives. I believe we can make AI the telescope of the future, to helping everyone live long and healthy lives.

Kerrie's book list on artificial intelligence in health care

Kerrie Holley Why did Kerrie love this book?

This book explores how AI is transforming healthcare and the potential benefits it can bring to patients and doctors.

The author, Eric, is a cardiologist with working knowledge of technology of AI. I love how he describes with clarity, the present and potential to make people healthier with AI First thinking. That is, how AI can make the business of health care human.

I love the premise and basis of Ericā€™ thinking that we can make healthcare personalized, proactive, anticipatory, helping people live healthier lives and reducing the cost of healthcare. 

At the same time he is mindful that AI could be used to dehumanize healthcare and exacerbate existing inequalities.

By Eric Topol,

Why should I read it?

1 author picked Deep Medicine as one of their favorite books, and they share why you should read it.

What is this book about?

A visit to a physician these days is cold: physicians spend most of their time typing at computers, making minimal eye contact. Appointments generally last only a few minutes, with scarce time for the doctor to connect to a patient's story, or explain how and why different procedures and treatments might be undertaken. As a result, errors abound: indeed, misdiagnosis is the fourth-leading cause of death in the United States, trailing only heart disease, cancer, and stroke. This is because, despite having access to more resources than ever, doctors are vulnerable not just to the economic demand to see moreā€¦


Book cover of Mismeasuring Schools' Vital Signs: How to Avoid Misunderstanding, Misinterpreting, and Distorting Data

Jenny Grant Rankin Author Of Increasing the Impact of Your Research: A Practical Guide to Sharing Your Findings and Widening Your Reach

From Jenny's 3 favorite reads in 2023.

Why am I passionate about this?

Author Nerd Hyper Vegan Streetunwise

Jenny's 3 favorite reads in 2023

Jenny Grant Rankin Why did Jenny love this book?

Like professionals in other industries, educators are recognizing the power of data and are using it to guide their decision-making. Yet quantifying what works and what doesnā€™t when it comes to something as variable-rich as learning is extremely difficult.

Hence, as bright as they are, educators have only a 14% accuracy rate when interpreting student data. Fortunately, authors Rees and Wynns have exactly what is needed to remedy this problem.

They offer the hard-but-important-to-look-at facts concerning data use in our schools and pair it with a clear path to fixing problems. They pull engaging stories from their extensive experience and have a whip-smart writing style I envy.

One wonders how a book on data that uncovers harsh realities can be such an enjoyable read ā€“ the kind too enthralling to put down.

By Steve Rees, Jill Wynns,

Why should I read it?

1 author picked Mismeasuring Schools' Vital Signs as one of their favorite books, and they share why you should read it.

What is this book about?

This book helps school and district leaders avoid the pitfalls that await those making sense of their school's data. Whether you're interpreting achievement gaps, graduation rates or test results, you're at risk of reaching a mistaken judgment. By learning about common errors and how they're made, you'll be ready to choose safer, surer paths to making better sense of the wealth of data in your school or district. The authors help educators build better evidence, see conclusions more clearly, and explain the data more persuasively.

Special features Include:

"Questions to Spark Discussion" in each chapter encourage school site, district leaders,ā€¦


Book cover of R in Action: Data Analysis and Graphics with R

Tilman M. Davies Author Of The Book of R: A First Course in Programming and Statistics

From my list on intro to programming and data science with R.

Why am I passionate about this?

Iā€™m an applied statistician and academic researcher/lecturer at New Zealandā€™s oldest university ā€“ the University of Otago. R facilitates everything I do ā€“ research, academic publication, and teaching. Itā€™s the latter part of my job that motivated my own book on R. From first-year statistics students who have never seen R to my own Ph.D. students using R to implement novel and highly complex statistical methods and models, my experience is that all ultimately love the ease with which the R language permits exploration, visualisation, analysis, and inference of oneā€™s data. The ever-growing need in todayā€™s society for skilled statisticians and data scientists means there's never been a better time to learn this essential language.

Tilman's book list on intro to programming and data science with R

Tilman M. Davies Why did Tilman love this book?

This provides a superb balance between technical aspects of R coding and the statistical methods that motivate its use. It's rare to find a book on topics like this that are written with Kabacoff's easygoing yet precise style, which makes it ideal for beginners. From my own experience, it is obvious the author has spent many years teaching this type of content, knowing where things deserve extra explanation up front and where other more technical details can be relegated to more advanced texts.

By Robert I. Kabacoff,

Why should I read it?

1 author picked R in Action as one of their favorite books, and they share why you should read it.

What is this book about?

DESCRIPTION

R is a powerful language for statistical computing and graphics that can handle virtually any data-crunching task. It runs on all important platforms and provides thousands of useful specialized modules and utilities. This makes R a great way to get meaningful information from mountains of raw data.



R in Action, Second Edition is language tutorial focused on practical problems. Written by a research methodologist, it takes a direct and modular approach to quickly give readers the information they need to produce useful results. Focusing on realistic data analyses and a comprehensive integration of graphics, it follows the steps thatā€¦


Book cover of Kafka: The Definitive Guide: Real-Time Data and Stream Processing at Scale

Tomasz Lelek Author Of Software Mistakes and Tradeoffs: How to make good programming decisions

From my list on big data processing ecosystem.

Why am I passionate about this?

I am motivated by working on products that many people use. I've been a part of companies that deliver products impacting millions of people. To achieve it, I am working in the Big Data ecosystem and striving to simplify it by contributing to Dremio's Data LakeHouse solution. I worked on projects using Spark, HDFS, Cassandra, and Kafka technologies. I have been working in the software engineering industry for ten years now, and I've tried to share my experience and lessons learned in the Software Mistakes and Tradeoffs book, hoping that it will allow current and the next generation of engineers to create better software, leading to more happy users.

Tomasz's book list on big data processing ecosystem

Tomasz Lelek Why did Tomasz love this book?

Apache Kafka is the backbone of almost every streaming-based system today.

The solutions created and implemented in Kafka are the key concepts in every streaming system that you will work with.

This book will allow you to fully understand the Kafka architecture, its internals, and APIs and allow you to become an expert in this technology.

By Neha Narkhede, Gwen Shapira, Todd Palino

Why should I read it?

1 author picked Kafka as one of their favorite books, and they share why you should read it.

What is this book about?

Every enterprise application creates data, whether it's log messages, metrics, user activity, outgoing messages, or something else. And how to move all of this data becomes nearly as important as the data itself. If you're an application architect, developer, or production engineer new to Apache Kafka, this practical guide shows you how to use this open source streaming platform to handle real-time data feeds.

Engineers from Confluent and LinkedIn who are responsible for developing Kafka explain how to deploy production Kafka clusters, write reliable event-driven microservices, and build scalable stream-processing applications with this platform. Through detailed examples, you'll learn Kafka'sā€¦


Book cover of Super Founders: What Data Reveals About Billion-Dollar Startups

Simon Court Author Of Founder's Legacy: 50 Game-Changing Leadership Lessons for Building a Great Business

From my list on books for founders trying to be in the 10% of businesses that succeed.

Why am I passionate about this?

For the last 25 years, I have been a coach to business founders, leaders, and leadership teams. My work has taken me to every continent from my base in London. A lot of my work is done behind closed doors, but I have been instrumental in building two unicorns in the last decade. Iā€™m a founder myself and have always been fascinated by what it takes to succeed as a founder. I have a powerful conviction that learning to lead is the heart of it. The books I love are either based on real-world research or deeply practical and based on hands-on experience. Practice trumps theory every time in my world!

Simon's book list on books for founders trying to be in the 10% of businesses that succeed

Simon Court Why did Simon love this book?

I love the fact that Ali has provided evidence that ANYONE can succeed as a founder. He has done a lot of number-crunching on ā€˜unicornsā€™ (I admire his tenacity for that!) and examined what makes a successful startup founder and some surprises emerge.

Age, education, and number of cofounders were not predictors of a startupā€™s success, as many of us might have expected. The big thing that does matter is experience. Sixty percent of unicorn founders had previously launched startups. It seems that in pretty much every walk of life, including this one, practice is the key.

By Ali Tamaseb,

Why should I read it?

1 author picked Super Founders as one of their favorite books, and they share why you should read it.

What is this book about?

Every VC wants to find the next billion dollar company to invest in, and every startup wants to become one. Ali Tamaseb set out to find patterns in the backgrounds, methods, and trajectories of these companies, gathering and analyzing 40,000 data points about the 200+ billion dollar companies and the people who founded them. And you'll be surprised by what he discovered:

* Half of unicorn founders are over 35;
* Most founders don't have any directly relevant work experience in the industry they're disrupting;
* There's no disadvantage to being a solo founder;
* Sixty percent of billion dollarā€¦


Book cover of Journey to the Moon (Library of Flight)

Don Eyles Author Of Sunburst and Luminary: An Apollo Memoir

From my list on by Apollo insiders.

Why am I passionate about this?

I have read most of the books written about Apollo, especially those ostensibly written by my fellow participants. I have read these books for pleasure, to find out about parts of the moon effort that I did not see first-hand, and to learn what I could from the authorsā€™ mistakes and successes ā€” with a view to the writing of my own book. The books I have come to value the most are the books that seem to have been created for some other reason than commercial gain, the books unmarred by ghostwriting or heavy-handed editing, the books where the authorā€™s authentic voice speaks from the page.

Don's book list on by Apollo insiders

Don Eyles Why did Don love this book?

Eldon Hall led the development of the Apollo Guidance Computer, that one-cubic-foot device with 76kb of memory that navigated, guided, and controlled each of the Apollo spacecraft ā€” the machine that I helped program. His book is both a detailed description of the Apollo computer and a history of its development. The most dramatic chapter chronicles the bold decision to use integrated circuits in the design of the computer ā€” all of the same type, to encourage the vendor to keep making them ā€” although that technology was then anything but reliable. 

By Eldon C. Hall,

Why should I read it?

1 author picked Journey to the Moon (Library of Flight) as one of their favorite books, and they share why you should read it.

What is this book about?

The first of its kind, Journey to the Moon details the history and design of the computer that enabled U.S. astronauts to land on the moon. The book recalls the history of computer technology, both hardware and software, and the applications of digital computing to missile guidance systems and manned spacecraft. The book also offers graphics and photos drawn from the Draper Laboratories' archives that illustrate the technology and related events during the Apollo project. Written for experts as well as lay persons, Journey to the Moon is the first book of its kind and a must for anyone interestedā€¦


Book cover of Predict and Surveil: Data, Discretion, and the Future of Policing

Luke Hunt Author Of Police Deception and Dishonesty: The Logic of Lying

From my list on the cluster-f*ck we call policing.

Why am I passionate about this?

Iā€™m an Associate Professor in the University of Alabamaā€™s Department of Philosophy. I worked as an FBI Special Agent before making the natural transition to academic philosophy. Being a professor was always a close second to Quantico, but that scene in Point Break in which Keanu Reeves and Patrick Swayze fight Anthony Kiedis on the beach made it seem like the FBI would be more fun than academia. In my current position as a professor at the University of Alabama, I teach in my departmentā€™s Jurisprudence Specialization. My primary research interests are at the intersection of philosophy of law, political philosophy, and criminal justice. Iā€™ve written three books on policing.

Luke's book list on the cluster-f*ck we call policing

Luke Hunt Why did Luke love this book?

I love this book because it reminds us of the many ways that technology can affect justice.

It is tempting to think sophisticated tactics such as ā€œpredictive policingā€ can solve all problems relating to human bias. However, Brayne shows that data and algorithms do not eliminate bias and discretion. Instead, high-tech police tools simply make bias less overt and visible, which erodes the publicā€™s ability to hold the police accountable.

I especially enjoyed how the book flips the script, considering diverse ways to use these tools to help the public. For example, how can municipalities use technology to analyze the underlying factors that contribute to policing problems in the first place?

By Sarah Brayne,

Why should I read it?

1 author picked Predict and Surveil as one of their favorite books, and they share why you should read it.

What is this book about?

The scope of criminal justice surveillance, from the police to the prisons, has expanded rapidly in recent decades. At the same time, the use of big data has spread across a range of fields, including finance, politics, health, and marketing. While law enforcement's use of big data is hotly contested, very little is known about how the police actually use it in daily operations and with what consequences.

In Predict and Surveil, Sarah Brayne offers an unprecedented, inside look at how police use big data and new surveillance technologies, leveraging on-the-ground fieldwork with one of the most technologically advanced lawā€¦


Book cover of Getting Started with p5.js: Making Interactive Graphics in JavaScript and Processing

Scott Murray Author Of Unstuck: Javascript

From my list on learning how to code interactive graphics.

Why am I passionate about this?

Iā€™ve been making web pages since the World Wide Web began in the mid-1990s. Back then, the web was visually quite sparse. It wasnā€™t until the late 2000s that new browser capabilities let the web get visually interesting and an exciting place for interactive graphics. Graphics are great: they can be informational (like charts and maps) or purely aesthetic. My personal journey of learning to code interactive graphics has been so rewarding that Iā€™ve shared the love with others through teaching creative coding workshops and undergraduate courses. If youā€™re new to coding or computer graphics, I hope youā€™ll give one of these books a try!

Scott's book list on learning how to code interactive graphics

Scott Murray Why did Scott love this book?

If I were getting started with coding graphics today, I would start with this book, hands down. Learning p5 is the easiest way to create interactive graphics that run in a web browser, and this book is a very friendly, accessible, and beautifully illustrated introduction to coding graphics with p5.jsā€”no prior experience needed. You might be wondering about the name ā€œp5.jsā€. Itā€™s a JavaScript library (thatā€™s the ā€œ.jsā€ part) based on Processing, the open-source programming language created for artists and designers. (More on Processing in a moment.) I have taught college courses with this book, and students love it. Plus, all the skills you learn here with p5 are applicable to JavaScriptā€”the worldā€™s most popular programming languageā€”more generally.

By Lauren McCarthy, Casey Reas, Ben Fry

Why should I read it?

1 author picked Getting Started with p5.js as one of their favorite books, and they share why you should read it.

What is this book about?

Processing opened up the world of programming to artists, designers, educators, and beginners. The p5.js JavaScript implementation of Processing reinterprets it for today's web. This short book gently introduces the core concepts of computer programming and working with Processing. Written by the co-founders of the Processing project, Reas and Fry, along with Lauren McCarthy, one of the minds behind p5.js, Getting Started with Processing gets you in on the fun!


Book cover of All-in On AI: How Smart Companies Win Big with Artificial Intelligence

Roger W. Hoerl Author Of Statistical Thinking: Improving Business Performance

From my list on AI and data science that are actually readable.

Why am I passionate about this?

As a professional statistician, I am naturally interested in AI and data science. However, in our current information age, everyone, in all segments of society, needs to understand the basics of AI and data science. These basics include such things as what these disciplines are, what they can contribute to society, and perhaps most importantly, what can go wrong. However, I have found that much of the literature on these topics is highly technical and beyond the reach of most readers. These books are specifically selected because they are readable by virtually everyone, and yet convey the key concepts needed to be data-literate in the 21st century. Enjoy!

Roger's book list on AI and data science that are actually readable

Roger W. Hoerl Why did Roger love this book?

Books on AI often go to extremes, either promoting it as the solution to all the worldā€™s problems, or depicting it as an evil that will destroy humanity.

This book is much more practical, and based on experience using AI in actual business applications. It is the result of considerable research, involving investigation of applications not only in silicon-valley, but from various business sectors, such as Airbus, Ping, Progressive Insurance, and Capital One Bank.

Donā€™t let the title fool you; this book is not simply a promotion of AI, but addresses the practical issues that have to be considered if success is to be achieved. For example, they argue that ā€œthe most important aspect in AI success is not machinery, but human leadership, behavior, and change.ā€

By Thomas H. Davenport, Nitin Mittal,

Why should I read it?

1 author picked All-in On AI as one of their favorite books, and they share why you should read it.

What is this book about?

A Wall Street Journal bestseller

A Publisher's Weekly bestseller

A fascinating look at the trailblazing companies using artificial intelligence to create new competitive advantage, from the author of the business classic, Competing on Analytics, and the head of Deloitte's US AI practice.

Though most organizations are placing modest bets on artificial intelligence, there is a world-class group of companies that are going all-in on the technology and radically transforming their products, processes, strategies, customer relationships, and cultures.

Though these organizations represent less than 1 percent of large companies, they are all high performers in their industries. They have better businessā€¦