Not long ago, I stumbled on a book about data science that piqued my interest. I found it on the O'Reilly site, which is a subscription based site that has tons of technical books, videos, and other learning material. I like the selection of material available on O'Reilly, but I don't really care for the apps they provide for customers to consume their products, so, instead, I convert their online books into ePubs so that I can read them offline, on my iPhone or iPad, when and however I like.
The book was written in 2015 by a guy named Jeroen Janssens from the Netherlands. He started the original edition while working on his PhD about half a dozen years ago. The central theme of his book is the intersection of data science and the use of command line tools. Because the state of tools has changed so much since his first publication, he embarked upon an effort to update it. The cool part is that he's doing this in an open source way, using GitHub to collect feedback from h