What’s the difference between machine learning, statistics, and data mining?

Over the last few blog posts, I’ve discussed some of the basics of what machine learning is and why it’s important: – Why machine learning will reshape software engineering – What is the core task of machine learning – How to get started in machine learning in R Throughout those posts, I’ve been using the … Read more

How to make a small multiples chart in R

An important principle in analyzing data is “overview first, zoom and filter, then details on demand” (quote: Ben Shneiderman) In practice, this typically means starting at a high level with a single chart, and then “zooming into” the data by replicating that chart for specific subsets of the dataset. And, even more valuable is being … Read more

How to build an R line chart, step by step (and the importance of process)

Last week, I was talking to a guy who’s learning analytics, coaching him on what skills to learn next and helping him plan a career path. He’s a smart guy with an analytical background and minor coding experience, but he’s new to R.

Towards the end of the conversation, I asked him, “what’s the biggest challenge you have right now, learning analytics.”

His response? “The code is intimidating.”