Skip to Content


RStudio's Hadley Wickham has just introduced a new package for filtering, selecting, restructuring and aggregating tabular data in R: the dplyr package. It's similar in concept to Hadley's original plyr package from 2009, but with several key improvements:

Choropleth maps are a popular way of representing spatial or geographic data, where a statistic of interest (say, income, voting results or crime rate) are color-coded by region.

The team at AMPLab has announced a developer preview of SparkR, an R package enabling R users to run jobs on an Apache Spark cluster.  Spark is an open source project that supports distributed in-memory computing for advanced analytics, such as fast queries, machine learning, streaming analytics and graph engines.

Zach King consistently manages to combine humour, surprise and magic into six-second vignettes on his Vine channel. Here's a compilation — it's 7 minutes total, but I bet you watch to the end.


In case you missed them, here are some articles from December of particular interest to R users:

A ComputerWorld tutorial on basic data processing with R.

Prediction: R will replace legacy SAS solutions and go mainstream.

O'Reilly has just published the results of the Data Scientist Salary Survey, based on data collected from attendees of the O'Reilly Strata conferences in 2012 and 2013. There were some interesting results from the salary portion of the survey:

Our crack-shot R trainer Luba Gloukhov generated a spirited (pun intended!) discussion from her post K-means Clustering 86 Single Malt Scotch Whiskies, with mentions of her analysis at FlowingData and

David Smith's picture
January 13, 2014

We had a marvellous series of guest posts here on the blog over the past few weeks. I'd like to give a special thanks to all of our guest bloggers for contributing, with special thanks to Joe Rickert for stepping in as our acting editor for the past 3 weeks. If you were celebrating or vacationing over the holidays, here's what you missed:

I tweeted out the image below earlier this month, and it quickly went viral: