Skip to Content

Blogs

Want to see how you can use a drag-and-drop user interface to run and share R code? Check out our webinar next Wednesday January 29 (hosted by Alteryx and Revolution Analytics): Creating Value That Scales with Revolution Analytics & Alteryx

RStudio's Hadley Wickham has just introduced a new package for filtering, selecting, restructuring and aggregating tabular data in R: the dplyr package. It's similar in concept to Hadley's original plyr package from 2009, but with several key improvements:

Choropleth maps are a popular way of representing spatial or geographic data, where a statistic of interest (say, income, voting results or crime rate) are color-coded by region.

The team at AMPLab has announced a developer preview of SparkR, an R package enabling R users to run jobs on an Apache Spark cluster.  Spark is an open source project that supports distributed in-memory computing for advanced analytics, such as fast queries, machine learning, streaming analytics and graph engines.

Zach King consistently manages to combine humour, surprise and magic into six-second vignettes on his Vine channel. Here's a compilation — it's 7 minutes total, but I bet you watch to the end.

 

In case you missed them, here are some articles from December of particular interest to R users:

A ComputerWorld tutorial on basic data processing with R.

Prediction: R will replace legacy SAS solutions and go mainstream.

O'Reilly has just published the results of the Data Scientist Salary Survey, based on data collected from attendees of the O'Reilly Strata conferences in 2012 and 2013. There were some interesting results from the salary portion of the survey:

Our crack-shot R trainer Luba Gloukhov generated a spirited (pun intended!) discussion from her post K-means Clustering 86 Single Malt Scotch Whiskies, with mentions of her analysis at FlowingData and