Skip to Content


Patrick Burns, author of the excellent R Inferno, gave a presentation about R at the Cambridge R User Group this week. (Revolution Analytics is a proud sponsor of CambR.) I wasn't at the presentation myself, but Pat always gives a great talk, and he's generously provided his s

For the past 12 years, KDNuggets has conducted an annual poll asking "What analytics/data mining software you used in the past 12 months for a real project (not just evaluation)". In this year's poll, R was the top-ranked data mining solution, selected by 30.7% of poll respondents. Microsoft Excel was second, at 29.8%. Rapidminer, which took the #1 spot over R in 2011 and 2010, ranked third.

The next release of open-source R, codenamed "Roasted Marshmallows", is scheduled to be released on June 22, according to this announcement on the r-announce mailing list. Don't expect too many changes in this update: despite the fact that "there have been very few issues with 2.15.0 ... some people may be waiting superstitiously for a .1 release".

The results of the 2012 Future of Open Source Survey were presented at last week's OSBC conference in San Francisco. (Revolution Analytics is a collaborator in the Survey.) You can read a summary of the results in this press release, but one interesting trend is evident from Slide 13 of the presentation:

If you're planning a house party this long weekend, you might want to take a cue from Everett Hiller to liven up your party snaps with a bit of Bill Murray:

And what party is complete without Tom Cruise on a piñata?

In computing, social networks are traditionally represented as graphs: a connection of nodes (people), pairs of which may be connected by edges (friend relationships). Visually, the social networks can then be represented like this:

As a discipline, Data Science is growing up fast. That's my key takeaway from the 2012 Data Science Summit.

In conjunction with Facebook's record-setting IPO last Thursday, the New York Times created an infographic to put the size of the offer in context with other recent IPOs. A detail of the graphic as it appeared in the print edition appears below:

As widely reported by CNN, the Huffington Post, Talking Points Memo, the sophistication of speeches by US politicians has declined in recent years, dropping from an 11th-grade level in 2005 to a 10th-grade level today.

If you want to get a quick numerical summary of a data set, the summary function gives a nice overview for data frames: