Skip to Content


All around the world at noon GMT on April 28, data scientists around the world will compete in the world's first one-day International Data Science Hackathon, organized by Data Science London. Participants will receive a data set at the beginning of the event, and work in teams of 3-5 over the following 24 hours to create the best predictive model from the data.

Most of the time when we're programming in R, we don't think about how R gets from an object name (say, "stdev") to what it represents (a function to calculate standard deviation, perhaps). If you're writing functions, you've probably know about R's lexical scoping.

Marketing is one of the pioneering domains when it comes to applications of predictive analytics to Big Data.

You probably already knew that you can draw mathematical equations in Google by typing the equation into the search box. For example, here's the Standard Normal density function:

I can't find a way to embed the graph directly, but if you click on it you'll find it's interactive: you can inspect points, zoom in/out etc. You can create a similar chart in R quite easily with the command:

When the USA codified its copyright and patent laws in the 18th century, the goal was to create a robust public domain that all of society could benefit from, while still giving inventors and content creators a window in which to capitalize on their creative investment. Filmmaker Kirby Ferguson concludes his excellent Everything is a Remix series with a look at how over the years the laws have turned to preventing content from entering the public domain and protecting creators indefinitely:


David Smith's picture
March 30, 2012

The image below isn't a bearskin rug in the shape of the USA. It's fact, it's a visualization of the wind flowing over the United States, as of 4PM EDT today, March 30.

The R core group today announced the availability of R 2.15.0, codenamed "Easter Beagle". If you build R yourself, the new source distribution (including packages for Debian Linux) is available now, and binaries for Windows, MacOS and Linux will be available from your local CRAN mirror over the next couple of days.

Many thanks to Jim Guszcza (Predictive Analytics lead at Deloitte Consulting and Assistant Professor at UW-Madison) who gave a great webinar presentation yesterday on actuarial analysis with R

Tomorrow (at 2PM Eastern on Thursday), the White House White House Office of Science and Technology Policy will host a 90-minute forum on Challenges and Opportunities in Big Data.

The Google Summer of Code project provides funding for students to write and contribute code to open-source projects of interest to Google.