Skip to Content

Blogs

by Terry M. Therneau Ph.D.
Faculty, Mayo Clinic

About a year ago there was a query about how to do "type 3" tests for a Cox model on the R help list, which someone wanted because SAS does it. The SAS addition looked suspicious to me, but as the author of the survival package I thought I should understand the issue more deeply. It took far longer than I expected but has been illuminating.

How do you summarize fashion? For New York Fashion Week, the New York Times used the idea of "Fashion Fingerprints", distilling a designer's collections into small fragments highlighting the palette. Here's what Marc Jacobs' current collection looks like:

For his PhD at Delft University of Technology's Faculty of Mechanical Engineering, Thomas Geijtenbeek created robots that learned how to walk. These were virtual robots — simulations in a computer system — but with realistic muscles, joints and mass that behave in real-life ways.

If you haven't heard the buzz about Docker but you often need to spin up Linux-based VM's for testing, simulations, etc. then you should check it out. In short, Docker rocks: we use it for testing our Linux-based distros of Revolution R Open.

by Andrie de Vries

Last week we announced the availability of Revolution R Open, an enhanced distribution of R.  One of the enhancements is the inclusion of high performance linear algebra libraries, specifically the Intel MKL. This library significantly speeds up many statistical calculations, e.g. the matrix algebra that forms the basis of many statistical algorithms.

by Jamie F Olson
Professional Services Consultant, Revolution Analytics

One challenge in transitioning R code into a production environment is ensuring consistency and reliability. These challenges span a wide variety of issues, but runtime characteristics are an important operational characteristic. Specifically, production code should have a consistent, predictable runtime for a particular computational infrastructure. Among other things, this makes it possible to plan and scale IT infrastructure based on operational requirements.

Many R scripts depend on CRAN packages, and most CRAN packages in turn depend on other CRAN packages. If you install an R package, you'll also be installing its dependencies to make it work, and possibly other packages as well to enable its full functionality.

It's been a super-busy time at Strata this week, so I'm taking the easy route for Because it's Friday this week: funny dog and cat videos. If you're not one of the 10 million people who have seen Sad Dog Diary, well, now's your chance:

 

And if you're more of a cat person, there's also Sad Cat Diary:

 

That's all for this week! Have a great weekend, and we'll be back on Monday.

My second-favourite keynote from yesterday's Strata Hadoop World conference was this one, from Pinterest's John Rauser. To many people (especially in the Big Data world), Statistics is a series of complex equations, but a just a little intuition goes a long way to really understanding data.