Skip to Content


by Michele Usuelli
Microsoft Data Scientist

Azure Machine Learning Studio is a drag-and-drop tool to deploy data-driven solutions. It contains pre-built items including data preparation tools and Machine Learning algorithms. In addition, it allows to include R and Python custom scripts.

Powerpoint is a powerful application for creating presentations, and allows you to include all sorts of text, pictures, animations and interactivity to create a compelling story. Most of the time you'll use the Powerpoint application to create slides, but if you want to include data and/or charts in your slides, in the interests of reproducibility you may want to automate the slide creation process.

Jamie Raines was born a girl, but at age 18 he began a three-year journey to transform his body to match his identity with hormone therapy. Every day Jamie took a selfie to document the change, and the dramatic transformation is shown in this time-lapse video:


In case you missed them, here are some articles from September of particular interest to R users. 

A tutorial on using R with Jupyter Notebooks and how to control the size of R graphics therein.

by Joseph Rickert

Early October: somewhere the leaves are turning brilliant colors, temperatures are cooling down and that back to school feeling is in the air. And for more people than ever before, it is going to seem to be a good time to commit to really learning R. I have some suggestions for R courses below, but first: What does it mean to learn R anyway? My take is that the answer depends on a person's circumstances and motivation.

I find the following graphic to be helpful in sorting things out.

by Andrie de Vries

In one of John D. Cooke's blog posts of 2010 (Parameters and Percentiles), he poses the following problem:

by Jens Carl Streibig, Professor Emeritus at University of Copenhagen

Editor's introduction: for background on the miniCRAN package, see our previous blog posts:

For more than six years, the New York Times has been using the R language to develop and implement much of the fantastic data journalism on the website and in the newspaper.

I just got back from the Strata+Hadoop World conference in New York, and amongst the usual talks on the technology and applications of big data and data science ran a new thread: data ethics. DJ Patil, the US government's chief data scientist, made a call for comments on data ethics in his keynote and in a follow-up discussion session.

Hadley Wickham, RStudio's Chief Scientist and prolific author of R books and packages, conducted an AMA (Ask Me Anything) session on Reddit this past Monday. The session was tremendously popular, generating more than 500 questions/comments and promoting the AMA to the front page of Reddit.

If you're not familiar with Hadley's work (which would be a surprise if you're an R user), his own introduction in the Reddit AMA post will fill you in: