R is a computer language dedicated to statistics and data science. The best known implementation of the R language is the GNU R software. The R language is derived from the S language developed by John Chambers and his colleagues at Bell Laboratories. GNU R is a free software distributed under the terms of the GNU GPL license and available under GNU / Linux, FreeBSD, NetBSD, OpenBSD, Mac OS X and Windows.

Project R was born in 1993 as a research project by Ross Ihaka and Robert Gentleman at the University of Auckland (New Zealand). Since 1997, about 20 developers have been involved in the development of R (R Development Core team). The version R 1.0.0, the first official version of the R language, was published on 29 February 20002. In 2003, the development team created the R Foundation for Statistical Computing to support the R project and to become a reference point for those who want to get in touch with the R community. At this time, the language has more than 200 libraries developed by the scientific community using R. In 2015, several important economic players such as IBM, Microsoft and RStudio created the R Consortium to support the R community and to fund projects around this language.

There are two or three distributions of R and in this tutorial we propose to use the one from the CRAN (which is the free one).

You can download it from the website of the CRAN Project.

RStudio is a very convenient IDE for the R language : you can download it from here.

Once installed the home page of RStudio looks as follows :

Next step : understanding and preparing the data.