Efficient data manipulation with R

 

In this course you will learn how to handle your data management tasks using the most modern R data manipulation tools: dplyr, tidyr and lubridate.
Data manipulation is the process of changing data to make it suitable for the further steps of the data analysis process, as modeling and visualizing: nevertheless, it often occupies the most of the time.
Trough this course you will learn how to organize your data manipulation tasks in a standard and clear way, write clean and efficient code, and build reproducible data management processes.

Audience

This class will be a good fit for you if you have a working knowledge of R, and you usually handle with data and databases.

Attendees

6 attendees max.

Course organization

The first part of the course teaches how to do basic data manipulation using the dplyr package with some focus on packages tidyr and lubridate.
The second part of the course will focus more advanced data manipulation tecniques from package dplyr including  the usage of backend databases behind dplyr and SE vs NSE programming techniques.

Outline

  • Tidying data with tidyr
  • The fundamental verbs of data manipulation: select, filter, arrange, mutate and summarize
  • Group-wise calculations
  • Date handling with lubridate
  • Joining tables
  • Chain operators
  • Do as generic data manipulation tool
  • Programming with dplyr and tidyr: NSE vs SE
  • Working with backend databases

Cost

The cost of a 2 day course is 800 + VAT per person, which includes lunch, comprehensive course materials plus 1 hour of individual online post course support for each student within 30 days from course date.

Discounts

We offer an academic discount for those engaged in full time studies or research. Please contact us for further information.

Date

Next session will be in Spring, the dates will be available soon, for any further information you can contact us here.

Location

Quantide premises
Corso Italia, 85
20025 Legnano, MI
Italy

Teacher

Andrea Spanò
Andrea Spanò is an Rstudio certificated instructor who has worked as an R trainer and consultant for over 20 years. Andrea graduated in Statistics from the University of Siena and obtained a Master’s degree in Applied Statistics at the University College of London. He runs Quantide consulting firm and teaches at Luiss University post grad course on Big Data Management