“Tidy Text Analysis in R” by Professor Marc Dotson on Monday, October 14, 2019

R is one of the most popular open-source languages for data analysis. Its recent popularity is due in part to the tidyverse, a collection of R packages for common tasks such as data cleaning and visualization. These packages provide a consistent and intuitive approach to data analysis and serve as a foundation for a growing number of other packages. Text analysis in particular, built on this foundation, has become increasingly accessible in R.

In this tutorial, we will cover the basics of R and the tidyverse and of text analysis, including sentiment analysis and topic modeling. Before the tutorial, participants should set up a free account at rstudio.cloud.

– Session 1: Wrangling and Summarizing
– Session 2: Tidying and Tokenizing
– Session 3: Sentiment Analysis and Visualization
– Session 4: Topic Modeling and Classification


This workshop was made possible thanks to the generous support of Labex ECODEC.

