Introduction to an R Package for Text Analysis: stm

Date
May 3, 2016, 9:30 am12:00 pm
Location
Speaker
Event Description

The Structural Topic Model is a general framework for topic modeling with document-level covariate information. The covariates can improve inference and qualitative interpretability and are allowed to affect topical prevalence, topical content or both. The software package implements the estimation algorithms for the model and also includes tools for every stage of a standard workflow from reading in and processing raw text through making publication quality figures.  The workshop will provide a hands-on introduction to using the stm package which currently includes functionality to:

  •  ingest and manipulate text data 
  •  estimate Structural Topic Models 
  • calculate covariate effects on latent topics with uncertainty 
  • estimate a graph of topic correlations
  • compute model diagnostics and summary measures
  • create the plots used in various papers about stm

Audience

Attendees should have previous R experience.

Format

Lecture, discussion and hands-on exercises.

Requirements

Attendees should bring a laptop with R and the R package stm already installed.  The stm package is available on CRAN and can be installed using: install.packages("stm")