In this workshop, students will learn the basics of topic modeling with the MAchine Learning for LanguagE Toolkit, or MALLET. The focus will be on using topic modeling for digital literary applications, using a sample corpus of novels by Victor Hugo, but the techniques learned can be applied to any Big Data text corpus. Special attention will be paid to explaining the principles underlying LDA (Latent Dirichlet Allocation), as well as tips for proper data management in MALLET use cases.
For assistance, reach out by chat below or submit a request
We can be reached by email at data.services@nyu.edu
If you've met with us before, tell us how we're doing
Chat Service Staffed Hours: Spring 2021
Mondays: 12pm - 6pm
Tuesdays: 12pm - 6pm
Wednesdays: 12pm - 6pm
Thursdays: 12pm - 6pm
Fridays: 12pm - 6pm