In this workshop, students will learn the basics of topic modeling with MALLET (the MAchine Learning for LanguagE Toolkit). The workshop focuses on topic modeling for digital literary applications, using a sample corpus of novels by Victor Hugo, but the techniques apply to any large text corpus. Special attention will be paid to the principles underlying Latent Dirichlet Allocation (LDA) and to tips for proper data management when working with MALLET.