An introduction to the text analysis for literature with basic introduction to software packages. This workshop is an introduction to working with text as data in the humanities. This workshop will cover:
Presented with grateful consultation from Dr. David Hoover, NYU English Department.
In this workshop, students will learn the basics of topic modeling with the MAchine Learning for LanguagE Toolkit, or MALLET. The focus will be on using topic modeling for digital literary applications, using a sample corpus of novels by Victor Hugo, but the techniques learned can be applied to any Big Data text corpus. Special attention will be paid to explaining the principles underlying LDA (Latent Dirichlet Allocation), as well as tips for proper data management in MALLET use cases.
Voyant is a web-based text analysis tool designed to "facilitate reading and interpretive practices [...]"
With Voyant, you can upload or paste in a corpus of text and get some visualizations and analyses right in your browser. After uploading your corpus, you have 5 immediate panes:
Network diagrams are a great way to visualize the relationships between entities in your source material. It can sometimes help to explore materials from different angles to uncover new patterns and understanding.
If you're looking for more types of visualizations, you can find more information about data visualization generally at the corresponding library guide: https://guides.nyu.edu/viz/.
EndNote is freely available for the NYU community. You can find more information about downloading this software at the NYU EndNote support page. Below you can find a link to Tutorial slides that will walk you through the process of setting up and using EndNote to manage your bibliographic data.
1. Open the library of references you want to export.
2. In the toolbar at the top of the screen, click "Select Another Style" and select BibTex Export. BibTeX should be your archival copy. These files can also be used with LaTeX software to create citations and bibliographies.
3. Go to File > Export and save the file.
1. In your desktop Zotero, go to File > Export library
2. Make sure both notes and files are clicked so you export everything!
3. Pick which format you'd like to export. We recommend exporting as a CSV for your archival copy, and also as BibLaTex, particularly if you want to import your library into other programs. BibTeX files can be used with LaTeX software to create citations and bibliographies also.
1. Go to File > Export
2. Pick your export format. We recommend using BibLaTex as your archival copy. BibTeX files can be used with LaTeX software to create citations and bibliographies also, and most other bibliographic managers allow import of BibLaTex files.
1. Go to References > Export or Tools > Export, depending on your version.
2. Select All References or a specific folder of references you want to export.
3. Under "Export Formats" select BibTex and XML. XML will be your archival copy, and BibTex will be the secondary copy, one which you can import into other reference managers or use with LaTex software.
4. Click "Export to Text" and save.