Skip to Main Content

Text Data Mining

Resources for working with text as data, including corpus preparation, tutorials, data sources, and lists of tools.

Corpus Preparation

Optical Character Recognition (OCR)

Web Scraping

Regular Expressions

Method Selection/Statistical Knowledge