COVID-19 Service Status
Data Services has shifted to virtual services for the Summer 2020 sessions. During our normal working hours, we will respond to requests
via e-mail and hold consultations
via Zoom when necessary.
Staffed Hours: Summer 2020
Mondays: 12pm - 6pm
Tuesdays: 12pm - 6pm
Wednesdays: 12pm - 6pm
Thursdays: 12pm - 6pm
Fridays: 12pm - 4pm
If you've met with us before, tell us how we're doing.
Tesseract is an open source optical character recognition (OCR) platform. OCR extracts text from images and documents without a text layer and outputs the document into a new searchable text file, PDF, or most other popular formats. Tesseract is highly customizable and can operate using most languages, including multilingual documents and vertical text. Although the software can be used on Windows or Linux, this guide will be based on Mac operating systems which is done through the terminal application.
The goals of this guide are to learn how to: