Skip to Main Content

Data Services Class Descriptions

Information, materials, and schedules for all currently offered Data Services classes.
Learn both basic and advanced techniques for transforming data using OpenRefine, an essential open-source tool for fast clean up of tabular data in preparation for analysis.
Software: OpenRefine
Duration: 120 min

Room description:

Some tutorials are held remotely and require NYU sign on to access, while others are held in person, without a remote component. Please note the correct modality and location of the tutorial when registering

Prerequisites:

None

Skills Taught / Learning Outcomes:
  • Learn how to use the OpenRefine interface, import and export datasets
  • Understand how OpenRefine documents changes to datasets to enable reproducible scholarship
  • Perform mass edits on data syntax to enable accurate data analysis
  • Perform automated transformations to save time in cleaning data
  • Split and join cells and columns
  • Perform built-in transformations (changing case, removing leading/trailing whitespace)
  • Be introduced to regular expressions and GREL for advanced transformations
Class Materials:

2018 Squirrel Census data

Related Classes:

Data Visualization with Tableau

Introduction to Stata

Introduction to R

Introduction to Python

Data Wrangling in R

Data Wrangling in Stata

Introduction to ArcGIS

Data Cleaning for GIS

Additional Training Materials:

Websites:

Exercises/Projects:

Feedback: bit.ly/feedbackds

Upcoming sessions for this tutorial