Hello! I am Vicky Rampin, the Librarian for Research Data Management and Reproducibility. I am also the liaison to computer science and data science programs at NYU! I am here to help you navigate the resources for both at NYU and beyond. You can set up an appointment with me or always email me at: vs77@nyu.edu.
If you need help with a specific quantitative, GIS, or qualitative software, you should reach out to Data Services.
Openness is a shared value in data science. Sharing code, data, research articles, and other materials is part and parcel of participating in the wider data science community. You might consider sharing your work as well!
A simple and effective way to share your research materials is to publish them in a repository. A repository is a storage facility (often also a preservation and curation facility) where users can upload and download their data, make it accessible and discoverable, all in an effort to fulfill grant requirements and/or support the free sharing of scholarly knowledge. Materials that are deposited into a repository should be:
Persistent (not likely to be modified)
Searchable and browesable
Retrieved or downloaded easily
Citeable
A wide variety of institution-based and discipline-specific repositories exist for researchers to choose from. The repository itself should be:
If both a discipline-specific repository and an institution-based one exist for your data, then consider depositing in both locations to maximize discovery and safety of the data. If you need some help finding an appropriate repository for your work, don't hesitate to reach out to us!
Licenses exist on a spectrum from totally open (like CC0, a license that says anyone can do anything with your materials and don't need to cite you -- it puts your materials in the public domain) to more restrictive (like CC-BY-NC-ND, which says that anyone can use your materials as long as it's for non-commercial use, derivatives are distributed, and they attribute you). You can choose a license that you are comfortable with -- there is no one-size-fits-all solution. There are licenses that have no restrictions.
There are many more repositories than we could list here, so we'll include our institutional repository and some up-to-date aggregators of repositories that can help you search for the right repository in your field:
Here are some resources to help you pick a license for data:
Your options for publishing research code are somewhat more limited. There are:
Here are some resources to help you pick a license for code: