The below list includes data-related resources that may be applied to population health-related projects. Data can also be found and accessed through the Data Core at the Center for Population Health Sciences. Be sure to check Lane Library's classes and events page for upcoming workshops related to data management, data sharing, and other topics.
For researchers looking to publish and share their own (non-sensitive) research data, Lane Library recommends the Dryad data repository which is free to use for all Stanford affiliated researchers. For more information about Dryad, see the box at the bottom of this page.
The Dryad Digital Repository is a curated resource that makes research data discoverable, reusable, and citable. Dryad provides a home for a wide range of data types and is free to use for all Stanford affiliated researchers.
Key features of Dryad:
See their FAQ page for additional information about Dryad's features.
There are a variety of models and potential platforms for sharing your datasets with other researchers. Lane Library recommends Dryad as a way to openly share datasets that do not fit into more specialized repositories. For more information about Dryad, contact your liaison librarian.
Dryad uses ORCID iDs for login. The first time you log in, you will be asked if you are affiliated with a member institution. After selecting Stanford from the drop-down menu, you will be asked to sign in using your Stanford credentials. On every subsequent login, you will only have to use your iDs.
Once you have logged into Dryad, you can begin the process of publishing and sharing your data. After clicking Start New Dataset, you will be prompted to begin entering metadata. Good metadata (also called data documentation) is vital for ensuring that your dataset can be discovered, understood, and used by other researchers.
Dryad only requires that you complete the title, authors, and abstract fields, but we strongly recommend that you complete every field and upload additional documentation (e.g. data dictionaries, readme files, etc) alongside your dataset.
Dryad has two different methods for uploading data. Both methods allow you to upload multiple files.
Once you've uploaded your files, you can decide to submit them to the curation process immediately or keep them temporarily private for peer review. During the curation process, expert curators perform basic checks to ensure that the title and abstract are meaningful, there are sufficient methods and usage notes, that files can be opened, and that no sensitive information of material subject to copyright restrictions have been inadvertently included in the dataset. As an author, you can review the curation process for your dataset.
If you are plan to use Dryad to publish and share your data, please feel free to use or adapt the following description when completing data management plans or other documents:
Stanford University is a Dryad member institution. Dryad is an open source tool for data publication and digital preservation. Datasets deposited into Dryad are permanently archived in a CoreTrustSeal-certified repository. Data files are regularly audited to ensure fixity and authenticity and are replicated with multiple copies in multiple geographic locations. Professional curators examine all Dryad deposits to ensure the validity of the data, apply robust metadata, and make certain that highly sensitive information has not been inadvertently included. Datasets deposited in Dryad are automatically assigned a Digital Object Identifier (DOI) and are indexed by Google Dataset Search and other tools to enhance discoverability.
More information about Dryad's features, see this page. For additional assistance in describing Dryad or to discuss how it can be integrated into your research workflow, contact your liaison librarian.
Increasingly, there is an expectation that researchers will share their data. Data sharing can be a complex endeavor and, though we think very highly of Dryad, Lane Library recommends that you choose the method for sharing that is right for you and your data. Answering the questions below will help guide you through this process. For additional assistance, please see our upcoming classes and events page for workshops related to data management and sharing or contact your liaison librarian.
In some cases, your research funder or the journal publishing your work will specify that your data should be shared through a specific repository. For example, some projects funded by the National Institute of Mental Health are expected to share their data through NIMH Data Archive. In cases like this, we recommend that you share your data through the required repository.
Please note that some requirements state that data should be shared, but do not specify where. In such cases, refer to the next question.
If your research community typically shares the type of data you are looking to share through a specific repository, we generally recommend that you use that repository. To find repositories specialized for particular types of data, we recommend searching the Registry of Research Data Repositories (Re3Data).
If there is not a repository that is specific to the type of data your working with or if you have other concerns about sharing your data, see the next question.
Certain characteristics of your data may determine how and where it can be shared. For example, if you are working with big data (over 300gb) or data that contains personally identifying information, we recommend scheduling a consultation with your liaison librarian so we can refer you to the appropriate group on campus to help you determine your options for making your data available.
However, if you are simply looking for a general-purpose data repository, we strongly recommend Dryad.