3me PhD students - Part 3
Hopefully this is becoming clearer…
If it’s not clear after part 3, shame on me
Part 1
Part 2
Part 3
Lecture (right now)
Interactive session
This illustration is created by Scriberia with The Turing Way community.
Used under a CC-BY 4.0 licence. DOI: 10.5281/zenodo.3332807
Secure and backed-up (e.g. Project Data U:)
Or cloud solutions
Cloud solutions are deleted after your contract ends
Send encrypted files via SURFfilesender
3 copies of data (including active work)
2 media
1 off-site
Think about it. Can someone else take over without your help?
File naming should be consistent and understandable (to humans and machines)
-
, _
)Use version control.
Best if done during the project!
README file(s)
Paper/electronic lab notebooks
Annotated scripts or Code notebooks (e.g. R Notebook and Jupyter Notebook)
Working with human participants requires an HREC application:
Resources
This illustration is created by Scriberia with The Turing Way community.
Used under a CC-BY 4.0 licence. DOI: 10.5281/zenodo.3332807
As a PhD student you are resposible for:
Ensuring that all data and code underlying completed PhD theses are appropriately documented and accessible for at least 10 years from the end of the research project, in accordance with the FAIR principles (Findable, Accessible, Interoperable and Reusable), unless there are valid reasons which make research data unsuitable for sharing.
Minimal requirement:
Encouraged:
Data are available upon request to corresponding author.
Considerations
As open as possible; as closed as necessary.
Not suitable for sharing:
If in doubt, contact supervisor, owner of data, faculty contract manager, privacy officer, data steward
Data relating to identifiable person
. . .
Consent from participants needed for publishing and archiving
. . .
Can make empty informed consent form available
Findable - persistent identifier (e.g. DOI) and detailed metadata
Accessible - long-term accessibility of data (or just metadata if restricted)
Interoperable - non-proprietary file formats
Reusable - proper documentation and clear license
Image: https://book.fosteropenscience.eu/
Collective benefit - inclusive development and equitable outcomes
Authority to control - Rights, interests, and governance
Responsibility - respect, reciprocity, and trust
Ethics - minimising harm and maximising benefit
https://www.gida-global.org/care
JoKalliauer; foter, CC BY-SA 3.0 https://creativecommons.org/licenses/by-sa/3.0, via Wikimedia Commons
. . .
. . .
(You can of course use these in addition to a certified repository)
TU Delft policy on research software
License compatibility
Bazuine, Merlijn. (2021). TU Delft Guidelines on Research Software: Licensing, Registration and Commercialisation. Zenodo. https://doi.org/10.5281/zenodo.4629635
Commercial vs. open source
Can co-exist (e.g. RStudio, NextCloud, ownCloud, Linux distros, WordPress)
protocols.io
Reproducible: Reproducing results using the same methods and data
Replicable: Reproducing results using the same methods but DIFFERENT data
Main causes of failure to reproduce research:
https://doi.org/10.1038/533452a
Another barrier to reproducibility is the use of proprietary software and file formats (Not all institutions have a MatLab license - very few individuals)
This illustration is created by Scriberia with The Turing Way community.
Used under a CC-BY 4.0 licence. DOI: 10.5281/zenodo.3332807
Going the extra mile for science! 🥼
By Marwick et al. 2017 (https://doi.org/10.31235/osf.io/72n8g)
NOT everything everywhere all at once - start small
Removing Barriers to Reproducible Research in Archaeology
link to slides