Reproducible Research: An Introduction to The Turing Way and Binder
Schedule
Thu Nov 21 2024 at 04:30 pm to 05:30 pm
UTC+01:00Location
Online | Online, 0
About this Event
Sarah Gibson is an Open Source Infrastructure Engineer at 2i2c, an open source contributor, and advocate for open science. She holds more than two years of experience as a Research Engineer at a national institute for data science and artificial intelligence, as well as holding a core contributor role in the open source projects JupyterHub, Binder, and The Turing Way. Sarah is passionate about working with domain experts to leverage cloud computing in order to accelerate cutting-edge, data-intensive research and disseminating the results in an open, reproducible and reusable manner. Sarah holds a Fellowship with the Software Sustainability Institute and advocates for best software practices in research. She is a member of the mybinder.org operating team and maintains infrastructure supporting a global community in sharing reproducible computational environments. She has also mentored projects through cohorts of the OLS and Outreachy programmes, imparting lived experience of her skills participating and leading in open science projects. You can connect further with Sarah and follow her work on GitHub: @sgibson91
----------
Abstract
Reproducible research is necessary to ensure that scientific work can be trusted. Funders and publishers are beginning to require that publications include access to the underlying data and the analysis code. The goal is to ensure that all results can be independently verified and built upon in future work. This is sometimes easier said than done! Sharing these research outputs means understanding data management, library sciences, software development, and continuous integration techniques: skills that are not widely taught or expected of academic researchers. The Turing Way is a handbook to support students, their supervisors, funders and journal editors in ensuring that reproducible research is "too easy not to do". It includes training material on version control, analysis testing, and open and transparent communication with future users, and includes case studies and common "gotchas" for researchers to avoid. This project is openly developed and any and all questions, comments and recommendations are welcome at our GitHub repository: https://github.com/the-turing-way/the-turing-way
One tool The Turing Way recommends using is Project Binder (https://jupyter.org/binder), the free and public version of which is hosted at https://mybinder.org. This project uses openly developed tools to transform a static GitHub repository - containing Jupyter Notebooks and scripts written in Python, R, Julia and other languages - into an interactive computational environment hosted in the cloud. This environment can then be shared with anyone, anywhere via a single, clickable URL so that the analysis code and dataset can be interactively explored without the need for the user to install the software.
In this talk, Sarah Gibson, will take you on a whirlwind tour of The Turing Way chapters that already exist and the directions in which we're continuing to develop. As well as providing a brief introduction to how Binder works “under the hood”. All participants will leave the talk knowing that "Every Little Helps" when making their work reproducible, where to ask for help as they start or continue their open research journey, and how they can contribute to improve The Turing Way for future readers.
----------
Other talks from ReproducibiliTea Basel
Want to see what else is coming up?
Can't wait?
Where is it happening?
OnlineUSD 0.00