Whole-Tale: Merging Science and Cyberinfrastructure Pathways

"Whole-Tale: Merging Science and Cyberinfrastructure Pathways" is an NSF-funded project that will enable researchers to examine, transform, and then seamlessly re-publish the research data used in an article. The Whole-Tale project will link existing software tools and cyberinfrastructure to support the entire computational process that underlies discovery, using popular frontends such as Jupyter and RStudio, thereby making it easier for researchers to conduct, share, and publish their research. We will provide a research environment that captures and, at the time of publication, exposes salient details of the research process via access to persistent versions of the data and code used, the underlying workflow, and relevant data lineage. In this way, researchers can create computational and data narratives that span the “whole story”, from conducting the computational science to the publication of a “living paper”.

The Whole-Tale summer internship program runs for 10 weeks, from May through July/August. We invite applications from undergraduate students, graduate students, and postgraduates who have received their degree within the past five years. The internship project can be conducted remotely and offers a stipend of $5000 (see the web site linked below for details).

The proposed internship projects cover a range of topic areas and vary in the extent and type of prior background required. We expect to fund at least three of the projects described below; the final determination will also take into account applicants’ interests and expertise.

  1. The Reproducible Data Wrangler
  2. Benchmarking Machine Learning Methods in Materials Science
  3. Analyzing Bias in Machine Learning Research
  4. Reproducibility Study in Biodiversity Informatics

Please visit http://wholetale.org/2018/03/08/internships.html for further details, including how to apply!

