We're sorry but this page doesn't work properly without JavaScript enabled. Please enable it to continue.
Feedback

Towards reproducible Jupyter notebooks

00:00

Formal Metadata

Title
Towards reproducible Jupyter notebooks
Title of Series
Number of Parts
Author
License
CC Attribution 2.0 Belgium:
You are free to use, adapt and copy, distribute and transmit the work or content in adapted or unchanged form for any legal purpose as long as the work is attributed to the author in the manner specified by the author or licensor.
Identifiers
Publisher
Release Date
Language

Content Metadata

Subject Area
Genre
Abstract
Jupyter has become a tool of choice for researchers willing to share a narrative and supporting code that their peers can re-run. This talk is about Jupyter’s Achille’s heel: software deployment. I will present Guix-Jupyter, which aims to make notebook self-contained and to support reproducible deployment. Jupyter has become a tool of choice for researchers in data science and others fields. Jupyter Notebooks allow them to share a narrative and supporting code that their peers can re-run, which is why it is often considered a good tool for reproducible science. However, Jupyter Notebooks do not describe their software dependencies, which significantly hinder reproducibility: What if your peer runs different Python version? What if your notebook depends on a library that your peer hasn’t installed? What will happen if you try to run your notebook in a few years? All these issues are being addressed by tools such as Binder and its friend repo2docker. These solutions, though, do not address what we think is the core issue: that notebooks lack information about their software dependency. In this talk I will present our take on this problem, Guix-Jupyter. Guix-Jupyter allows users to annotate their notebook with information about their run-time environment. Those annotations are interpreted and Guix takes care of deploying the dependencies described. Furthermore, Guix-Jupyter ensures that code runs in an isolated environment (a container) as a way to maximize reproducibility.
Goodness of fitBitSupercomputerINTEGRALComputer animation
QuicksortDirection (geometry)LaptopTerm (mathematics)Goodness of fitTwitterDigital rights managementComputer animation
Random numberRange (statistics)Sample (statistics)Binary filePatch (Unix)HistogramLaptopInstallation artLaptopSoftwareDirectory serviceCodeCellular automatonRevision controlInstallation artInternetworkingMultiplication signAddress spaceComputer animation
Interactive televisionSoftware repositoryRepository (publishing)Integrated development environmentCodeRun-time systemRight angleDescriptive statisticsMedical imagingLaptopLatent heatSoftwareComputer animation
Fatou-MengeInstallation artIntegrated development environmentDefault (computer science)Run-time systemCodeRun time (program lifecycle phase)LaptopBuildingMedical imagingComputer animation
Bound stateResultantMereologyDescriptive statisticsVirtual machineLaptopInformationCore dumpPoint (geometry)Computer animation
Integrated development environmentINTEGRALIntegrated development environmentInstallation artDigital rights managementSoftwareRun-time systemComputer animation
Computer fileLaptopIntegrated development environmentKernel (computing)Random numberSample (statistics)Range (statistics)Patch (Unix)Binary fileHistogramKernel (computing)BlogDigital rights managementProgramming languageRun-time systemLaptopNatural languageProxy serverRight angleQuicksortCodeSupercomputerMeta elementSoftwareIntegrated development environmentImplementationComputer animation
Integrated development environmentKernel (computing)Run-time systemInformationAsynchronous Transfer ModeComponent-based software engineeringMilitary operationSimilarity (geometry)Endliche ModelltheorieHazard (2005 film)Read-only memoryComputer programmingDigital rights managementSoftwareMemory managementAbstractionSheaf (mathematics)LaptopMereologyComputerLocal ringDescriptive statisticsCellular automatonoutputComputer fileRevision controlHash functionResultantLatent heatIntegrated development environmentComputer animation
CodeIntegrated development environmentMultiplication signIntegrated development environmentCodePositional notationRun-time systemLaptopComputer animation
Multiplication signInternet service providerRevision controlService (economics)Point (geometry)Personal identification numberLatent heatDifferent (Kate Ryan album)NumberComputer animation
Point cloudFacebookOpen source
Transcript: Englisch(auto-generated)