Papers



Submit a Paper!

Browse ReproHack papers

  • Deep Structural Causal Models for Tractable Counterfactual Inference

    Authors: Nick Pawlowski, Daniel C. Castro, Ben Glocker.
    Submitted by hub-admin    
      Mean reproducibility score:   5.0/10   |   Number of reviews:   1
    Why should we attempt to reproduce this paper?

    Some may argue that the field of machine learning is in a reproducibility crisis. It will be interesting to know how difficult it is for others to reproduce the results of a paper that proposed a quite complex methodology.

    Tags: Python
  • Plasmonic nanostructure design and characterization via Deep Learning

    Authors: Malkiel, I., Mrejen, M., Nagler, A. et al.
    DOI: 10.1038/s41377-018-0060-7
    Submitted by hub-admin    

    Why should we attempt to reproduce this paper?

    The current code is written in Torch, which is no longer actively maintained. Since deep learning in nanophotonics is an area of active interest (e.g. for the design of new metamaterials), it is important to update the code to use a more modern deep learning library such as tensorflow/keras

  • The role of conidia in the dispersal of Ascochyta rabiei

    Authors: Khaliq, I., Fanning, J., Melloy, P. et al.
    DOI: 10.1007/s10658-020-02126-2
    Submitted by hub-admin    

    Why should we attempt to reproduce this paper?

    I suggested a few papers last year. I’m hoping that we’ve improved our reproducibility with this one, this year. We’ve done our best to package it up both in Docker and as an R package. I’d be curious to know what the best way to reproduce it is found to be. Working through vignettes or spinning up a Docker instance. Which is the preferred method?

    Tags: R Docker
  • Unveiling the diversity of spatial data infrastructures in Latin America: evidence from an exploratory inquiry

    Authors: Luis M. Vilches-Blázquez & Daniela Ballari
    DOI: 10.1080/15230406.2020.1772113
    Submitted by hub-admin    
      Mean reproducibility score:   10.0/10   |   Number of reviews:   1
    Why should we attempt to reproduce this paper?

    It is kind of an easy reproducible code. It reads the data, makes few descriptive statistical analysis and plots figures using ggplot2.

    Tags: R
  • Evolutionary and food supply implications of ongoing maize domestication by Mexican campesinos

    Authors: Bellon, M. R., Mastretta-Yanes, A., Ponce-Mendoza, A., Ortiz-Santamaría, D., Oliveros-Galindo, O., Perales, H., … Sarukhán, J.
    DOI: 10.1098/rspb.2018.1049
    Submitted by hub-admin    
      Mean reproducibility score:   6.0/10   |   Number of reviews:   1
    Why should we attempt to reproduce this paper?

    Cleaning the databases used for this study was one of the most challenging aspects of it, so making it public is the best way to make the more out of it. We made an effort to document all analyses and data wrangling steps. We are interested to know if it is truly reproducible so that we can follow this same scheme for further projects, or adjust accordingly.

    Tags: R
  • FlowFrontNet : Improving Carbon Composite Manufacturing with CNNs

    Authors: Stieber, S., Schröter, N., Schiendorfer, A., Hoffmann, A., & Reif, W.
    Submitted by hub-admin    

    Why should we attempt to reproduce this paper?

    To use data from a manufacturing process: RTM for carbon composite production.To see if you can handle large amounts of data: the 36 k injection runs contain a total of 5 m frames. Maybe it is possible for you to reach our performance on smaller parts of the data, which would be great.

    Tags: Python
  • pyKNEEr: An image analysis workflow for open and reproducible research on femoral knee cartilage

    Authors: Bonaretti S, Gold GE, Beaupre GS
    DOI: 10.1371/journal.pone.0226501
    Submitted by hub-admin    
      Mean reproducibility score:   6.5/10   |   Number of reviews:   2
    Why should we attempt to reproduce this paper?

    The paper describes pyKNEEr, a python package for open and reproducible research on femoral knee cartilage using Jupyter notebooks as a user interface. I created this paper with the specific intent to make both the workflows it describes and the paper itself open and reproducible, following guidelines from authorities in the field. Therefore, two things in the paper can be reproduced: 1) workflow results: Table 2 contains links to all the Jupyter notebooks used to calculate the results. Computations are long and might require a server, so if you want to run them locally, I recommend using only 2 or 3 images as inputs for the computations. Also, the paper should be sufficient, but if you need further introductory info, there are a documentation website: https://sbonaretti.github.io/pyKNEEr/ and a "how to" video: https://youtu.be/7WPf5KFtYi8 2) paper graphs: In the captions of figures 1, 4, and 5 you can find links to data repository, code (a Jupyter notebook), and the computational environment (binder) to fully reproduce the graph. These computations can be easily run locally and require a few seconds. All Jupyter notebooks automatically download data from Zenodo and provide dependencies, which should make reproducibility easier.

  • Dynamic redistribution of plasticity in a cerebellar spiking neural network reproducing an associative learning task perturbed by TMS

    Authors: Alberto Antonietti, Jessica Monaco, Egidio D'Angelo, Alessandra Pedrocchi, and Claudia Casellato
    Submitted by @_Aalph    

    Why should we attempt to reproduce this paper?

    Paper and codes+data have been published 4 years ago, will they still work? I always try to release data and codes to reproduce my papers, but I seldom receive feedback. It would be useful to have comments from a reproducers' team, in order to improve sharing for future research (I switched from MATLAB to Python already).

  • A novel approach to modelling transcriptional heterogeneity identifies the oncogene candidate CBX2 in invasive breast carcinoma

    Authors: Piqué, D.G., Montagna, C., Greally, J.M. et al.
    Submitted by hub-admin    
      Mean reproducibility score:   4.0/10   |   Number of reviews:   1
    Why should we attempt to reproduce this paper?

    This paper provides a novel approach to identifying oncogenes based on RNA overexpression in subsets of tumor relative to adjacent normal tissue. Showing that this study can be reproduced would aid other researchers who are attempting to identify oncogenes in other cancer types using the same methodology.

    Tags: R
  • Good Me Bad Me: Prioritization of the Good-Self During Perceptual Decision-Making

    Authors: Hu, C.-P., Lan, Y., Macrae, C. N., & Sui, J.
    DOI: 10.1525/collabra.301
    Submitted by hub-admin    
      Mean reproducibility score:   7.0/10   |   Number of reviews:   1
    Why should we attempt to reproduce this paper?

    It'll a great helpful to independently check the scientific record I've published, so that errors, if there are any, could be corrected. Also, I will learn how to share the data in a more accessible to other if you could give me feedback.

    Tags: Python R Matlab
  • Hyperparameter importance Across Datasets

    Authors: Jan N van Rijn and Frank Hutter
    DOI: 10.1145/3219819.3220058
    Submitted by hub-admin    
      Mean reproducibility score:   7.0/10   |   Number of reviews:   1
    Why should we attempt to reproduce this paper?

    I tried hard to make this paper as reproducible as possible, but as techniques and dependencies become more complex, it is hard to make it 100% clear. Any form of feedback is more than welcome.

  • Mental Health and Social Contact During the COVID-19 Pandemic: An Ecological Momentary Assessment Study

    Authors: Eiko I. Fried, Faidra Papanikolaou, Sacha Epskamp
    DOI: 10.31234/osf.io/36xkp
    Submitted by hub-admin    
      Mean reproducibility score:   7.6/10   |   Number of reviews:   5
    Why should we attempt to reproduce this paper?

    Currently submitted paper on COVID19 on mental health. Unique clinical data (time series during the pandemic onset) & methods, hopefully fun to work on. Possibly too boring / easy to reproduce given my data & code? Not sure.

    Tags: R
  • Statistical analysis of coverage error in simple global temperature estimators

    Authors: K Cowtan, P Jacobs, P Thorne, R Wilkinson
    Submitted by hub-admin    
      Mean reproducibility score:   3.0/10   |   Number of reviews:   1
    Why should we attempt to reproduce this paper?

    To see whether we did a good enough job in providing data and methods, and to check how the code has aged with respect to current libraries.

    Tags: Python
  • The Impact of Sound-Bite Journalism on Public Argument

    Authors: Rinke, E. M.
    DOI: 10.1111/jcom.12246
    Submitted by emrinke    
      Mean reproducibility score:   10.0/10   |   Number of reviews:   1
    Why should we attempt to reproduce this paper?

    - This paper is a good example of a standard social science study that is (I hope!) fully reproducible, from main analysis, to supplementary analyses and figures. - I have not yet received any external feedback w.r.t. its reproducibility, so would be interested to see if I have overlooked any gaps in the reproduction workflow that I anticipated.

  • Where should new parkrun events be located? Modelling the potential impact of 200 new events on socio-economic inequalities in access and participation

    Authors: Schneider PP, Smith RA, Bullas AM, Bayley T, Haake SS, Brennan A, Goyder E
    Submitted by hub-admin    
      Mean reproducibility score:   7.0/10   |   Number of reviews:   3
    Why should we attempt to reproduce this paper?

    If all went right, the analysis should be fully reproducible without the need to make any adjustments. The paper aims to find optimal locations for new parkruns, but we were not 100% sure how 'optimal' should be defined. We provide a few examples, but the code was meant to be flexible enough to allow potential decision makers to specify their own, alternative objectives. The spatial data set is also quite interesting and fun to play around with. Cave: The full analysis takes a while to run (~30+ min) and might require >= 8gb ram.

  • Modality switch effects emerge early and increase throughout conceptual processing: Evidence from ERPs

    Authors: Bernabeu, P., Willems, R. M., & Louwerse, M. M.
    Submitted by hub-admin    

    Why should we attempt to reproduce this paper?

    Open data and reproducibility was important in this project.

    Tags: R
  • Evaluation of the ‘Irish Rules’: The Potato Late Blight Forecasting Model and Its Operational Use in the Republic of Ireland

    Authors: Cucak, M., Sparks, A., Moral, R.D.A., Kildea, S., Lambkin, K. and Fealy, R.
    Submitted by hub-admin  
    Number of reviews:   2
    Why should we attempt to reproduce this paper?

    It is a rare find of full reproducibility in the field of plant disease epidemiology.

    Tags: R
  • Quantitative analysis of spectroscopic Low Energy Electron Microscopy data: High-dynamic range imaging, drift correction and cluster analysis

    Authors: Jong T.A. de, Kok, D.N.L, Torren A.J.H. van der, Schopmans H., Tromp R.M., Molen S.J. van der & Jobst J.
    Submitted by hub-admin  
    Number of reviews:   2
    Why should we attempt to reproduce this paper?

    Low Energy Electron Microscopy (LEEM) is a somewhat specific form of electron microscopy used to study surfaces and 2D materials. In this paper we describe a set of data processing techniques applied to LEEM and adapted to the peculiarities of LEEM. This is combined with a parallelized Python implementation using Dask in separate notebooks. So if you are interested in microscopy, image analysis, clustering of experimental physics data or parallel Python, this paper should be interesting to you.

    Tags: Python
  • Explicit (but not implicit) environmentalist identity predicts pro-environmental behavior and policy preferences

    Authors: Brick, C., & Lai, C. K.
    DOI: 10.1016/j.jenvp.2018.07.003
    Submitted by hub-admin    
      Mean reproducibility score:   6.0/10   |   Number of reviews:   1
    Why should we attempt to reproduce this paper?

    The results of the individual studies (4) could be interpreted in support for the hypothesis, but the meta-analysis suggested that implicit identification was not a useful predictor overall. This conclusion is an important goalpost for future work.

  • Determination of the fundamental absorption and optical bandgap of dielectric thin films from single optical transmittance measurements

    Authors: A. Tejada, L. Montañez, C. Torres, P. Llontop, L. Flores-Escalante, F. De Zela, A. Winnacker, and J. A. Guerra
    Submitted by hub-admin    

    Why should we attempt to reproduce this paper?

    We propose a simple method to retrieve optical constants from single optical transmittance measurements, in particular in the fundamental absorption region. The construction of needed envelopes is arbitrary and will depend on the user. However, the method should still be robust and deliver similar results.

Search for papers

Filter by tags

Python R GDAL GEOS GIS Shiny PROJ Galaxies Astronomy HPC Databases Binder Social Science Stata make Computer Science Jupyter Notebook tidyverse emacs literate earth sciences clumped isotopes org-mode geology eyetracking LaTeX Git ArcGIS Docker Drake SVN knitr C Matlab Mathematica Meta-analysis swig miniconda tensorflow keras Pandas SQL neuroscience robotics deep learning planner reiforcement learning Plasma physics Hybrid-PIC EPOCH Laser Gamma-ray X-ray radiation Petawatt Fortran plasma PIC physics Monte Carlo Atomistic Simulation LAMMPS Electron Transport DFT descriptors interatomic potentials machine learning Molecular Dynamics Python scripting AIRSS structure prediction density functional theory high-throughput machine-learning RNA bioinformatics CFD Fluid Dynamics OpenFOAM C++ DNS Mathematics Droplets Basilisk Particle-In-Cell psychology Stan Finance SAS Replication crisis Economics Malaria consumer behavior number estimation mental arithmetic psychophysics Archaeology Precipitation Epidemiology Parkrun Health Health Economics HTA plumber science of science Zipf networks city size distribution urbanism literature review Preference Visual Questionnaire Mann-Whitney Correlation Conceptual replication Cognitive psychology Multinomial processing tree (MPT) modeling #urbanism #R k-means cluster analysis city-regions Urban Knowledge Systems Topic modelling Planning Support Systems Software Citation Quarto snakemake Numerical modelling Ocean climate physical oceanography apptainer oceanography R package structural equation modeling bayes factor Forest Simulations Models of forest dynamics multi-lab study mice mechanics growth Tissue Cells Clustering Expectation-Maximization bootstrapping R software Position Weight Matrices singularity neuroimaging effect size biology replicability cancer reproducibility csv osf preclinical research genomics All tags Clear tags

Key

  Associated with an event
  Available for general review
  Public reviews welcome