ReproHack Hub

Browse ReproHack papers

Revisiting the zonally asymmetric extratropical circulation of the Southern Hemisphere spring using complex empirical orthogonal functions

Authors: Elio Campitelli, Leandro Díaz, Carolina Vera

DOI: 10.1007/s00382-023-06780-0

Submitted by eliocamp
Mean reproducibility score: 1.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
I used a lot of different tools and strategies to make this paper easily reproducible at different levels. There's a Docker container for the highest level of reproducibility, and package versions are managed with renv. The data used in the paper is hosted on Zenodo, both to avoid long queue times when downloading from the Climate Data Store and to future-proof against the day that service goes away, and it is checksummed before use.
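As an illustration of the "checksum before use" idea mentioned above, here is a minimal Python sketch. It is not the paper's own code (the paper's toolchain is R), and the function names, the choice of SHA-256, and the chunk size are all assumptions made for this example.

```python
import hashlib


def file_sha256(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_file(path, expected_hex):
    """Raise ValueError if the file's digest differs from the recorded one.

    `expected_hex` is a hypothetical value that would be stored alongside
    the analysis code when the data were first downloaded.
    """
    actual = file_sha256(path)
    if actual != expected_hex.lower():
        raise ValueError(f"checksum mismatch for {path}: got {actual}")
    return True
```

A reproduction script would call `verify_file` on each downloaded file before any analysis runs, so that a silently changed or corrupted download fails early rather than producing different figures.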

Tags: R Docker climate
Droplet impact onto a spring-supported plate: analysis and simulations

Authors: Michael J. Negus, Matthew R. Moore, James M. Oliver, Radu Cimpeanu

DOI: https://doi.org/10.1007/s10665-021-10107-5

Submitted by MNegus
Mean reproducibility score: 8.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
The direct numerical simulations (DNS) for this paper were conducted using Basilisk (http://basilisk.fr/). As Basilisk is a free software program written in C, it can be readily installed on any Linux machine, and it should then be straightforward to run the driver code to reproduce the DNS from this paper. That said, the numerical solutions presented in this paper are the result of many high-fidelity simulations, each of which took approximately 24 CPU hours running on between 4 and 8 cores. Hence the main difficulty in reproducing the results should be the amount of computational resources needed, so HPC resources will be required. The DNS in this paper were used to validate the presented analytical solutions, as well as to extend the results to a longer timescale. Reproducing these numerical results will build confidence in them, ensuring that they are independent of the system architecture they were produced on.

Tags: HPC C CFD Fluid Dynamics DNS Mathematics Droplets Basilisk
Accelerating the prediction of large carbon clusters via structure search: Evaluation of machine-learning and classical potentials

Authors: Bora Karasulu, Jean-Marc Leyssale, Patrick Rowe, Cedric Weber, Carla de Tomas

DOI: 10.1016/j.carbon.2022.01.031

Submitted by bkarasulu
Number of reviews: 1
Why should we attempt to reproduce this paper?
This paper presents a fine example of a high-throughput computational materials screening study, focusing mainly on carbon nanoclusters of different sizes. In the paper, a set of diverse empirical and machine-learned interatomic potentials, which are commonly used to simulate carbonaceous materials, is benchmarked against higher-level density functional theory (DFT) data, using a range of diverse structural features as the comparison criteria. Trying to reproduce the data presented here (even if you only consider a subset of the interaction potentials) will help you develop an understanding of how you could approach a high-throughput structure prediction problem. Even though we concentrate here on isolated/finite nanoclusters, AIRSS (and other similar approaches like USPEX, CALYPSO, GMIN, etc.) can also be used to predict crystal structures of different classes of materials with applications in energy storage, catalysis, hydrogen storage, and so on.

Tags: Python HPC LAMMPS DFT interatomic potentials Python scripting AIRSS structure prediction density functional theory high-throughput machine-learning
Automatic learning of hydrogen-bond fixes in an AMBER RNA force field

Authors: Thorben Fröhlking, Vojtěch Mlýnský, Michal Janeček, Petra Kührová, Miroslav Krepl, Pavel Banáš, Jiří Šponer, Giovanni Bussi

Submitted by giovannibussi

Why should we attempt to reproduce this paper?
We do care about reproducibility. If we receive any feedback, we would be really happy to improve our GitHub repository and/or submitted manuscript so as to make the reproduction easier!

Tags: Python HPC machine learning Molecular Dynamics
Molecular Dynamics of Solids at Constant Pressure and Stress Using Anisotropic Stochastic Cell Rescaling

Authors: Vittorio Del Tatto, Paolo Raiteri, Mattia Bernetti, Giovanni Bussi

DOI: 10.3390/app12031139

Submitted by giovannibussi

Why should we attempt to reproduce this paper?
We do care about reproducibility. If we receive any feedback, we would be really happy to improve our GitHub repository so as to make the reproduction easier!

Tags: HPC Molecular Dynamics
Synergistic coupling in ab initio-machine learning simulations of dislocations

Authors: Petr Grigorev, Alexandra M. Goryaeva, Mihai-Cosmin Marinica, James R. Kermode, Thomas D. Swinburne

DOI: https://arxiv.org/abs/2111.11262

Submitted by jameskermode

Why should we attempt to reproduce this paper?
Systematically improvable machine learning potentials could have a significant impact on the range of properties that can be modelled, but the toolchain associated with using them presents a barrier to entry for new users. Attempting to reproduce some of our results will help us improve the accessibility of the approach.

Tags: HPC interatomic potentials machine learning
Sensitivity and dimensionality of atomic environment representations used for machine learning interatomic potentials

Authors: Berk Onat, Christoph Ortner and James Kermode

DOI: 10.1063/5.0016005

Submitted by jameskermode

Why should we attempt to reproduce this paper?
Popular descriptors for machine learning potentials, such as the Behler-Parrinello atom-centred symmetry functions (ACSF) or the Smooth Overlap of Atomic Positions (SOAP), are widely used, but so far not much attention has been paid to optimising how many descriptor components need to be included to give good results.

Tags: HPC descriptors interatomic potentials machine learning
Encapsulated Nanowires: Boosting Electronic Transport in Carbon Nanotubes

Authors: Andrij Vasylenko, Jamie Wynn, Paulo Medeiros, Andrew J Morris, Jeremy Sloan, David Quigley

DOI: 10.1103/PhysRevB.95.121408

Submitted by dquigley
Mean reproducibility score: 5.0/10 | Number of reviews: 2
Why should we attempt to reproduce this paper?
DFT calculations are in principle reproducible between different codes, but differences can arise due to poor choice of convergence tolerances, inappropriate use of pseudopotentials and other numerical considerations. An independent validation of the key quantities needed to compute electrical conductivity would be valuable. In this case we have published our input files for calculating the four quantities needed to parametrise the transport simulations from which we compute the electrical conductivity. These are, specifically, electronic band structure, phonon dispersions, electron-phonon coupling constants and third derivatives of the force constants. Each in turn is more sensitive to convergence tolerances than the last, and it is the final quantity on which the conclusions of the paper critically depend. Reference output data is provided for comparison at the data URL below. We note that the pristine CNT results (dark red line) in figure 3 are an independent reproduction of earlier work and so we are confident the Boltzmann transport simulations are reproducible. The calculated inputs to these from DFT (in the case of Be encapsulation) have not been independently reproduced to our knowledge.

Tags: HPC Atomistic Simulation Electron Transport DFT
New Insight into the Stability of CaCO3 Surfaces and Nanoparticles via Molecular Simulation

Authors: A. Matthew Bano, P. Mark Rodger, and David Quigley

DOI: 10.1021/la501409j

Submitted by dquigley

Why should we attempt to reproduce this paper?
The negative surface enthalpies in figure 5 are surprising. At least one group has attempted to reproduce these using a different code and obtained positive enthalpies. This was attributed to the inability of that code to independently relax the three simulation cell vectors, resulting in an unphysical water density. This demonstrates how sensitive these results can be to the particular implementation of simulation algorithms in different codes. Similarly, the force field used is now very popular. Its functional form and full set of parameters can be found in the literature. However, differences in how simulation codes implement truncation, electrostatics, etc. can lead to significant differences in results such as these. It would be a valuable exercise to establish whether exactly the same force field as that used here can be reproduced from only its specification in the literature. The interfacial energies of interest should be reproducible with simulations on modest numbers of processors (a few dozen), with run times for each being 1-2 days. Each surface is an independent calculation and so these can be run concurrently during the ReproHack.

Tags: HPC Atomistic Simulation LAMMPS
Thermodynamics of stacking disorder in ice nuclei

Authors: David Quigley

DOI: 10.1063/1.4896376

Submitted by dquigley
Mean reproducibility score: 3.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
The results of this paper have been used in multiple subsequent studies as a benchmark against which other methods of performing the same calculation have been tested. Other groups have challenged the results as suffering from finite size effects, in particular the calculations on mixtures of cubic and hexagonal ice. Should there be time during the event, participants could check this by performing calculations on larger unit cells. Each individual calculation should converge adequately within 96 hours, making it amenable to an HPC ReproHack. Given modern HPC hardware, many such calculations could be run concurrently on a single HPC node.

Tags: HPC Fortran Monte Carlo Atomistic Simulation
The viewing angle in AGN SED models, a data-driven analysis

Authors: Andrés Felipe Ramos Padilla, Lingyu Wang, Katarzyna Małek, Andreas Efstathiou, Guang Yang

Submitted by aframosp
Mean reproducibility score: 9.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
Most of the material is available through Jupyter notebooks in GitHub, and it should be easy to reproduce with the help of Binder. With the notebooks, you could experiment with different parameters to the ones analyzed in the paper. It also contains a large dataset of physical parameters of galaxies analysed in this work. We expect this work to be easily reproducible in the steps described in the repository.

Tags: Python Galaxies Astronomy HPC Databases Binder
Finding Efficient Trade-offs in Multi-Fidelity Response Surface Modeling

Authors: Sander van Rijn, Sebastian Schmitt, Matthijs van Leeuwen, Thomas Bäck

Submitted by sjvrijn
Mean reproducibility score: 9.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
Because:
- Two fellow PhDs working on different topics have been able to reproduce some figures by following the README instructions and I hope this extends to other people
- I've tried to incorporate as many of the best practices as possible to make my code and data open and accessible
- I've tried to make sure that my data is exactly reproducible with the specified random seed strategy
- The paper suggests a method that should be useful to other researchers in my field, which is not useful unless my results are reproducible
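To illustrate what a random seed strategy for exact reproducibility can look like, here is a minimal Python sketch. It is not the strategy used in the paper's repository: the function names, the base seed, and the way the base seed and run index are combined are all assumptions chosen for this example.

```python
import random


def make_rng(base_seed, run_index):
    """Build an independent generator for one run from a fixed base seed.

    The combination rule below is an arbitrary illustrative choice; any
    deterministic mapping from (base_seed, run_index) to a seed would do.
    """
    return random.Random(base_seed * 1_000_003 + run_index)


def sample_run(base_seed, run_index, n=5):
    """Draw n pseudo-random numbers for a given run, reproducibly."""
    rng = make_rng(base_seed, run_index)
    return [rng.random() for _ in range(n)]
```

Because each run gets its own generator rather than sharing global state, repeating `sample_run(42, 3)` gives the same numbers regardless of which other runs were executed first or in parallel.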

Tags: Python HPC Computer Science
The role of conidia in the dispersal of Ascochyta rabiei

Authors: Khaliq, I., Fanning, J., Melloy, P. et al.

DOI: 10.1007/s10658-020-02126-2

Submitted by hub-admin

Why should we attempt to reproduce this paper?
I suggested a few papers last year. I’m hoping that we’ve improved our reproducibility with this one, this year. We’ve done our best to package it up both in Docker and as an R package. I’d be curious to know which way of reproducing it turns out to work best: working through the vignettes or spinning up a Docker instance. Which is the preferred method?

Tags: R Docker
Algorithm configuration data mining for CMA evolution strategies

Authors: Sander van Rijn, Hao Wang, Bas van Stein, Thomas Bäck

DOI: 10.1145/3071178.3071205

Submitted by sjvrijn
Mean reproducibility score: 10.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
The original data took quite a while to produce for a previous paper, but for this paper, all tables and figures should be exactly reproducible by simply running the Jupyter notebook.

Tags: Python HPC Computer Science
Growth Dynamics of Independent Gametophytes of Pleurosoriopsis makinoi (Polypodiaceae)

Authors: Atsushi Ebihara, Joel H. Nitta, Yurika Matsumoto, Yuri Fukazawa, Marie Kurihara, Hitomi Yokote, Kaoru Sakuma, Otowa Azakami, Yumiko Hirayama, Ryoko Imaichi

Submitted by joelnitta
Mean reproducibility score: 10.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
It uses the drake R package, which should make reproducibility of R projects much easier (just run make.R and you're done). However, it does depend on very specific package versions, which are provided by the accompanying Docker image.

Tags: R Docker Drake
Population structure and phenotypic variation of Sclerotinia sclerotiorum from dry bean (Phaseolus vulgaris) in the United States

Authors: Kamvar ZN, Amaradasa BS, Jhala R, McCoy S, Steadman JR, Everhart SE

DOI: 10.7717/peerj.4152

Submitted by hub-admin
Mean reproducibility score: 6.0/10 | Number of reviews: 1
Why should we attempt to reproduce this paper?
This paper is reproduced weekly in a Docker container on continuous integration, but it is also set up to work via local installs. It would be interesting to see if it's reproducible with a human operator who knows nothing of the project or toolchain.

Tags: R make Docker

