Chapter 1 Debugging and Troubleshooting in ArchR

This chapter details steps to debug or troubleshoot common problems. Please read this fully before posting to GitHub or requesting help in any other forum.

1.0.1 Package dependencies

ArchR has a lot of dependencies. On top of that, ArchR’s dependencies have dependencies. These dependencies are used to deliver the feature-rich end-user experience that we find so important, but they can also cause problems. When one package changes something, it can have complicated downstream effects on ArchR. Because of this, we now provide a complete R package environment to use with each stable distribution of ArchR. Each package environment specifies the exact package version for each dependency and is run with a specific version of R to provide a uniform and stable end-user experience. See the chapter on renv for how to set up and use this provided environment. If you do not use the provided environment, we may not provide support for addressing your issues. Help us help you by using this standardized set of dependencies with renv. Not only is this helpful for us, it is also good coding practice that maximizes the reproducibility and stability of your code.

1.0.2 Debugging steps

First and foremost, you should read this entire manual and perform all of the different steps on the tutorial data. If you get errors on the master branch with the tutorial data using the tutorial code, then there is likely something wrong with your computational environment. Such environment-specific issues are extremely challenging for us to address and will require substantial effort on your part. The tutorial has been run by hundreds of individuals on a wide array of computational setups and it is very stable.

Common issues that users run into are related to parallelization and HDF5 file access (for ArrowFiles). To ensure that these are not a problem for you, try running without parallelization (set threads = 1 and call addArchRLocking(locking = TRUE) where applicable). If you are running in RStudio, note that console output messages from parallel processes will likely not display correctly; this is expected. Similarly, some users have reported issues with parallelization in Jupyter notebooks, and these appear to be related to settings on the end-user side.
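As a minimal sketch of the serial configuration described above: addArchRThreads() sets the session-wide default thread count, and addArchRLocking() is the setting named in the text. Treat this as session configuration to adapt to your own analysis, not as a guaranteed fix.

```r
library(ArchR)

# Use a single thread by default for the rest of this session
addArchRThreads(threads = 1)

# Enable locking for HDF5 (ArrowFile) access, as suggested above
addArchRLocking(locking = TRUE)
```

Many ArchR functions also accept a threads argument, so threads = 1 can instead be passed to a single call without changing the session default.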

Make sure to carefully read all of the parameter definitions, which can be found either through your R console or at the function definition page of our website. Pay special attention to situations where an ellipsis (...) is included as a function parameter. This indicates that additional arguments can be passed to the given function and will often be passed through to other function calls downstream. For example, in addClusters(), the ellipsis indicates that any additional parameters supplied to the addClusters() function will be passed through to Seurat::FindClusters() or scran::buildSNNGraph() depending on which is used. For more information on this “three dots” / ellipsis construction, see ?dots in your R console.
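The pass-through behavior of the ellipsis can be illustrated with a small base R sketch. Both functions below are hypothetical stand-ins (they are not ArchR, Seurat, or scran functions); the point is only that arguments the outer function does not name are forwarded unchanged to the inner call.

```r
# Hypothetical downstream function, standing in for something like a
# clustering routine that has its own tuning parameters
inner_cluster <- function(x, resolution = 0.8, verbose = FALSE) {
  list(n = length(x), resolution = resolution, verbose = verbose)
}

# Hypothetical wrapper: it names only `x` and `label`; everything else
# supplied by the caller is captured by `...` and forwarded as-is
outer_cluster <- function(x, label = "Clusters", ...) {
  result <- inner_cluster(x, ...)
  result$label <- label
  result
}

# `resolution` is not a parameter of outer_cluster(), so it travels
# through `...` and is consumed by inner_cluster()
res <- outer_cluster(1:10, label = "Demo", resolution = 2)
str(res)
```

A practical consequence is that a misspelled argument name is also forwarded, and the resulting error is raised by the downstream function rather than by the wrapper you called. When an error mentions an unused or unknown argument, check the documentation of the downstream function as well.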

1.0.3 Testing ArchR via testthat

As of release_1.0.3, ArchR comes with the ability to quickly test the majority of its functionality via the testthat package. In addition to running through the code provided in this entire manual, it is a good idea to run the full suite of functionality tests as described in the chapter on testing.

1.0.4 Posting to GitHub for help

ArchR is now quite mature and it is rare that true bugs are uncovered. Most remaining issues relate to the handling of outlier cases such as those present in non-standard genomes or new features that have been recently implemented and are inherently less stable. If you are getting an error, first thoroughly search the issues and discussions forums to make sure it has not already been answered. To keep things organized, errors and bugs should be posted to issues while usage questions and feature requests should be posted to discussions. When posting an error report or question, please keep in mind that this is free academic software maintained by scientists who have many other commitments.

For errors/bugs, please follow the provided issue template. This ensures that you include all of the information that we need. Always include the log file that ArchR generates, and where relevant include the output of traceback() to help us understand where the error is coming from. Never copy and paste screenshots of your R console to show us the error! Instead, use Markdown code block syntax (three backticks in a row) and copy and paste the actual text. This makes it so the text is searchable in the future for users who may encounter the same error.
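One way to gather plain text for a report is sketched below in base R. The helper name collect_error_report() is hypothetical and not part of ArchR; it simply captures an error message together with the session information as text that can be pasted into a Markdown code block. It complements the ArchR log file and the output of traceback() and does not replace them.

```r
# Hypothetical helper: evaluate an expression and, if it fails, return
# the error message plus session information as a character vector
collect_error_report <- function(expr) {
  err <- tryCatch({ expr; NULL }, error = function(e) e)
  if (is.null(err)) {
    return(character(0))  # nothing to report
  }
  c(
    paste("Error:", conditionMessage(err)),
    "",
    utils::capture.output(print(utils::sessionInfo()))
  )
}

# Example with a deliberately failing expression
report <- collect_error_report(stop("demo failure"))
writeLines(report)
```

In an interactive session, calling traceback() immediately after the error prints the call stack, which can be copied as text in the same way.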

1.0.5 Re-installing or upgrading ArchR

Our convention is to have this package accessible in three different ways:

The master branch which will always hold the most recently released stable version of ArchR.
Stable numerically tagged releases that provide persistent snapshots of previous stable releases.
A “development” version on a branch called dev that holds all updates since the last stable release.

When bugs are fixed or new features are implemented, the changes are committed to the dev branch and eventually merged into master to create a stable release. If you create a pull request, make sure that it uses dev as the base branch.

To re-install or upgrade ArchR, you need to ensure that the old version of ArchR, if currently loaded, is unloaded and that the new version is then attached, forcing your current R session to begin using the changed ArchR codebase. This is most effectively accomplished using the code below. In this example, the branch we are installing is called “dev” but you can install any branch or even any specific commit SHA by passing it to the ref parameter.

# Install your desired branch (here "dev")
devtools::install_github("GreenleafLab/ArchR", ref = "dev", repos = BiocManager::repositories(), upgrade = "never")
# Unload the previously loaded version of ArchR, if it is currently attached
if ("package:ArchR" %in% search()) detach("package:ArchR", unload = TRUE)
# Load the newly installed version of ArchR
library(ArchR)
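After re-installing, it can be worth confirming which version R actually finds. The sketch below uses a hypothetical helper, installed_version(), built only on base R and utils functions; it returns NA when a package is not installed rather than raising an error.

```r
# Hypothetical helper: report the installed version of a package as text,
# or NA if the package cannot be found in the current library paths
installed_version <- function(pkg) {
  if (requireNamespace(pkg, quietly = TRUE)) {
    as.character(utils::packageVersion(pkg))
  } else {
    NA_character_
  }
}

# Check ArchR; the result depends on what is installed on your system
installed_version("ArchR")
```

If the reported version is not the one you expected, restarting the R session before loading ArchR again is a simple way to rule out a stale, previously loaded copy.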