addMotifAnnotations.Rd
This function adds information about which peaks contain motifs to a given ArchRProject. For each peak, a binary value is stored indicating whether each motif is observed within the peak region.
addMotifAnnotations(
ArchRProj = NULL,
motifSet = "cisbp",
annoName = "Motif",
species = NULL,
collection = "CORE",
motifPWMs = NULL,
cutOff = 5e-05,
width = 7,
version = 2,
force = FALSE,
logFile = createLogFile("addMotifAnnotations"),
...
)
An ArchRProject
object.
The name of a curated motif set to be used for annotation. Options include: (i) "JASPAR2016", "JASPAR2018", "JASPAR2020"
which gives the 2016, 2018, or 2020 version of JASPAR motifs, (ii) one of "cisbp", "encode", or "homer" which gives the
corresponding motif sets from the chromVAR
package, or (iii) "vierstra" which gives the clustered archetype motifs
created by Jeff Vierstra (https://github.com/jvierstra/motif-clustering).
The name of the peakAnnotation
object to be stored in the provided ArchRProject
The latin name of the species relevant to the supplied ArchRProject
. This is used for identifying which motif to be
used from CisBP/JASPAR. For JASPAR, species
is passed to TFBS::getMatrixSet
and some species names are not recognized. In these cases
it is possible to use the NCBI taxonomy ID. By default, this function will attempt to guess the species based on the value from getGenome()
.
If one of the JASPAR motif sets is used via motifSet
, this parameter allows you to indicate the JASPAR
collection to be used. See getMatrixSet()
from TFBSTools
for all options to supply for collection. If motifSet
is
"vierstra", then this must either be "archetype" (for the v2.1 clustered models) or "individual" (for the original v1 individual motif models).
NOTE: vierstra archetype motifs are currently in beta and have not been finalized by Jeff Vierstra.
A custom set of motif PWMs as a PWMatrixList to be used instead of motifSet
for adding motif annotations.
If motifPWMs
is used, motifSet
will be ignored.
The p-value cutoff to be used for motif search. The p-value is determined vs a background set of sequences
(see MOODS
for more details on this determination).
The width in basepairs to consider for motif matches. See the motimatchr
package for more information.
An integer specifying version 1 or version 2 of chromVARmotifs see github for more info GreenleafLab/chromVARmotifs.
A boolean value indicating whether to force the peakAnnotation
object indicated by annoName
to be overwritten if
it already exists in the given ArchRProject
.
The path to a file to be used for logging ArchR output.
Additional parameters to be passed to TFBSTools::getMatrixSet
for getting a JASPAR PWM object.
# Get Test ArchR Project
proj <- getTestProject()
# Add Motif Annotations
proj <- addMotifAnnotations(proj, motifSet = "cisbptest", annoName = "test")