
A compilation of DNA motifs from several sources (see References), stored in long format with one row per motif position.

Usage

motif_db

Format

A data frame with 270340 rows and 9 columns:

motif

Character. The name or identifier of the motif.

pos

Integer. The position within the motif.

A

Numeric. The score for adenine at this position.

C

Numeric. The score for cytosine at this position.

G

Numeric. The score for guanine at this position.

T

Numeric. The score for thymine at this position.

dataset

Character. The source dataset of the motif.

motif_orig

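Character. The name of the motif without the database prefix (for example, kznf__ZNF264_Imbeault2017_OM_RCADE for the motif SCENIC.kznf__ZNF264_Imbeault2017_OM_RCADE).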

cluster

Character. For SCENIC motifs, the cluster to which the motif belongs.
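
Because each row holds a single position of a single motif, the per-motif score matrix has to be reassembled by filtering on motif and ordering by pos. The following is a minimal sketch of that step, not a function provided by the package: `toy` is a hand-typed stand-in that copies the first three rows printed in the Examples section, and `motif_matrix()` is a hypothetical helper introduced only for illustration.

```r
# Toy stand-in for motif_db: the first three rows shown in the Examples
# section, restricted to the columns needed here (assumption: the real
# data set has the same column names).
toy <- data.frame(
  motif = "SCENIC.kznf__ZNF264_Imbeault2017_OM_RCADE",
  pos = 1:3,
  A = c(0.1633990, 9.9673210, 0.1633990),
  C = c(0.163399, 13.888901, 98.856209),
  G = c(96.895421, 72.712407, 0.163399),
  T = c(2.7777806, 3.4313703, 0.8169931),
  stringsAsFactors = FALSE
)

# Hypothetical helper: collect the rows of one motif into a
# positions-by-bases numeric matrix.
motif_matrix <- function(db, name) {
  rows <- db[db$motif == name, ]
  rows <- rows[order(rows$pos), ]            # make sure positions are in order
  m <- as.matrix(rows[, c("A", "C", "G", "T")])
  rownames(m) <- rows$pos
  m
}

m <- motif_matrix(toy, "SCENIC.kznf__ZNF264_Imbeault2017_OM_RCADE")
dim(m)      # 3 positions by 4 bases
rowSums(m)  # each of these rows sums to roughly 100
```

With the full data set loaded, the same helper could be called as `motif_matrix(motif_db, name)`. In the rows copied here the four scores at each position sum to about 100; whether that holds for every source dataset is not stated on this page.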

References

HOMER:

Heinz S, Benner C, Spann N, Bertolino E et al. Simple Combinations of Lineage-Determining Transcription Factors Prime cis-Regulatory Elements Required for Macrophage and B Cell Identities. Mol Cell 2010 May 28;38(4):576-589. PMID: 20513432

JASPAR:

Castro-Mondragon JA, Riudavets-Puig R, Rauluseviciute I, Berhanu Lemma R, Turchi L, Blanc-Mathieu R, Lucas J, Boddie P, Khan A, Manosalva Pérez N, Fornes O, Leung TY, Aguirre A, Hammal F, Schmelter D, Baranasic D, Ballester B, Sandelin A, Lenhard B, Vandepoele K, Wasserman WW, Parcy F, Mathelier A. JASPAR 2022: the 9th release of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 2022 Jan 7;50(D1):D165-D173. doi: 10.1093/nar/gkab1113

JOLMA:

Jolma, A., Yin, Y., Nitta, K. et al. DNA-dependent formation of transcription factor pairs alters their binding specificity. Nature 534, S15–S16 (2016). https://doi.org/10.1038/nature18912

HOCOMOCO:

Ivan V. Kulakovskiy; Ilya E. Vorontsov; Ivan S. Yevshin; Ruslan N. Sharipov; Alla D. Fedorova; Eugene I. Rumynskiy; Yulia A. Medvedeva; Arturo Magana-Mora; Vladimir B. Bajic; Dmitry A. Papatsenko; Fedor A. Kolpakov; Vsevolod J. Makeev: HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis. Nucl. Acids Res., Database issue, gkx1106 (11 November 2017). https://doi.org/10.1093/nar/gkx1106

SCENIC:

Aibar, S., González-Blas, C., Moerman, T. et al. SCENIC: single-cell regulatory network inference and clustering. Nat Methods 14, 1083–1086 (2017). https://doi.org/10.1038/nmeth.4463

SCENIC+:

Bravo González-Blas, C., De Winter, S., Hulselmans, G. et al. SCENIC+: single-cell multiomic inference of enhancers and gene regulatory networks. Nat Methods 20, 1355–1367 (2023). https://doi.org/10.1038/s41592-023-01938-4

Examples

data(motif_db)
head(motif_db)
#>                                       motif pos          A         C         G
#> 1 SCENIC.kznf__ZNF264_Imbeault2017_OM_RCADE   1  0.1633990  0.163399 96.895421
#> 2 SCENIC.kznf__ZNF264_Imbeault2017_OM_RCADE   2  9.9673210 13.888901 72.712407
#> 3 SCENIC.kznf__ZNF264_Imbeault2017_OM_RCADE   3  0.1633990 98.856209  0.163399
#> 4 SCENIC.kznf__ZNF264_Imbeault2017_OM_RCADE   4 65.5228541 14.542490 15.849689
#> 5 SCENIC.kznf__ZNF264_Imbeault2017_OM_RCADE   5  0.1633990 97.549021  2.124180
#> 6 SCENIC.kznf__ZNF264_Imbeault2017_OM_RCADE   6  0.1633989  9.313725  9.967315
#>            T dataset                         motif_orig cluster
#> 1  2.7777806  SCENIC kznf__ZNF264_Imbeault2017_OM_RCADE     0.1
#> 2  3.4313703  SCENIC kznf__ZNF264_Imbeault2017_OM_RCADE     0.1
#> 3  0.8169931  SCENIC kznf__ZNF264_Imbeault2017_OM_RCADE     0.1
#> 4  4.0849671  SCENIC kznf__ZNF264_Imbeault2017_OM_RCADE     0.1
#> 5  0.1633990  SCENIC kznf__ZNF264_Imbeault2017_OM_RCADE     0.1
#> 6 80.5555605  SCENIC kznf__ZNF264_Imbeault2017_OM_RCADE     0.1
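
As a further sketch, the highest-scoring base at each position can be read off to give a rough consensus string. The data frame `first6` below is a hand-typed copy of the six rows printed above (an assumption made so the snippet runs without the package), so the result covers only the first six positions of this motif, not its full length.

```r
# Hand-typed copy of the six rows printed by head(motif_db) above.
first6 <- data.frame(
  motif = "SCENIC.kznf__ZNF264_Imbeault2017_OM_RCADE",
  pos = 1:6,
  A = c(0.1633990, 9.9673210, 0.1633990, 65.5228541, 0.1633990, 0.1633989),
  C = c(0.163399, 13.888901, 98.856209, 14.542490, 97.549021, 9.313725),
  G = c(96.895421, 72.712407, 0.163399, 15.849689, 2.124180, 9.967315),
  T = c(2.7777806, 3.4313703, 0.8169931, 4.0849671, 0.1633990, 80.5555605),
  stringsAsFactors = FALSE
)

bases <- c("A", "C", "G", "T")

# Order by position, then pick the column with the largest score in each row.
scores <- as.matrix(first6[order(first6$pos), bases])
top <- bases[max.col(scores, ties.method = "first")]

consensus <- paste(top, collapse = "")
consensus
#> [1] "GGCACT"
```

Taking the per-position maximum discards how strongly each base is preferred, so this is only a quick summary of the scores rather than a replacement for the full matrix.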