1 Overview

We previously developed an R/BioConductor package called Pathview, which maps, integrates and visualizes a wide range of data onto KEGG pathway graphs. Since its publication, Pathview has been widely used in omics studies and data analyses, and has become the leading tool in its category. Here we introduce the SBGNview package, which adopts Systems Biology Graphical Notation (SBGN) and greatly extends the Pathview project by supporting multiple major pathway databases beyond KEGG.

Key features:

  • Pathway definition by the widely adopted Systems Biology Graphical Notation (SBGN);

  • Supports multiple major pathway databases beyond KEGG (Reactome, MetaCyc, SMPDB, PANTHER, METACROP etc) and user defined pathways;

  • Covers 5,200 reference pathways and over 3,000 species by default;

  • Extensive graphics controls, including glyph and edge attributes, graph layout and sub-pathway highlight;

  • SBGN pathway data manipulation, processing, extraction and analysis.

2 Citation

Please cite the following papers when using this open-source package. This will help the project and our team:

Luo W, Brouwer C. Pathview: an R/Biocondutor package for pathway-based data integration and visualization. Bioinformatics, 2013, 29(14):1830-1831, doi: 10.1093/bioinformatics/btt285

3 Installation

3.1 Prerequisites

SBGNview depends or imports from the following R packages:

  • xml2: parse SBGN-ML files
  • rsvg: convert svg files to other formats (pdf, png, ps). librsvg2 is needed to install rsvg. See this page for more details: https://github.com/jeroen/rsvg
  • igraph: find shortest paths
  • httr: search SBGNhub for mapping files
  • KEGGREST: generate mapping tables from scratch when needed
  • pathview: map between different ID types for gene and chemical compound
  • gage: R package for pathway enrichment analysis.
  • SBGNview.data: demo and supportive datasets for SBGNview package
  • SummarizedExperiment: alternative input user data as SummarizedExperiment objects
  • AnnotationDbi: BioConductor annotation data and infrastructure

Note these dependencies will be automatically installed when SBGNview is installed from BioConductor or GitHub. To install them manually within R:

if (!requireNamespace("BiocManager", quietly = TRUE)){
     install.packages("BiocManager")
}
BiocManager::install(c("xml2", "rsvg", "igraph", "httr", "KEGGREST", "pathview", "gage", "SBGNview.data", "SummarizedExperiment", "AnnotationDbi"))

External dependencies (outside R): Windows 10: none

Linux (Ubuntu): needs additional packages (libxml2-dev, libssl-dev, libcurl4-openssl-dev, librsvg2-dev) to be installed. Run the command below in a terminal to install the necessary packages. The same or similar packages can be found for other distributes of linux.

sudo apt install libxml2-dev libssl-dev libcurl4-openssl-dev librsvg2-dev

3.2 Install SBGNview

Install SBGNview through Bioconductor:

BiocManager::install(c("SBGNview"))

Install SBGNview through GitHub:

install.packages("devtools")
devtools::install_github("datapplab/SBGNview")

Clone the Git repository:

git clone https://github.com/datapplab/SBGNview.git

4 Quick example

library(SBGNview)
# load demo dataset, SBGN pathway data collection and info, which may take a few seconds
data("gse16873.d","pathways.info", "sbgn.xmls")
input.pathways <- findPathways("Adrenaline and noradrenaline biosynthesis")
SBGNview.obj <- SBGNview(
          gene.data = gse16873.d[,1:3], 
          gene.id.type = "entrez",
          input.sbgn = input.pathways$pathway.id,
          output.file = "quick.start", 
          output.formats =  c("png")
          ) 
print(SBGNview.obj)

Two image files (a svg file by default and a png file) will be created in the current working directory.

\label{fig:quickStartFig}Quick start example: Adrenaline and noradrenaline biosynthesis pathway.

Figure 4.1: Quick start example: Adrenaline and noradrenaline biosynthesis pathway.

As a unique and useful feature of SBGNview package, we can highlight nodes, edges and/or paths using the highlight functions. Please read the function documentation and main vignette for details.{#quickhighlight}

outputFile(SBGNview.obj) <- "quick.start.highlights"
SBGNview.obj + highlightArcs(class = "production",color = "red") + 
               highlightArcs(class = "consumption",color = "blue") +
               highlightNodes(node.set = c("tyrosine", "(+-)-epinephrine"), 
                              stroke.width = 4, stroke.color = "green") + 
               highlightPath(from.node = "tyrosine", to.node = "dopamine",
                             from.node.color = "green",
                             to.node.color = "blue",
                             shortest.paths.cols = "purple",
                             input.node.stroke.width = 6,
                             path.node.stroke.width = 5,
                             path.node.color = "purple",
                             path.stroke.width = 5,
                             tip.size = 10 )
\label{fig:quickStartFigHighlight}Quick start example: Highlight arcs, nodes, and path.

Figure 4.2: Quick start example: Highlight arcs, nodes, and path.

5 Additional information

This tutorial is just a brief introduction and quick start. For more info, please check the package documentation and main vignettes.

For more info on SBGN, please check the official SBGN project website

For any questions, please contact Kovidh Vegesna (kvegesna [AT] uncc.edu) or Weijun Luo (luo_weijun [AT] yahoo.com)

6 Session Info

sessionInfo()
## R Under development (unstable) (2024-10-21 r87258)
## Platform: x86_64-pc-linux-gnu
## Running under: Ubuntu 24.04.1 LTS
## 
## Matrix products: default
## BLAS:   /home/biocbuild/bbs-3.21-bioc/R/lib/libRblas.so 
## LAPACK: /usr/lib/x86_64-linux-gnu/lapack/liblapack.so.3.12.0
## 
## locale:
##  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
##  [3] LC_TIME=en_GB              LC_COLLATE=C              
##  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
##  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
##  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
## 
## time zone: America/New_York
## tzcode source: system (glibc)
## 
## attached base packages:
## [1] stats     graphics  grDevices utils     datasets  methods   base     
## 
## other attached packages:
## [1] SBGNview_1.21.0      SBGNview.data_1.19.0 pathview_1.47.0     
## [4] knitr_1.48          
## 
## loaded via a namespace (and not attached):
##  [1] sass_0.4.9                  SparseArray_1.7.0          
##  [3] xml2_1.3.6                  bitops_1.0-9               
##  [5] lattice_0.22-6              RSQLite_2.3.7              
##  [7] digest_0.6.37               magrittr_2.0.3             
##  [9] evaluate_1.0.1              grid_4.5.0                 
## [11] KEGGgraph_1.67.0            bookdown_0.41              
## [13] fastmap_1.2.0               blob_1.2.4                 
## [15] Matrix_1.7-1                jsonlite_1.8.9             
## [17] AnnotationDbi_1.69.0        graph_1.85.0               
## [19] GenomeInfoDb_1.43.0         DBI_1.2.3                  
## [21] httr_1.4.7                  UCSC.utils_1.3.0           
## [23] XML_3.99-0.17               Rgraphviz_2.51.0           
## [25] Biostrings_2.75.0           jquerylib_0.1.4            
## [27] abind_1.4-8                 Rdpack_2.6.1               
## [29] cli_3.6.3                   rlang_1.1.4                
## [31] crayon_1.5.3                rbibutils_2.3              
## [33] XVector_0.47.0              Biobase_2.67.0             
## [35] bit64_4.5.2                 DelayedArray_0.33.0        
## [37] cachem_1.1.0                yaml_2.3.10                
## [39] S4Arrays_1.7.0              tools_4.5.0                
## [41] memoise_2.0.1               GenomeInfoDbData_1.2.13    
## [43] SummarizedExperiment_1.37.0 BiocGenerics_0.53.0        
## [45] vctrs_0.6.5                 R6_2.5.1                   
## [47] org.Hs.eg.db_3.20.0         png_0.1-8                  
## [49] matrixStats_1.4.1           stats4_4.5.0               
## [51] lifecycle_1.0.4             zlibbioc_1.53.0            
## [53] KEGGREST_1.47.0             rsvg_2.6.1                 
## [55] S4Vectors_0.45.0            IRanges_2.41.0             
## [57] bit_4.5.0                   pkgconfig_2.0.3            
## [59] bslib_0.8.0                 highr_0.11                 
## [61] GenomicRanges_1.59.0        xfun_0.48                  
## [63] MatrixGenerics_1.19.0       htmltools_0.5.8.1          
## [65] igraph_2.1.1                rmarkdown_2.28             
## [67] compiler_4.5.0              RCurl_1.98-1.16