RunDynamicEnrichment

This function calculates gene-set scores from the specified database (db) for each lineage using the specified scoring method (score_method). It then treats these scores as expression values and uses them as input to the RunDynamicFeatures function to identify dynamically enriched terms along the lineage.

Usage

RunDynamicEnrichment(
  srt,
  lineages,
  score_method = "AUCell",
  layer = "data",
  assay = NULL,
  min_expcells = 20,
  r.sq = 0.2,
  dev.expl = 0.2,
  padjust = 0.05,
  IDtype = "symbol",
  species = "Homo_sapiens",
  db = "GO_BP",
  db_update = FALSE,
  db_version = "latest",
  convert_species = TRUE,
  Ensembl_version = 103,
  mirror = NULL,
  TERM2GENE = NULL,
  TERM2NAME = NULL,
  minGSSize = 10,
  maxGSSize = 500,
  BPPARAM = BiocParallel::bpparam(),
  seed = 11
)

Arguments

srt: A Seurat object containing the results of differential expression analysis (RunDEtest). If specified, the genes and groups will be extracted from the Seurat object automatically. If not specified, the geneID and geneID_groups arguments must be provided.
lineages: A character vector specifying the lineages to plot.
score_method: The method to use for scoring. Can be "Seurat", "AUCell", or "UCell". Defaults to "Seurat".
layer: A character vector specifying the layer in the Seurat object to use. Default is "counts".
assay: A character vector specifying the assay in the Seurat object to use. Default is NULL.
min_expcells: A numeric value specifying the minimum number of expected cells. Default is 20.
r.sq: A numeric value specifying the R-squared threshold. Default is 0.2.
dev.expl: A numeric value specifying the deviance explained threshold. Default is 0.2.
padjust: A numeric value specifying the p-value adjustment threshold. Default is 0.05.
IDtype: A character vector specifying the type of gene IDs in the srt object or geneID argument. This argument is used to convert the gene IDs to a different type if IDtype is different from result_IDtype.
species: A character vector specifying the species for which the analysis is performed.
db: A character vector specifying the name of the database to be used for enrichment analysis.
db_update: A logical value indicating whether the gene annotation databases should be forcefully updated. If set to FALSE, the function will attempt to load the cached databases instead. Default is FALSE.
db_version: A character vector specifying the version of the database to be used. This argument is ignored if db_update is TRUE. Default is "latest".
convert_species: A logical value indicating whether to use a species-converted database when the annotation is missing for the specified species. The default value is TRUE.
Ensembl_version: Ensembl database version. If NULL, use the current release version.
mirror: Specify an Ensembl mirror to connect to. The valid options here are 'www', 'uswest', 'useast', 'asia'.
TERM2GENE: A data frame specifying the gene-term mapping for a custom database. The first column should contain the term IDs, and the second column should contain the gene IDs.
TERM2NAME: A data frame specifying the term-name mapping for a custom database. The first column should contain the term IDs, and the second column should contain the corresponding term names.
minGSSize: A numeric value specifying the minimum size of a gene set to be considered in the enrichment analysis.
maxGSSize: A numeric value specifying the maximum size of a gene set to be considered in the enrichment analysis.
BPPARAM: A BiocParallelParam object specifying the parallel back-end to be used for parallel computation. Defaults to BiocParallel::bpparam().
seed: The random seed for reproducibility. Defaults to 11.

Examples

data(pancreas_sub)
pancreas_sub <- RunSlingshot(
  pancreas_sub,
  group.by = "SubCellType",
  reduction = "UMAP"
)
#> Warning: No shared levels found between `names(values)` of the manual scale and the
#> data's fill values.
#> Warning: No shared levels found between `names(values)` of the manual scale and the
#> data's fill values.
#> Warning: Removed 2 rows containing missing values or values outside the scale range
#> (`geom_path()`).
#> Warning: Removed 2 rows containing missing values or values outside the scale range
#> (`geom_path()`).
#> Warning: Removed 7 rows containing missing values or values outside the scale range
#> (`geom_path()`).
#> Warning: Removed 7 rows containing missing values or values outside the scale range
#> (`geom_path()`).

pancreas_sub <- RunDynamicFeatures(
  pancreas_sub,
  lineages = "Lineage1",
  n_candidates = 200
)
#> ℹ [2025-07-26 07:21:46] Start RunDynamicFeatures
#> ℹ [2025-07-26 07:21:46] Workers: 2
#> Finding variable features for layer counts
#> ℹ [2025-07-26 07:21:47] Number of candidate features(union): 199
#> ℹ [2025-07-26 07:21:48] Calculate dynamic features for Lineage1...
#> 
  |                                                                            
  |                                                                      |   0%
  |                                                                            
  |===================================                                   |  50%
  |                                                                            
  |======================================================================| 100%
#> 
#> ℹ [2025-07-26 07:21:54] RunDynamicFeatures done
#> ℹ [2025-07-26 07:21:54] Elapsed time:8.34 secs
ht1 <- DynamicHeatmap(
  srt = pancreas_sub,
  lineages = "Lineage1",
  cell_annotation = "SubCellType",
  n_split = 4
)
#> ℹ [2025-07-26 07:21:54] 143 features from Lineage1 passed the threshold (exp_ncells>20 & r.sq>0.2 & dev.expl>0.2 & padjust<0.05): 
#> ℹ [2025-07-26 07:21:54] Gcg,Ghrl,Iapp,Pyy,Rbp4,Chgb,Lrpprc,Slc38a5,Cdc20,Chga...
#> 'magick' package is suggested to install to give better rasterization.
#> 
#> Set `ht_opt$message = FALSE` to turn off this message.
#> ℹ [2025-07-26 07:21:56] 
#> ℹ [2025-07-26 07:21:56] The size of the heatmap is fixed because certain elements are not scalable.
#> ℹ [2025-07-26 07:21:56] The width and height of the heatmap are determined by the size of the current viewport.
#> ℹ [2025-07-26 07:21:56] If you want to have more control over the size, you can manually set the parameters 'width' and 'height'.

ht1$plot


pancreas_sub <- RunDynamicEnrichment(
  srt = pancreas_sub,
  lineages = "Lineage1",
  score_method = "UCell",
  db = "GO_BP",
  species = "Mus_musculus"
)
#> ℹ [2025-07-26 07:21:57] Start RunDynamicFeatures
#> ℹ [2025-07-26 07:21:57] Workers: 2
#> ℹ [2025-07-26 07:21:57] Species: Mus_musculus
#> ℹ [2025-07-26 07:21:57] Loading cached db: GO_BP version:3.21.0 nterm:15445 created:2025-07-26 06:45:24.771567
#> ℹ [2025-07-26 07:21:59] Convert ID types for the database: GO_BP
#> ℹ [2025-07-26 07:21:59] Connect to the Ensembl archives...
#> ℹ [2025-07-26 07:21:59] Using the 103 version of biomart...
#> ℹ [2025-07-26 07:21:59] Connecting to the biomart...
#> ℹ [2025-07-26 07:22:59] Error in `req_perform()`:
#> ℹ [2025-07-26 07:22:59] ! Failed to perform HTTP request.
#> ℹ [2025-07-26 07:22:59] Caused by error in `curl::curl_fetch_memory()`:
#> ℹ [2025-07-26 07:22:59] ! Timeout was reached [feb2021.archive.ensembl.org]:
#> ℹ [2025-07-26 07:22:59] Operation timed out after 60000 milliseconds with 0 bytes received
#> ℹ [2025-07-26 07:22:59] 
#> ℹ [2025-07-26 07:22:59] Get errors when connecting with ensembl mart...
#> ℹ [2025-07-26 07:23:00] Retrying...
#> ℹ [2025-07-26 07:24:00] Error in `req_perform()`:
#> ℹ [2025-07-26 07:24:00] ! Failed to perform HTTP request.
#> ℹ [2025-07-26 07:24:00] Caused by error in `curl::curl_fetch_memory()`:
#> ℹ [2025-07-26 07:24:00] ! Timeout was reached [feb2021.archive.ensembl.org]:
#> ℹ [2025-07-26 07:24:00] Operation timed out after 60000 milliseconds with 0 bytes received
#> ℹ [2025-07-26 07:24:00] 
#> ℹ [2025-07-26 07:24:00] Get errors when connecting with ensembl mart...
#> ℹ [2025-07-26 07:24:01] Retrying...
#> ℹ [2025-07-26 07:24:02] Searching the dataset mmusculus ...
#> ℹ [2025-07-26 07:24:02] Connecting to the dataset mmusculus_gene_ensembl ...
#> ℹ [2025-07-26 07:24:03] Converting the geneIDs...
#> ℹ [2025-07-26 07:28:54] Error in `httr2::req_perform()`:
#> ℹ [2025-07-26 07:28:54] ! HTTP 500 Internal Server Error.
#> ℹ [2025-07-26 07:28:54] 
#> ℹ [2025-07-26 07:28:54] Get errors when retrieving information from the BioMart database
#> ℹ [2025-07-26 07:28:55] Retrying...
#> ℹ [2025-07-26 07:31:16] Error in `httr2::req_perform()`:
#> ℹ [2025-07-26 07:31:16] ! HTTP 500 Internal Server Error.
#> ℹ [2025-07-26 07:31:16] 
#> ℹ [2025-07-26 07:31:16] Get errors when retrieving information from the BioMart database
#> ℹ [2025-07-26 07:31:17] Retrying...
#> ℹ [2025-07-26 07:33:29] Error in `httr2::req_perform()`:
#> ℹ [2025-07-26 07:33:29] ! HTTP 500 Internal Server Error.
#> ℹ [2025-07-26 07:33:29] 
#> ℹ [2025-07-26 07:33:29] Get errors when retrieving information from the BioMart database
#> ℹ [2025-07-26 07:33:30] Retrying...
#> ℹ [2025-07-26 07:43:39] Error in `httr2::req_perform()`:
#> ℹ [2025-07-26 07:43:39] ! HTTP 500 Internal Server Error.
#> ℹ [2025-07-26 07:43:39] 
#> ℹ [2025-07-26 07:43:39] Get errors when retrieving information from the BioMart database
#> ℹ [2025-07-26 07:43:40] Retrying...
#> ℹ [2025-07-26 07:45:52] Error in `httr2::req_perform()`:
#> ℹ [2025-07-26 07:45:52] ! HTTP 500 Internal Server Error.
#> ℹ [2025-07-26 07:45:52] 
#> ℹ [2025-07-26 07:45:52] Get errors when retrieving information from the BioMart database
#> Error in log_message(out, message_type = "error"): Error in `httr2::req_perform()`: ! HTTP 500 Internal Server Error.
ht2 <- DynamicHeatmap(
  srt = pancreas_sub,
  assay = "GO_BP",
  lineages = "Lineage1_GO_BP",
  cell_annotation = "SubCellType",
  n_split = 4,
  split_method = "kmeans-peaktime"
)
#> Error in log_message("lineages: ", l, " is not in the meta data of the Seurat object",     message_type = "error"): Lineages: Lineage1_GO_BP is not in the meta data of the Seurat object
ht2$plot
#> Error: object 'ht2' not found

Usage

Arguments

See also

Examples