Run scVelo workflow

scVelo is a scalable toolkit for RNA velocity analysis in single cells. This function runs an enhanced scVelo workflow on a Seurat object with improved error handling, version compatibility, and modular design.

Usage

RunSCVELO(
  srt = NULL,
  assay_x = "RNA",
  layer_x = "counts",
  assay_y = c("spliced", "unspliced"),
  layer_y = "counts",
  adata = NULL,
  group_by = NULL,
  linear_reduction = NULL,
  nonlinear_reduction = NULL,
  basis = NULL,
  mode = "stochastic",
  fitting_by = "stochastic",
  magic_impute = FALSE,
  knn = 5,
  t = 2,
  min_shared_counts = 30,
  n_pcs = 30,
  n_neighbors = 30,
  filter_genes = TRUE,
  min_counts = 3,
  min_counts_u = 3,
  normalize_per_cell = TRUE,
  log_transform = TRUE,
  use_raw = FALSE,
  diff_kinetics = FALSE,
  stream_smooth = NULL,
  stream_density = 2,
  arrow_length = 5,
  arrow_size = 5,
  arrow_density = 0.5,
  denoise = FALSE,
  denoise_topn = 3,
  kinetics = FALSE,
  kinetics_topn = 100,
  calculate_velocity_genes = FALSE,
  compute_velocity_confidence = TRUE,
  compute_terminal_states = TRUE,
  compute_pseudotime = TRUE,
  compute_paga = TRUE,
  top_n = 6,
  n_jobs = 1,
  palette = "Paired",
  palcolor = NULL,
  show_plot = TRUE,
  save = FALSE,
  dpi = 300,
  dirpath = "./",
  fileprefix = "",
  return_seurat = !is.null(srt)
)

Arguments

srt: A Seurat object.
assay_x: Assay to convert as the main data matrix (X) in the anndata object.
layer_x: Layer name for assay_x in the Seurat object.
assay_y: Assays to convert as layers in the anndata object.
layer_y: Layer names for the assay_y in the Seurat object.
adata: An anndata object.
group_by: Variable to use for grouping cells in the Seurat object.
linear_reduction: Linear reduction method to use, e.g., "PCA".
nonlinear_reduction: Non-linear reduction method to use, e.g., "UMAP".
basis: The basis to use for reduction, e.g., "UMAP".
mode: Velocity estimation models to use. Can be a vector containing "deterministic", "stochastic", and/or "dynamical".
fitting_by: Method used to fit gene velocities for dynamical modeling, e.g., "stochastic".
magic_impute: Flag indicating whether to perform magic imputation.
knn: The number of nearest neighbors for magic.MAGIC.
t: power to which the diffusion operator is powered for magic.MAGIC.
min_shared_counts: Minimum number of counts (both unspliced and spliced) required for a gene.
n_pcs: Number of principal components (PCs) used for velocity estimation.
n_neighbors: Number of nearest neighbors used for velocity estimation.
filter_genes: Whether to filter genes based on minimum counts.
min_counts: Minimum counts for gene filtering.
min_counts_u: Minimum unspliced counts for gene filtering.
normalize_per_cell: Whether to normalize counts per cell.
log_transform: Whether to apply log transformation.
use_raw: Whether to use raw data for dynamical modeling.
diff_kinetics: Whether to use differential kinetics.
stream_smooth: Multiplication factor for scale in Gaussian kernel around grid point.
stream_density: Controls the closeness of streamlines. When density = 2 (default), the domain is divided into a 60x60 grid, whereas density linearly scales this grid. Each cell in the grid can have, at most, one traversing streamline.
arrow_length: Length of arrows.
arrow_size: Size of arrows.
arrow_density: Amount of velocities to show.
denoise: Boolean flag indicating whether to denoise.
denoise_topn: Number of genes with highest likelihood selected to infer velocity directions.
kinetics: Boolean flag indicating whether to estimate RNA kinetics.
kinetics_topn: Number of genes with highest likelihood selected to infer velocity directions.
calculate_velocity_genes: Boolean flag indicating whether to calculate velocity genes.
compute_velocity_confidence: Whether to compute velocity confidence metrics.
compute_terminal_states: Whether to compute terminal states (root and end points).
compute_pseudotime: Whether to compute velocity pseudotime.
compute_paga: Whether to compute PAGA (Partition-based graph abstraction).
top_n: The number of top features to plot.
n_jobs: The number of parallel jobs to run.
palette: The palette to use for coloring cells.
palcolor: A vector of colors to use as the palette.
show_plot: Whether to show the plot.
save: Whether to save the plots.
dpi: The DPI (dots per inch) for saving the plot.
dirpath: The directory to save the plots.
fileprefix: The file prefix to use for the plots.
return_seurat: Whether to return a Seurat object instead of an anndata object. Default is TRUE.

Examples

if (FALSE) { # \dontrun{
data(pancreas_sub)
pancreas_sub <- RunSCVELO(
  srt = pancreas_sub,
  assay_x = "RNA",
  group_by = "SubCellType",
  linear_reduction = "PCA",
  nonlinear_reduction = "UMAP"
)
head(pancreas_sub[[]])
names(pancreas_sub@assays)

FeatureDimPlot(
  pancreas_sub,
  c("stochastic_length", "stochastic_confidence")
)
FeatureDimPlot(
  pancreas_sub,
  "stochastic_pseudotime"
)
VelocityPlot(
  pancreas_sub,
  reduction = "UMAP",
  plot_type = "stream"
)
CellDimPlot(
  pancreas_sub,
  group.by = "SubCellType",
  reduction = "UMAP",
  pt.size = NA,
  velocity = "stochastic"
)

pancreas_sub <- RunSCVELO(
  pancreas_sub,
  assay_x = "RNA",
  group_by = "SubCellType",
  linear_reduction = "PCA",
  nonlinear_reduction = "UMAP",
  mode = c("deterministic", "stochastic"),
  filter_genes = TRUE,
  min_counts = 5,
  compute_velocity_confidence = TRUE,
  compute_terminal_states = TRUE,
  compute_pseudotime = TRUE,
  compute_paga = TRUE
)
} # }

Usage

Arguments

See also

Examples