decoupler.run_gsea

decoupler.run_gsea(mat, net, source='source', target='target', times=1000, batch_size=10000, min_n=5, seed=42, verbose=False, use_raw=True)

Gene Set Enrichment Analysis (GSEA).

Wrapper to run GSEA.

Parameters:
mat : list, pd.DataFrame or AnnData

List of [features, matrix], dataframe (samples x features) or an AnnData instance.

net : pd.DataFrame

Network in long format.

source : str

Column name in net with source nodes.

target : str

Column name in net with target nodes.

times : int

Number of random permutations to perform.

batch_size : int

Number of samples to process in each batch. Larger values use more memory but run faster.
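As a rough illustration of the memory/speed trade-off, the sketch below splits a range of sample indices into consecutive batches. The helper `iter_batches` is hypothetical and is not part of decoupler; it only shows how a batch size could partition the samples, not how the library does it internally.

```python
def iter_batches(n_samples, batch_size=10000):
    """Yield (start, stop) index pairs that cover range(n_samples).

    Hypothetical helper for illustration only, not decoupler's code.
    """
    for start in range(0, n_samples, batch_size):
        # The last batch may be smaller than batch_size.
        yield start, min(start + batch_size, n_samples)


batches = list(iter_batches(25000, batch_size=10000))
print(batches)  # [(0, 10000), (10000, 20000), (20000, 25000)]
```

Under this scheme, a larger batch size means fewer and larger batches, so more samples are held in memory at once.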

min_n : int

Minimum number of targets per source. Sources with fewer targets are removed.
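The effect of `min_n` can be sketched in plain Python. The example assumes that targets are counted only if they are also present among the features of `mat`; that is an assumption of this sketch, not something stated above. The function `filter_sources` and the toy edge list are hypothetical.

```python
from collections import defaultdict


def filter_sources(edges, features, min_n=5):
    """Keep sources that have at least `min_n` targets among `features`.

    Hypothetical helper for illustration only, not decoupler's code.
    `edges` is an iterable of (source, target) pairs (a long-format network).
    """
    feature_set = set(features)
    targets_by_source = defaultdict(set)
    for source, target in edges:
        # Assumption: targets absent from the data do not count.
        if target in feature_set:
            targets_by_source[source].add(target)
    return {s: t for s, t in targets_by_source.items() if len(t) >= min_n}


# Toy network: TF1 has five targets, TF2 has two.
edges = [("TF1", g) for g in ("A", "B", "C", "D", "E")]
edges += [("TF2", "A"), ("TF2", "B")]

kept = filter_sources(edges, features=["A", "B", "C", "D", "E", "F"], min_n=5)
print(sorted(kept))  # ['TF1']
```

With `min_n=5`, `TF2` is dropped because it has only two targets.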

seed : int

Random seed to use.
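To show how `times` and `seed` interact, here is a simplified, standard-library sketch of a permutation test around an unweighted running-sum enrichment score. It is not decoupler's implementation: the functions `enrichment_score` and `permutation_pvalue`, the unweighted scoring, and the two-sided p-value with a +1 correction are all assumptions made for illustration.

```python
import random


def enrichment_score(ranked_genes, gene_set):
    """Unweighted running sum: step up on hits, down on misses.

    Returns the deviation from zero with the largest absolute value.
    """
    hits = [g in gene_set for g in ranked_genes]
    n_hits = sum(hits)
    n_miss = len(ranked_genes) - n_hits
    if n_hits == 0 or n_miss == 0:
        return 0.0
    running, best = 0.0, 0.0
    for is_hit in hits:
        running += 1.0 / n_hits if is_hit else -1.0 / n_miss
        if abs(running) > abs(best):
            best = running
    return best


def permutation_pvalue(ranked_genes, gene_set, times=1000, seed=42):
    """Estimate a two-sided empirical p-value from `times` shuffles."""
    rng = random.Random(seed)  # fixed seed makes the result reproducible
    observed = enrichment_score(ranked_genes, gene_set)
    shuffled = list(ranked_genes)
    extreme = 0
    for _ in range(times):
        rng.shuffle(shuffled)
        if abs(enrichment_score(shuffled, gene_set)) >= abs(observed):
            extreme += 1
    # +1 correction keeps the estimate strictly above zero.
    return observed, (extreme + 1) / (times + 1)


ranked = ["g%d" % i for i in range(20)]      # genes ordered by some statistic
gene_set = {"g0", "g1", "g2", "g3"}          # all members sit at the top
es, pval = permutation_pvalue(ranked, gene_set, times=1000, seed=42)
print(es, pval)
```

In this sketch, more permutations give a finer-grained p-value at a higher runtime cost, and repeating the call with the same seed returns the same result.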

verbose : bool

Whether to show progress.

use_raw : bool

Use raw attribute of mat if present.

Returns:
Returns the gsea and norm_gsea activity estimates and their p-values, or stores them in mat.obsm['gsea_estimate'],
mat.obsm['gsea_norm'], and mat.obsm['gsea_pvals'].