rapids_singlecell.pp.normalize_pearson_residuals#
- rapids_singlecell.pp.normalize_pearson_residuals(adata, *, theta=100, clip=None, check_values=True, layer=None, inplace=True)[source]#
Applies analytic Pearson residual normalization [LBK21]. The residuals are based on a negative binomial offset model with overdispersion
thetashared across genes. By default, residuals are clipped tosqrt(n_obs)and overdispersiontheta=100is used.- Parameters:
- adata
AnnData AnnData object
- theta
float(default:100) The negative binomial overdispersion parameter theta for Pearson residuals. Higher values correspond to less overdispersion
(var = mean + mean^2/theta), andtheta=np.Infcorresponds to a Poisson model.- clip
float|None(default:None) Determines if and how residuals are clipped: If None, residuals are clipped to the interval [-sqrt(n_obs), sqrt(n_obs)], where n_obs is the number of cells in the dataset (default behavior). If any scalar c, residuals are clipped to the interval
[-c, c]. Setclip=np.Inffor no clipping.- check_values
bool(default:True) If True, checks if counts in selected layer are integers as expected by this function, and return a warning if non-integers are found. Otherwise, proceed without checking. Setting this to False can speed up code for large datasets.
- layer
str|None(default:None) - inplace
bool(default:True) If True, update AnnData with results. Otherwise, return results. See below for details of what is returned.
- adata
- Return type:
- Returns:
If
inplace=True,Xor the selected layer inlayersis updated with the normalized values. Ifinplace=Falsethe normalized matrix is returned.