Imputation by Bayesian principal components analysis — impute.pca

Apply imputation to the dataset by Bayesian principal components analysis (Oba et al. 2003) .

Usage

impute.pca_bayes(dataSet, reportImputing = FALSE, nPcs = NULL, maxSteps = 100)

Arguments

dataSet: The 2d dataset of experimental values.
reportImputing: A boolean (default = FALSE) specifying whether to provide a shadow data frame with imputed data labels, where 1 indicates the corresponding entries have been imputed, and 0 indicates otherwise. Alters the return structure.
nPcs: An integer specifying the number of principal components to calculate. The default is set to the minimum between the number of samples and the number of proteins.
maxSteps: An integer (default = 100) specifying the maximum number of estimation steps.

Value

If reportImputing = FALSE, the function returns the imputed 2d dataframe.
If reportImputing = TRUE, the function returns a list of the imputed 2d dataframe and a shadow matrix showing which proteins by replicate were imputed.

References

Oba S, Sato M, Takemasa I, Monden M, Matsubara K, Ishii S (2003). “A Bayesian Missing Value Estimation Method for Gene Expression Profile Data.” Bioinformatics, 19(16), 2088–2096. doi:10.1093/bioinformatics/btg287 .