In this vignette we consider approximating multiple data matrices as a product of multiple low-rank matrices (a.k.a., factor matrices).
Test data is available from toyModel.
library("iTensor")
data1 <- iTensor::toyModel("MICA")
data2 <- iTensor::toyModel("GroupICA")
str(data1, 2)## List of 2
##  $ X: num [1:10001, 1:3] -0.0613 -0.0593 0.5027 0.3647 0.357 ...
##  $ Y: num [1:10001, 1:3] 0.342 0.385 0.473 0.898 1.001 ...str(data2, 2)## List of 2
##  $ X: num [1:5000, 1:6] -2.43 4.07 8.83 -1.56 6.1 ...
##  $ Y: num [1:5000, 1:6] -9.644 -2.587 -0.261 -2.316 -15.774 ...Both data1 and data2 contain two
time-series data X and Y as follows.
plot.ts(data1$X[7700:8000,], main="data1 (X)")plot.ts(data1$Y[7700:8000,], main="data1 (Y)")plot.ts(data2$X[4700:5000,], main="data2 (X)")plot.ts(data2$Y[4700:5000,], main="data2 (Y)")As a formulation that extends ICA (independent component analysis) to the multiple matrices case, Multimodal Independent Component Analysis (MICA) was proposed ((Akaho 1999)). MICA extracts statistically dependent pairs of features from the sources, where the components of feature vector extracted from each source are independent.
MICA can be performed as follows.
t_series <- seq(from = 0.00, to = 1.000, by = 1e-4)
out.MICA <- MICA(data1$X, data1$Y, J=3, gamma_ts = 1 - 1 / (1 + exp(-100 * (t_series - 0.3))))J is the rank parameter for ICA and
gamma_ts is the weighting factor for dependence on
independence. You will see that MICA could extract some
time-series signals.
plot.ts(out.MICA$U[7700:8000, ], main="Source Signal (X)")plot.ts(out.MICA$V[7700:8000, ], main="Source Signal (Y)")Another formulation of the decomposition is Group Independent
Component Analysis (GroupICA (Calhourn 2009;
Pfister 2018)). GroupICA can be performed as
follows.
out_groupica_pooled_infomax <- GroupICA(data2, J1=6,
    algorithm="pooled", ica.algorithm="InfoMax")
out_groupica_Calhoun2009_fastica <- GroupICA(data2, J1=6,
    algorithm="Calhoun2009", ica.algorithm="FastICA")
out_groupica_Pfister2018_amuse <- GroupICA(data2, J1=6,
    algorithm="Pfister2018", ica.algorithm="AMUSE")The rank for each factor matrix can be set as J1 and the
decomposition algorithm can be easily switched by
ica.algorithm (algorithm of ICA
and ICA2 can be specified). To pool the results of ICA
against each data matrix, we implemented three algorithms such as
pooled, Calhoun2009, and
Pfister2018. For the details, see the references (Calhourn 2009; Pfister 2018).
You will see that GroupICA could extract some
time-series signals.
plot.ts(out_groupica_pooled_infomax$Ss[[1]], main="Source Signal (X)")plot.ts(out_groupica_pooled_infomax$Ss[[2]], main="Source Signal (Y)")plot.ts(out_groupica_Calhoun2009_fastica$Ss[[1]], main="Source Signal (X)")plot.ts(out_groupica_Calhoun2009_fastica$Ss[[2]], main="Source Signal (Y)")plot.ts(out_groupica_Pfister2018_amuse$Ss[[1]], main="Source Signal (X)")plot.ts(out_groupica_Pfister2018_amuse$Ss[[2]], main="Source Signal (Y)")Unlike MICA, GroupICA can also be applied
to more than three matrices.
## R version 4.2.2 (2022-10-31)
## Platform: x86_64-conda-linux-gnu (64-bit)
## Running under: CentOS Linux 7 (Core)
## 
## Matrix products: default
## BLAS/LAPACK: /home/koki/miniconda3/envs/r-4.2/lib/libopenblasp-r0.3.21.so
## 
## locale:
##  [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C              
##  [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8    
##  [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8   
##  [7] LC_PAPER=en_US.UTF-8       LC_NAME=C                 
##  [9] LC_ADDRESS=C               LC_TELEPHONE=C            
## [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C       
## 
## attached base packages:
## [1] stats     graphics  grDevices utils     datasets  methods   base     
## 
## other attached packages:
## [1] mixOmics_6.22.0 ggplot2_3.4.2   lattice_0.21-8  MASS_7.3-58.3  
## [5] iTensor_1.0.0  
## 
## loaded via a namespace (and not attached):
##  [1] ggrepel_0.9.3       Rcpp_1.0.10         einsum_0.1.0       
##  [4] tidyr_1.3.0         corpcor_1.6.10      digest_0.6.31      
##  [7] utf8_1.2.3          RSpectra_0.16-1     R6_2.5.1           
## [10] plyr_1.8.8          ellipse_0.4.5       evaluate_0.20      
## [13] highr_0.10          pillar_1.9.0        rlang_1.1.0        
## [16] jquerylib_0.1.4     Matrix_1.5-4        rmarkdown_2.21     
## [19] rARPACK_0.11-0      splines_4.2.2       BiocParallel_1.32.6
## [22] geigen_2.3          stringr_1.5.0       igraph_1.4.2       
## [25] munsell_0.5.0       compiler_4.2.2      xfun_0.38          
## [28] pkgconfig_2.0.3     jointDiag_0.4       mgcv_1.8-42        
## [31] htmltools_0.5.5     tidyselect_1.2.0    tibble_3.2.1       
## [34] gridExtra_2.3       codetools_0.2-19    matrixStats_0.63.0 
## [37] fansi_1.0.4         withr_2.5.0         dplyr_1.1.1        
## [40] grid_4.2.2          nlme_3.1-162        jsonlite_1.8.4     
## [43] gtable_0.3.3        lifecycle_1.0.3     magrittr_2.0.3     
## [46] scales_1.2.1        cli_3.6.1           stringi_1.7.12     
## [49] cachem_1.0.7        reshape2_1.4.4      bslib_0.4.2        
## [52] generics_0.1.3      vctrs_0.6.1         RColorBrewer_1.1-3 
## [55] tools_4.2.2         glue_1.6.2          purrr_1.0.1        
## [58] parallel_4.2.2      fastmap_1.1.1       yaml_2.3.7         
## [61] colorspace_2.1-0    groupICA_0.1.1      knitr_1.42         
## [64] rTensor_1.4.8       sass_0.4.5