Rank Selection in Non-negative Matrix Factorization: systematic comparison and a new MAD metric
Contribution to a conference proceedings | FZJ-2019-05146
2019
IEEE
Please use a persistent id in citations: doi:10.1109/IJCNN.2019.8852146
Abstract: Non-Negative Matrix Factorization (NMF) is a powerful dimensionality reduction and factorization method that provides a part-based representation of the data. In the absence of a priori knowledge about the latent dimensionality of the data, it is necessary to select a rank of the reduced representation. Several rank selection methods have been proposed, but no consensus exists on when a method is suitable to use. In this work, we propose a new metric for rank selection based on imputation cross-validation, and we systematically compare it against six other metrics while assessing the effects of data properties. Using synthetic datasets with different properties, our work critically evidences that most methods fail to identify the true rank. We show that properties of the data heavily impact the ability of different methods. Imputation-based metrics, including our new MADimput, provided the best accuracy irrespective of the data type, but no solution worked perfectly in all circumstances. One should therefore carefully assess characteristics of their dataset in order to identify the most suitable metric for rank selection.
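To make the idea of imputation cross-validation for rank selection concrete, here is a minimal Python sketch. It is not the authors' implementation: the abstract does not define MADimput, so scoring by the median absolute residual on held-out entries is an assumption made here for illustration. The function names `masked_nmf` and `imputation_error`, the multiplicative-update solver, and the 10% hold-out fraction are likewise hypothetical choices.

```python
import numpy as np

def masked_nmf(X, mask, rank, n_iter=200, seed=0, eps=1e-9):
    """Fit X ~ W @ H using only entries where mask is True.

    Uses masked multiplicative updates for the squared error;
    this solver is an illustrative choice, not taken from the paper.
    """
    rng = np.random.default_rng(seed)
    n, m = X.shape
    W = rng.random((n, rank)) + eps
    H = rng.random((rank, m)) + eps
    M = mask.astype(float)
    MX = M * X  # held-out entries contribute nothing to the fit
    for _ in range(n_iter):
        H *= (W.T @ MX) / (W.T @ (M * (W @ H)) + eps)
        W *= (MX @ H.T) / ((M * (W @ H)) @ H.T + eps)
    return W, H

def imputation_error(X, rank, holdout_frac=0.1, seed=0):
    """Hide a random fraction of entries, fit on the rest, and score
    the imputed values by their median absolute residual (assumed metric)."""
    rng = np.random.default_rng(seed)
    mask = rng.random(X.shape) >= holdout_frac  # True = observed
    W, H = masked_nmf(X, mask, rank, seed=seed)
    residuals = np.abs(X - W @ H)[~mask]
    return float(np.median(residuals))

# Synthetic non-negative data with a known latent rank of 3.
rng = np.random.default_rng(42)
X = rng.random((30, 3)) @ rng.random((3, 20))

# Score candidate ranks; a low held-out error suggests a suitable rank.
errors = {r: imputation_error(X, r) for r in range(1, 6)}
for r, e in errors.items():
    print(f"rank={r}  held-out median abs. error={e:.4f}")
```

In practice one would repeat the hold-out over several random masks and initializations before comparing ranks, since a single split can be noisy.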