publication . Article . 2020

Short‐text feature expansion and classification based on nonnegative matrix factorization

Ling Zhang; Wenchao Jiang; Zhiming Zhao;
Open Access English
  • Published: 22 Sep 2020 Journal: International Journal of Intelligent Systems (issn: 0884-8173, eissn: 1098-111X, Copyright policy)
In this paper, a non‐negative matrix factorization feature expansion (NMFFE) approach was proposed to overcome the feature‐sparsity issue when expanding features of short‐text. First, we took the internal relationships of short texts and words into account when segmenting words from texts and constructing their relationship matrix. Second, we utilized the Dual regularization non‐negative matrix tri‐factorization (DNMTF) algorithm to obtain the words clustering indicator matrix, which was used to get the feature space by dimensionality reduction methods. Thirdly, words with close relationship were selected out from the feature space and added into the short‐text ...
Persistent Identifiers
free text keywords: Theoretical Computer Science, Human-Computer Interaction, Software, Artificial Intelligence, correlation, feature extension, nonnegative matrix factorization, short text classification, Non-negative matrix factorization, Regularization (mathematics), Word2vec, Matrix decomposition, Cluster analysis, Dimensionality reduction, Matrix (mathematics), Computer science, Feature vector, Pattern recognition, Artificial intelligence, business.industry, business
Funded by
smART socIal media eCOsytstem in a blockchaiN Federated environment
  • Funder: European Commission (EC)
  • Project Code: 825134
  • Funding stream: H2020 | RIA
Validated by funder
EC| Blue Cloud
Blue Cloud
Blue-Cloud: Piloting innovative services for Marine Research & the Blue Economy
  • Funder: European Commission (EC)
  • Project Code: 862409
  • Funding stream: H2020 | IA
ENVironmental Research Infrastructures building Fair services Accessible for society, Innovation and Research
  • Funder: European Commission (EC)
  • Project Code: 824068
  • Funding stream: H2020 | RIA
Any information missing or wrong?Report an Issue