Understanding Matrix Function Normalizations in Covariance Pooling through the Lens of Riemannian Geometry

descriptionPublicationkeyboard_double_arrow_right Article , Conference object , Preprint 01 Jan 2024Embargo end date: 01 Jan 2024 Italy Publisher:ZenodoJournal:CoRR, volume abs/2407.10484Funded by:EC | ELIAS

Authors: Chen, Ziheng; Song, Yue; Wu, Xiao-Jun; Liu, Gaowen; Sebe, Niculae;

doi: 10.48550/arxiv.2407.10484 , 10.5281/zenodo.17689051 , 10.5281/zenodo.17689050

arXiv: 2407.10484

handle: 11572/461416

Understanding Matrix Function Normalizations in Covariance Pooling through the Lens of Riemannian Geometry

- Summary
- Subjects
- Metrics

Abstract

Global Covariance Pooling (GCP) has been demonstrated to improve the performance of Deep Neural Networks (DNNs) by exploiting second-order statistics of high-level representations. GCP typically performs classification of the covariance matrices by applying matrix function normalization, such as matrix logarithm or power, followed by a Euclidean classifier. However, covariance matrices inherently lie in a Riemannian manifold, known as the Symmetric Positive Definite (SPD) manifold. The current literature does not provide a satisfactory explanation of why Euclidean classifiers can be applied directly to Riemannian features after the normalization of the matrix power. To mitigate this gap, this paper provides a comprehensive and unified understanding of the matrix logarithm and power from a Riemannian geometry perspective. The underlying mechanism of matrix functions in GCP is interpreted from two perspectives: one based on tangent classifiers (Euclidean classifiers on the tangent space) and the other based on Riemannian classifiers. Via theoretical analysis and empirical validation through extensive experiments on fine-grained and large-scale visual classification datasets, we conclude that the working mechanism of the matrix functions should be attributed to the Riemannian classifiers they implicitly respect. The code is available at https://github.com/GitZH-Chen/RiemGCP.git.

Accepted to ICLR 2025

Country

Italy

Related Organizations

Jiangnan University
China (People's Republic of)
Jianghan University
China (People's Republic of)
University of Trento
Italy
UNIVERSITY OF TRENTO
Xiangnan University
China (People's Republic of)

View all View all

Keywords

FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Vision and Pattern Recognition (cs.CV), Computer Science - Computer Vision and Pattern Recognition, Machine Learning (cs.LG)

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average

Green

Funded by

EC| ELIAS