Feature selection for classification of hyperspectral data by SVM

Article English OPEN
Pal, Mahesh ; Foody, Giles M. (2010)
  • Publisher: Institute of Electrical and Electronics Engineers
  • Related identifiers: doi: 10.1109/TGRS.2009.2039484
  • Subject:
    acm: ComputingMethodologies_PATTERNRECOGNITION

SVM are attractive for the classification of remotely sensed data with some claims that the method is insensitive to the dimensionality of the data and so not requiring a dimensionality reduction analysis in pre-processing. Here, a series of classification analyses with two hyperspectral sensor data sets reveal that the accuracy of a classification by a SVM does vary as a function of the number of features used. Critically, it is shown that the accuracy of a classification may decline significantly (at 0.05 level of statistical significance) with the addition of features, especially if a small training sample is used. This highlights a dependency of the accuracy of classification by a SVM on the dimensionality of the data and so the potential value of undertaking a feature selection analysis prior to classification. Additionally, it is demonstrated that even when a large training sample is available feature selection may still be useful. For example, the accuracy derived from the use of a small number of features may be non-inferior (at 0.05% level of significance) to that derived from the use of a larger feature set providing potential advantages in relation to issues such as data storage and computational processing costs. Feature selection may, therefore, be a valuable analysis to include in pre-processing operations for classification by a SVM.
  • References (39)
    39 references, page 1 of 4

    [9] S. Lu, K. Oki, Y. Shimizu, and K. Omasa, “Comparison between several feature extraction/classification methods for mapping complicated agricultural land use patches using airborne hyperspectral data,” International Journal of Remote Sensing, vol. 28, no. 5, pp. 963-984, Jan. 2007.

    [17] D. M. J. Tax, D. de Ridder, and R.P.W. Duin, “Support vector classifiers: A first look,” in: H.E. Bal, H. Corporaal, P.P. Jonker, J.F.M. Tonino (eds.), Proceedings of 3rd Annual Conference of the Advanced School for Computing and Imaging (Heijen, NL, June 2-4), ASCI, Delft, pp. 253-258, 1997.

    [18] J. A. Gualtieri, “The support vector machine (SVM) algorithm for supervised classification of hyperspectral remote sensing data,” In G. Camps-Valls and L. Bruzzone (eds) Kernel Methods for Remote Sensing Data Analysis, Wiley, Chichester, in press, 2009.

    [19] M. Pal, and P. M. Mather, “Assessment of the effectiveness of support vector machines for hyperspectral data,” Future Generation Computer Systems, vol. 20, no. 7, pp. 1215-1225, October 2004.

    [20] M. Pal and P. M. Mather, “Some issue in classification of DAIS hyperspectral data,” International Journal of Remote Sensing, vol. 27, no. 14, pp. 2895-2916, July 2006.

    [21] F. Melgani and L. Bruzzone, “Classification of hyperspectral remote sensing images with support vector machines,” IEEE Transaction of Geoscience and Remote Sensing, vol. 42, no. 8, pp. 1778-1790, August 2004.

    [22] I. Guyon, J. Weston, S. Barnhill, and V. N. Vapnik, “Gene selection for cancer classification using support vector machines,” Machine Learning, vol. 46, no. 1-3, pp. 389-422, Jan. 2002. Information Sciences, Vol. XXXVI, Part 7/C50, p. 258-263, 2007. ISPRS, Davos (CH). ISSN 1682-1777.

    [24] H. Liu, “Evolving feature selection,” IEEE Intelligent Systems, vol. 20, pp. 64-76, November 2005.

    [25] H. Liu and H. Motoda, Feature Extraction, Construction and Selection: A Data Mining Perspective. Massachusetts: Kluwer Academic Publishers, 1998.

    [26] P. M. Mather, Computer Processing of Remotely-Sensed Images: An Introduction. Third Edition, Chichester: John Wiley and Sons, 2004.

  • Metrics
    No metrics available
Share - Bookmark