Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Article . 2008
License: CC BY
Data sources: Datacite
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Article . 2008
License: CC BY
Data sources: Datacite
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Article . 2008
License: CC BY
Data sources: ZENODO
versions View all 2 versions
addClaim

Evolutionary Feature Selection For Text Documents Using The Svm

Authors: Daniel I. Morariu; Lucian N. Vintan; Volker Tresp;

Evolutionary Feature Selection For Text Documents Using The Svm

Abstract

{"references": ["S. Chakrabarti, \"Mining the Web- Discovering Knowledge from\nhypertext data\", Morgan Kaufmann Press, 2003.", "G. Forman, \"A Pitfall and Solution in Multi-Class Feature Selection for\nText Classification\", Proceedings of the 21st International Conference\non Machine Learning, Banff, Canada, 2004.", "T. Jebara, \"Multi Task Feature and Kernel Selection for SVMs\",\nProceedings of the 21st International Conference on Machine Learning,\nBanff, Canada, 2004.", "T. Mitchell, \"Machine Learning\", McGraw Hill Publishers, 1997.", "D. Mladenic, J. Brank, M. Grobelnik and N. Milic-Frayling, \"Feature\nSelection Using Support Vector Machines\", The 27th Annual\nInternational ACM SIGIR Conference (SIGIR2004), pp 234-241, 2004.", "D. Morariu, \"Classification and Clustering using Support Vector\nMachine\", 2nd PhD Report, University \u00d4\u00c7\u00d7Lucian Blaga\" of Sibiu,\nSeptember, 2005, http://webspace.ulbsibiu.ro/ daniel.morariu/html/Docs\n/Report2.pdf.", "D. Morariu, L. Vintan, \"A Better Correlation of the SVM kernel-s\nParameters\", Proceeding of The 5th RoEduNet International Conference,\nSibiu, June 2006.", "C. Nello, J. Swawe-Taylor, \"An introduction to Support Vector\nMachines\", Cambridge University Press, 2000.", "J. Platt, \"Fast training of support vector machines using sequential\nminimal optimization\". In B. Scholkopf, C. J. C. Burges, and A. J.\nSmola, editors, Advances in Kernel Methods - Support Vector Learning,\npages 185-208, Cambridge, MA, 1999, MIT Press.\n[10] Reuters Corpus: http://about.reuters.com/researchandstandards/corpus/.\nReleased in November 2000.\n[11] B. Schoelkopf, A. Smola, \"Learning with Kernels, Support Vector\nMachines\", MIT Press, London, 2002.\n[12] Whitely, D., A genetic Algorithm Tutorial, Foundations of Genetic\nAlgorithms, ed. Morgan Kaufmann\n[13] G, F. Luger, W. A. Stubblefield, Artificial Intelligence, Addison Wesley\nLongman, Third Edition, 1998\n[14] G. Kim, S. Kim, Feature Selection Using Genetic Algorithms for\nHandwritten Character Recognition, Proceedings of the Seventh\nInternational Workshop on Frontiers in Handwriting Recognition,\nAmsterdam, 2000\n[15] A. E. Eiben, J. E. Smith, Introduction to evolutionary computing,\nSpringer-Verlag, 2003\n[16] D. Morariu, L. Vintan, V. Tresp, Feature Selection Methods for an\nImproved SVM Classifier, Proceedings of the 14th International\nConference on Computational and Information Science, pp 83-89,\nPrague, August 2006"]}

Text categorization is the problem of classifying text documents into a set of predefined classes. After a preprocessing step, the documents are typically represented as large sparse vectors. When training classifiers on large collections of documents, both the time and memory restrictions can be quite prohibitive. This justifies the application of feature selection methods to reduce the dimensionality of the document-representation vector. In this paper, we present three feature selection methods: Information Gain, Support Vector Machine feature selection called (SVM_FS) and Genetic Algorithm with SVM (called GA_SVM). We show that the best results were obtained with GA_SVM method for a relatively small dimension of the feature vector.

Keywords

Genetic Algorithm, Support Vector Machine, Learning with Kernels, and Classification., Feature Selection

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
    OpenAIRE UsageCounts
    Usage byUsageCounts
    visibility views 2
    download downloads 5
  • 2
    views
    5
    downloads
    Powered byOpenAIRE UsageCounts
Powered by OpenAIRE graph
Found an issue? Give us feedback
visibility
download
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
views
OpenAIRE UsageCountsViews provided by UsageCounts
downloads
OpenAIRE UsageCountsDownloads provided by UsageCounts
0
Average
Average
Average
2
5
Green