ALBERT — All Library Books, journals and Electronic Records Telegrafenberg

Toward Optimal Feature Selection in Naive Bayes for Text Categorization (2016)

Institute of Electrical and Electronics Engineers (IEEE)

In: IEEE Transactions on Knowledge and Data Engineering

Details

Publication Date: 2016-08-05

Description: Automated feature selection is important for text categorization: it reduces the feature size and speeds up the learning process of classifiers. In this paper, we present a novel and efficient feature selection framework based on information theory, which aims to rank the features by their discriminative capacity for classification. We first revisit two information measures, the Kullback-Leibler divergence and the Jeffreys divergence, for binary hypothesis testing, and analyze their asymptotic properties in relation to the type I and type II errors of a Bayesian classifier. We then introduce a new divergence measure, called the Jeffreys-Multi-Hypothesis (JMH) divergence, to measure multi-distribution divergence for multi-class classification. Based on the JMH-divergence, we develop two efficient feature selection methods for text categorization, termed the maximum discrimination ($MD$) and $MD-\chi^2$ methods. The promising results of extensive experiments demonstrate the effectiveness of the proposed approaches.
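
The two binary-hypothesis measures named in the description can be illustrated with a minimal sketch. It is not the paper's MD method and does not implement the JMH-divergence; it only ranks binary (present/absent) term features by the Jeffreys divergence between two class-conditional Bernoulli distributions. The function names and the toy probabilities are assumptions made for this example, and all probabilities are assumed to lie strictly between 0 and 1 (for instance after smoothing).

```python
import math


def kl_divergence(p, q):
    """Kullback-Leibler divergence KL(p || q) for two discrete distributions.

    Assumes q[i] > 0 wherever p[i] > 0; terms with p[i] == 0 contribute 0.
    """
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def jeffreys_divergence(p, q):
    """Jeffreys divergence: the symmetrized sum KL(p || q) + KL(q || p)."""
    return kl_divergence(p, q) + kl_divergence(q, p)


def rank_features(p_class0, p_class1):
    """Rank features by Jeffreys divergence, most discriminative first.

    p_class0 / p_class1 map a (hypothetical) term to the probability that
    it occurs in a document of class 0 / class 1.
    """
    scores = {}
    for term in p_class0:
        a, b = p_class0[term], p_class1[term]
        # Each term is modeled as a Bernoulli variable: [present, absent].
        scores[term] = jeffreys_divergence([a, 1 - a], [b, 1 - b])
    return sorted(scores, key=scores.get, reverse=True)
```

For example, `rank_features({"goal": 0.9, "the": 0.5}, {"goal": 0.1, "the": 0.5})` places `"goal"` ahead of `"the"`, because a term that is equally likely in both classes has zero divergence.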

Print ISSN: 1041-4347

Electronic ISSN: 1558-2191

Topics: Computer Science

Published by Institute of Electrical and Electronics Engineers (IEEE) on behalf of The IEEE Computer Society.
