ALBERT

All Library Books, journals and Electronic Records Telegrafenberg

  • 1
    Electronic Resource
    Springer
    Machine Learning 15 (1994), pp. 25-41
    ISSN: 0885-6125
    Keywords: decision trees ; noisy data ; induction ; attribute selection
    Source: Springer Online Journal Archives 1860-2000
    Topics: Computer Science
    Notes: Abstract: Recent work by Mingers and by Buntine and Niblett on the performance of various attribute selection measures has addressed the topic of random selection of attributes in the construction of decision trees. This article is concerned with the mechanisms underlying the relative performance of conventional and random attribute selection measures. The three experiments reported here employed synthetic data sets, constructed so as to have the precise properties required to test specific hypotheses. The principal underlying idea was that the performance decrement typical of random attribute selection is due to two factors. First, there is a greater chance that informative attributes will be omitted from the subset selected for the final tree. Second, there is a greater risk of overfitting, which is caused by attributes of little or no value in discriminating between classes being “locked in” to the tree structure, near the root. The first experiment showed that the performance decrement increased with the number of available pure-noise attributes. The second experiment indicated that there was little decrement when all the attributes were of equal importance in discriminating between classes. The third experiment showed that a rather greater performance decrement (than in the second experiment) could be expected if the attributes were all informative, but to different degrees. [A simulation sketch of the pure-noise effect appears after this list.]
    Type of Medium: Electronic Resource
  • 2
    Electronic Resource
    Springer
    Machine Learning 15 (1994), pp. 321-329
    ISSN: 0885-6125
    Keywords: Decision trees ; noise ; induction ; unbiased attribute selection ; information-based measures
    Source: Springer Online Journal Archives 1860-2000
    Topics: Computer Science
    Notes: Abstract: A fresh look is taken at the problem of bias in information-based attribute selection measures, used in the induction of decision trees. The approach uses statistical simulation techniques to demonstrate that the usual measures such as information gain, gain ratio, and a new measure recently proposed by Lopez de Mantaras (1991) are all biased in favour of attributes with large numbers of values. It is concluded that approaches which utilise the chi-square distribution are preferable because they compensate automatically for differences between attributes in the number of levels they take. [A second sketch after this list illustrates this kind of bias simulation.]
    Type of Medium: Electronic Resource
  • 3
    Electronic Resource
    Springer
    Artificial Intelligence Review 9 (1995), pp. 3-18
    ISSN: 1573-7462
    Keywords: induction ; superstition ; learning ; behaviourism ; obsessive-compulsive disorder
    Source: Springer Online Journal Archives 1860-2000
    Topics: Computer Science
    Notes: Abstract: The behaviourist perspective on superstition is explained and an analogy is drawn between superstitious behaviour and the phenomenon of excessive branching in tree-based inductive learning algorithms operating under uncertainty. The argument is then extended to include cognitive aspects of learning and superstition. Further analogies are made with obsessive-compulsive disorder. Ways in which these phenomena might be simulated are indicated.
    Type of Medium: Electronic Resource
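The abstract of the first record attributes the cost of random attribute selection to two mechanisms: informative attributes being omitted, and pure-noise attributes being locked in near the root. Below is a minimal sketch of that first experiment's idea, not the authors' code: scikit-learn's DecisionTreeClassifier with splitter="random" and max_features=1 stands in for random attribute selection, and the data generator, noise-attribute counts, and label-noise rate are illustrative assumptions.

```python
# Sketch: performance decrement of random vs. informed attribute selection
# as pure-noise attributes are added (assumed setup, not the paper's).
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)
n = 1000

for n_noise in (0, 5, 20):
    # Two informative binary attributes; the class depends only on them,
    # with 10% label noise so overfitting to noise attributes is possible.
    X_inf = rng.integers(0, 2, size=(n, 2))
    y = (X_inf[:, 0] & X_inf[:, 1]) ^ (rng.random(n) < 0.1)
    X = np.hstack([X_inf, rng.integers(0, 2, size=(n, n_noise))])
    X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)
    # "best" = informed selection; "random" with max_features=1 picks one
    # attribute at random at every node, i.e. random attribute selection.
    for splitter, max_feat in (("best", None), ("random", 1)):
        tree = DecisionTreeClassifier(splitter=splitter,
                                      max_features=max_feat, random_state=0)
        acc = tree.fit(X_tr, y_tr).score(X_te, y_te)
        print(f"noise attrs={n_noise:2d}  {splitter:6s}  accuracy={acc:.3f}")
```

The pattern the abstract predicts is a widening accuracy gap between the two regimes as the number of pure-noise attributes grows.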
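The second record's abstract states that statistical simulation demonstrates the bias of information gain in favour of attributes with many values. Here is a minimal sketch of that style of simulation, under assumed parameters: the class labels are pure noise, so no attribute carries any signal, yet the mean gain grows with the number of attribute levels.

```python
# Sketch: bias of information gain toward many-valued attributes,
# measured on pure-noise data (assumed parameters, not the paper's).
import numpy as np

rng = np.random.default_rng(0)
n, trials = 100, 2000

def entropy(labels):
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

def info_gain(attr, labels):
    # Entropy of the class minus the attribute-weighted entropy of the splits.
    gain = entropy(labels)
    for v in np.unique(attr):
        mask = attr == v
        gain -= mask.mean() * entropy(labels[mask])
    return gain

for n_values in (2, 4, 8, 16):
    gains = [info_gain(rng.integers(0, n_values, n), rng.integers(0, 2, n))
             for _ in range(trials)]
    print(f"levels={n_values:2d}  mean gain on pure noise={np.mean(gains):.4f}")
```

Since no attribute here tells us anything about the class, an unbiased measure would score all of them alike; the mean gain nevertheless rises with the number of levels, which is the bias the paper demonstrates and which chi-square-based approaches are said to compensate for automatically.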