ALBERT — All Library Books, journals and Electronic Records Telegrafenberg

Hits per page

hits 1 - 2 | 2 hits

Sorting

Electronic Resource

Prioritized sweeping: Reinforcement learning with less data and less time (1993)

Moore, Andrew W. ; Atkeson, Christopher G.

Springer

Machine learning 13 (1993), S. 103-130

add to mindlist on the mindlist

Details

ISSN: 0885-6125

Keywords: Memory-based learning ; learning control ; reinforcement learning ; temporal differencing ; asynchronous dynamic programming ; heuristic search ; prioritized sweeping

Source: Springer Online Journal Archives 1860-2000

Topics: Computer Science

Notes: Abstract We present a new algorithm,prioritized sweeping, for efficient prediction and control of stochastic Markov systems. Incremental learning methods such as temporal differencing and Q-learning have real-time performance. Classical methods are slower, but more accurate, because they make full use of the observations. Prioritized sweeping aims for the best of both worlds. It uses all previous experiences both to prioritize important dynamic programming sweeps and to guide the exploration of state-space. We compare prioritized sweeping with other reinforcement learning schemes for a number of different stochastic optimal control problems. It successfully solves large state-space real-time problems with which other methods have difficulty.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1007/BF00993104

Permalink

	Location	Call Number	Expected	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

Electronic Resource

Prioritized Sweeping: Reinforcement Learning with Less Data and Less Time (1993)

Moore, Andrew W. ; Atkeson, Christopher G.

Springer

Machine learning 13 (1993), S. 103-130

add to mindlist on the mindlist

Details

ISSN: 0885-6125

Keywords: Memory-based learning ; learning control ; reinforcement learning ; temporal differencing ; asynchronous dynamic programming ; heuristic search ; prioritized sweeping

Source: Springer Online Journal Archives 1860-2000

Topics: Computer Science

Notes: Abstract We present a new algorithm, prioritized sweeping, for efficient prediction and control of stochastic Markov systems. Incremental learning methods such as temporal differencing and Q-learning have real-time performance. Classical methods are slower, but more accurate, because they make full use of the observations. Prioritized sweeping aims for the best of both worlds. It uses all previous experiences both to prioritize important dynamic programming sweeps and to guide the exploration of state-space. We compare prioritized sweeping with other reinforcement learning schemes for a number of different stochastic optimal control problems. It successfully solves large state-space real-time problems with which other methods have difficulty.

Type of Medium: Electronic Resource

URL: http://dx.doi.org/10.1023/A:1022635613229

Permalink

	Location	Call Number	Expected	Availability

Others were also interested in ...

Paper (German National Licenses)

Fulltext

hits 1 - 2 | 2 hits