ISSN:
1573-0409
Keywords:
Q-learning algorithm ; reinforcement learning ; experience generalisation
Source:
Springer Online Journal Archives 1860-2000
Topics:
Computer Science ; Mechanical Engineering, Materials Science, Production Engineering, Mining and Metallurgy, Traffic Engineering, Precision Mechanics
Notes:
Abstract In recent years, temporal-difference methods have been put forward as convenient tools for reinforcement learning. Techniques based on temporal differences, however, suffer from a serious drawback: as stochastic adaptive algorithms, they may need extensive exploration of the state-action space before convergence is achieved. Although the basic methods are now reasonably well understood, it is precisely the structural simplicity of the reinforcement learning principle (learning through experimentation) that causes these excessive demands on the learning agent. Moreover, the agent is very rarely a tabula rasa: some rough knowledge about the characteristics of the surrounding environment is often available. In this paper, I present methods for embedding a priori knowledge in a reinforcement learning technique in such a way that both the mathematical structure of the basic learning algorithm and the capacity to generalise experience across the state-action space are preserved. Extensive experimental results show that the resulting variants may lead to good performance, provided a sensible balance is struck between risky use of imprecise prior knowledge and cautious use of learning experience.
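The abstract does not spell out the embedding mechanism itself. One common way to inject imprecise a priori knowledge while leaving the Q-learning update rule untouched is to seed the value table from a heuristic prior instead of zeros, so early action choices are biased by the prior and experience gradually overrides it. The sketch below illustrates only that general idea, not the paper's actual method; the prior and step functions are hypothetical placeholders supplied by the caller.

    # Minimal sketch (assumed technique, not the paper's exact variant):
    # tabular Q-learning whose Q-table is initialised from a prior estimate.
    import random

    def q_learning_with_prior(n_states, n_actions, prior, step, episodes=500,
                              alpha=0.1, gamma=0.95, epsilon=0.1):
        # Seed the table with the (possibly imprecise) prior knowledge.
        Q = [[prior(s, a) for a in range(n_actions)] for s in range(n_states)]
        for _ in range(episodes):
            s, done = 0, False
            while not done:
                # Epsilon-greedy: cautious use of experience vs. exploration.
                if random.random() < epsilon:
                    a = random.randrange(n_actions)
                else:
                    a = max(range(n_actions), key=lambda act: Q[s][act])
                s2, r, done = step(s, a)  # hypothetical environment step
                target = r if done else r + gamma * max(Q[s2])
                # Standard Q-learning update: the mathematical structure of the
                # basic algorithm is kept; learning overrides the prior over time.
                Q[s][a] += alpha * (target - Q[s][a])
                s = s2
        return Q

An over-optimistic prior here drives risky early exploitation of the prior knowledge, while the unmodified update keeps convergence behaviour intact, which mirrors the balance the abstract describes.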
Type of Medium:
Electronic Resource
URL:
http://dx.doi.org/10.1023/A:1007968115863