Normal view MARC view ISBD view

Algorithms for reinforcement learning (Record no. 7782)

MARC details
000 -LEADER
fixed length control field	01793nam a2200193 4500
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field	220306b \|\|\|\|\| \|\|\|\| 00\| 0 eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
International Standard Book Number	9781608454921 (pbk.)
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number	006.31
Item number	SZE
100 ## - MAIN ENTRY--PERSONAL NAME
Personal name	Szepesvári, Csaba
245 ## - TITLE STATEMENT
Title	Algorithms for reinforcement learning
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Place of publication, distribution, etc.	Williston
Name of publisher, distributor, etc.	Morgan & Claypool
Date of publication, distribution, etc.	2010
300 ## - PHYSICAL DESCRIPTION
Extent	viii, 89p.,
500 ## - GENERAL NOTE
General note	https://www.morganclaypoolpublishers.com/catalog_Orig/product_info.php?products_id=31
520 ## - SUMMARY, ETC.
Summary, etc.	Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book, we focus on those algorithms of reinforcement learning that build on the powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning problems, describe the core ideas, note a large number of state of the art algorithms, followed by the discussion of their theoretical properties and limitations.
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element	Machine learning
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element	Markov processes
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element	Reinforcement learning
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Koha item type	Book
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
9 (RLIN)	481
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
9 (RLIN)	858
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
9 (RLIN)	859

Holdings
Withdrawn status	Lost status	Source of classification or shelving scheme	Damaged status	Not for loan	Collection code	bill no.	bill date	Home library	Current library	Date acquired	Source of acquisition	Cost, normal purchase price	Total Checkouts	Total Renewals	Full call number	Barcode	Date last seen	Date checked out	Price effective from	Koha item type	Checked out
		Dewey Decimal Classification			Computer science	558/21-22	25/02/2022	Plaksha University Library	Plaksha University Library	06/03/2022	T V Enterprises	3790.00	3	4	006.31 SZE	002861	09/07/2025	28/05/2025	06/03/2022	Book
		Dewey Decimal Classification			Computer science	558/21-22	25/02/2022	Plaksha University Library	Plaksha University Library	06/03/2022	T V Enterprises	3790.00	1		006.31 SZE	002862	22/01/2025	22/01/2025	06/03/2022	Book	21/07/2025