Algorithms for reinforcement learning (Record no. 7782)
[ view plain ]
000 -LEADER | |
---|---|
fixed length control field | 01793nam a2200193 4500 |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION | |
fixed length control field | 220306b ||||| |||| 00| 0 eng d |
020 ## - INTERNATIONAL STANDARD BOOK NUMBER | |
International Standard Book Number | 9781608454921 (pbk.) |
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER | |
Classification number | 006.31 |
Item number | SZE |
100 ## - MAIN ENTRY--PERSONAL NAME | |
Personal name | Szepesvári, Csaba |
245 ## - TITLE STATEMENT | |
Title | Algorithms for reinforcement learning |
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT) | |
Place of publication, distribution, etc. | Williston |
Name of publisher, distributor, etc. | Morgan & Claypool |
Date of publication, distribution, etc. | 2010 |
300 ## - PHYSICAL DESCRIPTION | |
Extent | viii, 89p., |
500 ## - GENERAL NOTE | |
General note | https://www.morganclaypoolpublishers.com/catalog_Orig/product_info.php?products_id=31 |
520 ## - SUMMARY, ETC. | |
Summary, etc. | Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book, we focus on those algorithms of reinforcement learning that build on the powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning problems, describe the core ideas, note a large number of state of the art algorithms, followed by the discussion of their theoretical properties and limitations. |
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name as entry element | Machine learning |
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name as entry element | Markov processes |
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM | |
Topical term or geographic name as entry element | Reinforcement learning |
942 ## - ADDED ENTRY ELEMENTS (KOHA) | |
Koha item type | Book |
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM | |
9 (RLIN) | 481 |
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM | |
9 (RLIN) | 858 |
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM | |
9 (RLIN) | 859 |
Withdrawn status | Lost status | Source of classification or shelving scheme | Damaged status | Not for loan | Collection code | bill no. | bill date | Home library | Current library | Date acquired | Source of acquisition | Cost, normal purchase price | Total Checkouts | Full call number | Barcode | Date last seen | Date checked out | Price effective from | Koha item type |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Dewey Decimal Classification | Computer science | 558/21-22 | 25/02/2022 | Plaksha University Library | Plaksha University Library | 06/03/2022 | T V Enterprises | 3790.00 | 1 | 006.31 SZE | 002861 | 29/05/2024 | 18/12/2023 | 06/03/2022 | Book | ||||
Dewey Decimal Classification | Computer science | 558/21-22 | 25/02/2022 | Plaksha University Library | Plaksha University Library | 06/03/2022 | T V Enterprises | 3790.00 | 006.31 SZE | 002862 | 09/07/2023 | 06/03/2022 | Book |