《多主体强化学习协作策略研究》 - 孙若莹，赵刚著 - Meg Book Store - 香港.大書城

	登入帳戶　 \|　訂單查詢　 \|　購物車/收銀台( 0 )　\|　在線留言板　 \|　付款方式　 \|　運費計算　 \|　聯絡我們　 \|　幫助中心　\|　加入書簽
		會員登入新用戶登記

HOME

新書上架

暢銷書架

好書推介

會員書架精選

2023年度TOP

臺灣用戶

品種：超過100萬種各類書籍/音像和精品，正品正價，放心網購，悭钱省心

服務：香港／台灣／澳門／海外

送貨：速遞／郵局／服務站

新書上架：簡體書繁體書
暢銷書架：簡體書繁體書
好書推介：簡體書繁體書

四月出版：大陸書台灣書
三月出版：大陸書台灣書
二月出版：大陸書台灣書
一月出版：大陸書台灣書
12月出版：大陸書台灣書
11月出版：大陸書台灣書
十月出版：大陸書台灣書
九月出版：大陸書台灣書
八月出版：大陸書台灣書
七月出版：大陸書台灣書
六月出版：大陸書台灣書
五月出版：大陸書台灣書
四月出版：大陸書台灣書
三月出版：大陸書台灣書
二月出版：大陸書台灣書

『簡體書』多主体强化学习协作策略研究

書城自編碼： 2446484
分類：簡體書→大陸圖書→計算機/網絡→人工智能
作者：孙若莹，赵刚著
國際書號(ISBN)： 9787302368304
出版社：清华大学出版社
出版日期： 2014-08-01
版次： 1 印次： 1
頁數/字數： 164/222000
書度/開本： 16开釘裝：平装

售價：HK$ 124.8

我要買件

** 我創建的書架 **
未登入.

新書推薦：

桥头楼上

《桥头楼上》
售價：HK$ 63.4

中国近代史（中国史学大家蒋廷黻典作品）

《中国近代史（中国史学大家蒋廷黻典作品）》
售價：HK$ 47.8

先跳了再说：仓本聪的工作与生活哲学

《先跳了再说：仓本聪的工作与生活哲学》
售價：HK$ 67.2

抗焦虑饮食（赠实践手册）

《抗焦虑饮食（赠实践手册）》
售價：HK$ 106.8

生活不是掷骰子：理性决策的贝叶斯思维

《生活不是掷骰子：理性决策的贝叶斯思维》
售價：HK$ 82.8

动态商业战略

《动态商业战略》
售價：HK$ 82.8

万有引力书系·基辅罗斯：东斯拉夫文明的起源

《万有引力书系·基辅罗斯：东斯拉夫文明的起源》
售價：HK$ 141.6

康熙的红票：全球化中的清朝

《康熙的红票：全球化中的清朝》
售價：HK$ 93.6

建議一齊購買：

+

HK$ 100.1
《多智能体机器学习：强化学习方法》

+

HK$ 114.6
《 TensorFlow：实战Google深度学习框架》

+

HK$ 145.2
《基于增强学习的制造系统调度》

+

HK$ 144.0
《大规模强化学习》

+

HK$ 146.2
《决策用强化与系统性机器学习》

+

HK$ 143.0
《运筹学导论（第9版·基础篇）（管理科学与工程经典译丛）》

內容簡介：

多主体的研究与应用是近年来备受关注的热点领域，多主体强化学习理论与方法、多主体协作策略的研究是该领域重要研究方向，其理论和应用价值极为广泛，备受广大从事计算机应用、人工智能、自动控制、以及经济管理等领域研究者的关注。孙若莹、赵刚所著的《多主体强化学习协作策略研究》清晰地介绍了多主体、强化学习及多主体协作等基本概念和基础内容，明确地阐述了有关多主体强化学习、协作策略研究的发展过程及最新动向，深入地探讨了多主体强化学习与协作策略的理论与方法，具体地分析了多主体强化学习与协作策略在相关研究领域的应用方法。
全书系统脉络清晰、基本概念清楚、图表分析直观，注重内容的体系化和实用性。通过本书的阅读和学习，读者即可掌握多主体强化学习及协作策略的理论和方法，更可了解在实际工作中应用这些研究成果的手段。本书可作为从事计算机应用、人工智能、自动控制、以及经济管理等领域研究者的学习和阅读参考，同时高等院校相关专业研究生以及人工智能爱好者也可从中获得借鉴。

目錄：

Chapter 1 Introduction
1.1 Reinforcement Learning
1.1.1 Generality of Reinforcement Learning
1.1.2 Reinforcement Learning on Markov Decision Processes
1.1.3 Integrating Reinforcement Learning into Agent Architecture
1.2 Multiagent Reinforcement Learning
1.2.1 Multiagent Systems
1.2.2 Reinforcement Learning in Multiagent Systems
1.2.3 Learning and Coordination in Multiagent Systems
1.3 Ant System for Stochastic Combinatorial Optimization
1.3.1 Ants Forage Behavior
1.3.2 Ant Colony Optimization
1.3.3 MAX-MIN Ant System
1.4 Motivations and Consequences
1.5 Book Summary
Bibliography
Chapter 2 Reinforcement Learning and Its Combination with Ant Colony System
2.1 Introduction
2.2 Investigation into Reinforcement Learning and Swarm Intelligence
2.2.1 Temporal Differences Learning Method
2.2.2 Active Exploration and Experience Replay in Reinforcement Learning
2.2.3 Ant Colony System for Traveling Salesman Problem
2.3 The Q-ACS Multiagent Learning Method
2.3.1 The Q-ACS Learning Algorithm
2.3.2 Some Properties of the Q-ACS Learning Method
2.3.3 Relation with Ant-Q Learning Method
2.4 Simulations and Results
2.5 Conclusions
Bibliography
Chapter 3 Multiagent Learning Methods Based on Indirect Media Information Sharing
3.1 Introduction
3.2 The Multiagent Learning Method Considering Statistics Features
3.2.1 Accelerated K-certainty Exploration
3.2.2 The T-ACS Learning Algorithm
3.3 The Heterogeneous Agents Learning
3.3.1 The D-ACS Learning Algorithm
3.3.2 Some Discussions about the D-ACS Learning Algorithm
3.4 Comparisons with Related State-of-the-arts
3.5 Simulations and Results
3.5.1 Experimental Results on Hunter Game
3.5.2 Experimental Results on Traveling Salesman Problem
3.6 Conclusions
Bibliography
Chapter 4 Action Conversion Mechanism in Multiagent Reinforcement Learning
4.1 Introduction
4.2 Model-Based Reinforcement Learning
4.2.1 Dyna-Q Architecture
4.2.2 Prioritized Sweeping Method
4.2.3 Minimax Search and Reinforcement Learning
4.2.4 RTP-Q Learning
4.3 The Q-ac Multiagent Reinforcement Learning
4.3.1 Task Model
4.3.2 Converting Action
4.3.3 Multiagent Cooperation Methods
4.3.4 Q-value Update
4.3.5 The Q-ac Learning Algorithm
4.3.6 Using Adversarial Action Instead o{ ~ Probability Exploration
4.4 Simulations and Results
4.5 Conclusions
Bibliography
Chapter 5 Multiagent Learning Approaches Applied to Vehicle Routing Problems
5.1 Introduction
5.2 Related State-of-the-arts
5.2.1 Some Heuristic Algorithms
5.2.2 The Vehicle Routing Problem with Time Windows
5.3 The Multiagent Learning Applied to CVRP and VRPTW
5.4 Simulations and Results
5.5 Conclusions
Bibliography
Chapter 6 Multiagent learning Methods Applied to Multicast Routing Problems
6.1 Introduction
6.2 Multiagent Q-learning Applied to the Network Routing
6.2.1 Investigation into Q-routing
6.2.2 AntNet Investigation
6.3 Some Multicast Routing in Mobile Ad Hoc Networks
6.4 The Multiagent Q-learning in the Q-MAP Multicast Routing Method
6.4.1 Overview of the Q-MAP Multicast Routing
6.4.2 Join Query Packet, Join Reply Packet and Membership Maintenance
6.4.3 Convergence Proof of Q-MAP Method
6.5 Simulations and Results
6.6 Conclusions
Bibliography
Chapter 7 Multiagent Reinforcement Learning for Supply Chain Management
7.1 Introduction
7.2 Related Issues of Supply Chain Management
7.3 SCM Network Scheme with Multiagent Reinforcement Learning
7.3.1 SCM with Multiagent
7.3.2 The RL Agents in SCM Network
7.4 Application of the Q-ACS Method to SCM
7.4.1 The Application Model in SCM
7.4.2 The Q-ACS Learning Applied to the SCM System
7.5 Conclusion
Bibliography
Chapter 8 Multiagent Learning Applied in Supply Chain Ordering Management
8.1 Introduction
8.2 Supply Chain Management Model
8.3 The Multiagent Learning Model for SC Ordering Management
8.4 Simulations and Results
8.5 Conclusions
Bibliography

書城介紹　 |　合作申請　|　索要書目　 |　新手入門　|　聯絡方式　 |　幫助中心　|　找書說明　 |　送貨方式　|　付款方式 香港用户　 |　台灣用户　|　大陸用户　|　海外用户

megBook.com.hk

Copyright © 2013 - 2024 （香港）大書城有限公司　 All Rights Reserved.