Deep Reinforcement Learning with Python - Second Edition

預購

台灣風情茄芷袋Supercard造型悠遊卡-台灣(裁型)

可愛ｘ實用ｘ好旅伴，台味悠遊卡熱銷中！

9折 2340元
~~2600~~元
認購希望書包，幫助弱勢孩童上學不中斷！

預計最高可得金幣115點 ? 可100%折抵
活動加倍另計
HAPPY GO享100累1點 4點抵1元 折抵無上限

分類：
英文書＞自然科普＞電腦資訊＞電腦概論
追蹤

? 追蹤分類後，您會在第一時間收到分類新品通知。
作者： Sudharsan,Ravichandiran 追蹤 ? 追蹤作者後，您會在第一時間收到作者新書通知。
出版社： Packt 追蹤 ? 追蹤出版社後，您會在第一時間收到出版社新書通知。
出版日：2020/10/01

信用卡分期： 6期 0利率每期 390元更多分期

分期價：除不盡餘數於第一期收取
3期0利率	每期780 元	接受26 家銀行
6期0利率	每期390 元	接受26 家銀行

3期0利率接受26家銀行

土地銀行、合作金庫、第一銀行、華南銀行、上海銀行、台北富邦、兆豐商銀、花旗(台灣)銀行、澳盛銀行、臺灣企銀、渣打商銀、滙豐(台灣)銀行、臺灣新光商銀、陽信銀行、三信銀行、聯邦銀行、遠東銀行、元大銀行、永豐銀行、玉山銀行、星展銀行、台新銀行、日盛銀行、安泰銀行、中國信託、台灣樂天

6期0利率接受26家銀行

12期0利率接受26家銀行

24期0利率接受22家銀行

土地銀行、合作金庫、第一銀行、華南銀行、上海銀行、台北富邦、花旗(台灣)銀行、澳盛銀行、臺灣企銀、渣打商銀、滙豐(台灣)銀行、臺灣新光商銀、陽信銀行、聯邦銀行、遠東銀行、元大銀行、玉山銀行、星展銀行、台新銀行、日盛銀行、安泰銀行、中國信託

立即代訂

※此為代訂海外書籍，不接受退換貨

※ 本商品會員日滿額金幣加碼回饋最高15倍

購買後進貨　

預訂門市商品

門市庫存

活動訊息

全館滿$1,200送150點金幣，4月歡慶兒童節，童書、玩具、文具滿1000元再送200點金幣！

內容簡介

An example-rich guide for beginners to start their reinforcement and deep reinforcement learning journey with state-of-the-art distinct algorithms

Key FeaturesCovers a vast spectrum of basic-to-advanced RL algorithms with mathematical explanations of each algorithmLearn how to implement algorithms with code by following examples with line-by-line explanationsExplore the latest RL methodologies such as DDPG, PPO, and the use of expert demonstrations

Book Description

With significant enhancements in the quality and quantity of algorithms in recent years, this second edition of Hands-On Reinforcement Learning with Python has been revamped into an example-rich guide to learning state-of-the-art reinforcement learning (RL) and deep RL algorithms with TensorFlow 2 and the OpenAI Gym toolkit.

In addition to exploring RL basics and foundational concepts such as Bellman equation, Markov decision processes, and dynamic programming algorithms, this second edition dives deep into the full spectrum of value-based, policy-based, and actor-critic RL methods. It explores state-of-the-art algorithms such as DQN, TRPO, PPO and ACKTR, DDPG, TD3, and SAC in depth, demystifying the underlying math and demonstrating implementations through simple code examples.

The book has several new chapters dedicated to new RL techniques, including distributional RL, imitation learning, inverse RL, and meta RL. You will learn to leverage stable baselines, an improvement of OpenAI's baseline library, to effortlessly implement popular RL algorithms. The book concludes with an overview of promising approaches such as meta-learning and imagination augmented agents in research.

By the end, you will become skilled in effectively employing RL and deep RL in your real-world projects.

What you will learnUnderstand core RL concepts including the methodologies, math, and codeTrain an agent to solve Blackjack, FrozenLake, and many other problems using OpenAI GymTrain an agent to play Ms Pac-Man using a Deep Q NetworkLearn policy-based, value-based, and actor-critic methodsMaster the math behind DDPG, TD3, TRPO, PPO, and many othersExplore new avenues such as the distributional RL, meta RL, and inverse RLUse Stable Baselines to train an agent to walk and play Atari games

Who this book is for

If you're a machine learning developer with little or no experience with neural networks interested in artificial intelligence and want to learn about reinforcement learning from scratch, this book is for you.

Basic familiarity with linear algebra, calculus, and the Python programming language is required. Some experience with TensorFlow would be a plus.

配送方式

台灣
- 國內宅配：本島、離島
- 到店取貨：
  
  不限金額免運費
海外
- 國際快遞：全球
- 港澳店取：

詳細資料

- 語言
- 英文
- 裝訂
- 紙本平裝
- ISBN
- 9781839210686
- 分級
- 普通級
- 頁數
- 0
- 商品規格
- 出版地
- 美國
- 適讀年齡
- 全齡適讀
- 注音
- 級別

英文書＞自然科普＞電腦資訊＞電腦概論

商品評價

訂購/退換貨須知

加入金石堂 LINE 官方帳號『完成綁定』，隨時掌握出貨動態：

商品運送說明：

本公司所提供的產品配送區域範圍目前僅限台灣本島。注意！收件地址請勿為郵政信箱。
商品將由廠商透過貨運或是郵局寄送。消費者訂購之商品若無法送達，經電話或 E-mail無法聯繫逾三天者，本公司將取消該筆訂單，並且全額退款。
當廠商出貨後，您會收到E-mail出貨通知，您也可透過【訂單查詢】確認出貨情況。
產品顏色可能會因網頁呈現與拍攝關係產生色差，圖片僅供參考，商品依實際供貨樣式為準。
如果是大型商品（如：傢俱、床墊、家電、運動器材等）及需安裝商品，請依商品頁面說明為主。訂單完成收款確認後，出貨廠商將會和您聯繫確認相關配送等細節。
偏遠地區、樓層費及其它加價費用，皆由廠商於約定配送時一併告知，廠商將保留出貨與否的權利。

提醒您！！
金石堂及銀行均不會請您操作ATM! 如接獲電話要求您前往ATM提款機，請不要聽從指示，以免受騙上當！

退換貨須知：

**提醒您，鑑賞期不等於試用期，退回商品須為全新狀態**

依據「消費者保護法」第19條及行政院消費者保護處公告之「通訊交易解除權合理例外情事適用準則」，以下商品購買後，除商品本身有瑕疵外，將不提供7天的猶豫期：
1. 易於腐敗、保存期限較短或解約時即將逾期。（如：生鮮食品）
2. 依消費者要求所為之客製化給付。（客製化商品）
3. 報紙、期刊或雜誌。（含MOOK、外文雜誌）
4. 經消費者拆封之影音商品或電腦軟體。
5. 非以有形媒介提供之數位內容或一經提供即為完成之線上服務，經消費者事先同意始提供。（如：電子書、電子雜誌、下載版軟體、虛擬商品…等）
6. 已拆封之個人衛生用品。（如：內衣褲、刮鬍刀、除毛刀…等）
若非上列種類商品，均享有到貨7天的猶豫期（含例假日）。
辦理退換貨時，商品（組合商品恕無法接受單獨退貨）必須是您收到商品時的原始狀態（包含商品本體、配件、贈品、保證書、所有附隨資料文件及原廠內外包裝…等），請勿直接使用原廠包裝寄送，或於原廠包裝上黏貼紙張或書寫文字。
退回商品若無法回復原狀，將請您負擔回復原狀所需費用，嚴重時將影響您的退貨權益。