[RL] (Spinning Up) Intro to Policy Optimization
( 본 글은 OpenAI Spinning Up을 개인적으로 정리한 글입니다. 원본) Part 3: Intro to Policy Optimization — Spinning Up documentation In this section, we’ll discuss the mathematical foundations of policy optimization algorithms, and connect the material to sample code. We will cover three key results in the theory of policy gradients: In the end, we’ll tie those results together and desc spinningup.openai.com 이번 글에서는..
Study/AI
2019. 5. 22. 05:11
공지사항
최근에 올라온 글
최근에 달린 댓글
- Total
- Today
- Yesterday
TAG
- Expression Blend 4
- 한빛미디어
- End-To-End
- processing
- Off-policy
- RL
- 강화학습
- arduino
- Windows Phone 7
- Policy Gradient
- Kinect
- SketchFlow
- Kinect SDK
- ai
- bias
- DepthStream
- Offline RL
- TensorFlow Lite
- PowerPoint
- ColorStream
- dynamic programming
- Variance
- reward
- Pipeline
- Distribution
- 파이썬
- 딥러닝
- Gan
- windows 8
- Kinect for windows
일 | 월 | 화 | 수 | 목 | 금 | 토 |
---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | |||
5 | 6 | 7 | 8 | 9 | 10 | 11 |
12 | 13 | 14 | 15 | 16 | 17 | 18 |
19 | 20 | 21 | 22 | 23 | 24 | 25 |
26 | 27 | 28 | 29 | 30 | 31 |
글 보관함