( 본 글은 OpenAI Spinning Up을 개인적으로 정리한 글입니다. 원본) Part 3: Intro to Policy Optimization — Spinning Up documentation In this section, we’ll discuss the mathematical foundations of policy optimization algorithms, and connect the material to sample code. We will cover three key results in the theory of policy gradients: In the end, we’ll tie those results together and desc spinningup.openai.com 이번 글에서는..
(본 글은 OpenAI Spinning Up을 개인적으로 정리한 글입니다. 원본) Part 2: Kinds of RL Algorithms — Spinning Up documentation We’ll start this section with a disclaimer: it’s really quite hard to draw an accurate, all-encompassing taxonomy of algorithms in the modern RL space, because the modularity of algorithms is not well-represented by a tree structure. Also, to make somethin spinningup.openai.com RL Algorithm의 ..
- Total
- Today
- Yesterday
- Off-policy
- ColorStream
- Pipeline
- Expression Blend 4
- Policy Gradient
- 강화학습
- End-To-End
- SketchFlow
- Kinect
- ai
- processing
- PowerPoint
- Distribution
- arduino
- RL
- Kinect for windows
- Offline RL
- windows 8
- Windows Phone 7
- 파이썬
- Variance
- Gan
- DepthStream
- Kinect SDK
- 딥러닝
- dynamic programming
- 한빛미디어
- reward
- TensorFlow Lite
- bias
일 | 월 | 화 | 수 | 목 | 금 | 토 |
---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | |||
5 | 6 | 7 | 8 | 9 | 10 | 11 |
12 | 13 | 14 | 15 | 16 | 17 | 18 |
19 | 20 | 21 | 22 | 23 | 24 | 25 |
26 | 27 | 28 | 29 | 30 | 31 |