Home temporal-difference

temporal-difference

时间差学习中的双重计数

xiaolong · 2025年4月10日 · 0 Comment

我正在研究一个时间差学习的例子（https://www…

Gradient Temporal Difference Lambda without Function Approximation

xiaolong · 2025年4月10日 · 0 Comment

在每一种GTD(λ)的形式化描述中，似乎都将其定义为基…

Q-learning与时间差分与基于模型的强化学习

xiaolong · 2025年4月8日 · 0 Comment

我在大学里上了一门名为“智能机器”的课程。我们被介绍了…

实现TD-Gammon算法

xiaolong · 2025年4月4日 · 0 Comment

我正在尝试实现Gerald Tesauro在TD-Ga…

神经网络与时间差分学习

xiaolong · 2025年4月4日 · 0 Comment

我读了几篇关于时间差分学习的论文和讲座（其中一些与神经…

使用Delphi/Pascal实现TD(λ) (时序差分学习)

xiaolong · 2025年4月2日 · 0 Comment

我有一个玩井字游戏的神经网络，但尚未完成。我已完成的…

时序差分中的更新规则

xiaolong · 2025年4月2日 · 0 Comment

TD(0) Q-学习的更新规则： Q(t-1) = (…