We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
#这是你的源代码 def __update(self, value: float): """ 更新节点的访问次数 `N(s, a)`、节点的累计平均奖赏 `Q(s, a)` Parameters ---------- value: float 用来更新节点内部数据 """ self.Q = (self.N * self.Q + value)/(self.N + 1) self.N += 1 def backup(self, value: float): """ 反向传播 """ if self.parent: self.parent.backup(-value) #there is the negative sign, why ? self.__update(value)
thanks advance !
The text was updated successfully, but these errors were encountered:
No branches or pull requests
thanks advance !
The text was updated successfully, but these errors were encountered: