最終更新日:2022/12/06
正解を見る
(computing theory) A model-free reinforcement learning algorithm to learn a policy telling an agent what action to take under what circumstances.
編集履歴(0)
元となった辞書の項目
Q-learning
noun