最終更新日:2022/12/07
正解を見る
(computing theory) A model-free reinforcement learning algorithm to learn a policy telling an agent what action to take under what circumstances.
編集履歴(0)
元となった辞書の項目
Q-learning
noun