最終更新日:2024/08/08

(probability theory, machine learning) An algorithm that allocates a fixed limited set of resources between competing alternative choices so as to maximize the expected gain, when each choice's properties are only partially known at the time of allocation, and may become better understood as time passes or allocations are made.

正解を見る

multi-armed bandit

編集履歴(0)

Dictionary quizzes to help you remember vocabulary

編集履歴(0)

ログイン / 新規登録

 

アプリをダウンロード!
DiQt

DiQt(ディクト)

無料

★★★★★★★★★★