元となった辞書の項目
multi-armed bandit
名詞
(probability
theory,
machine
learning)
An
algorithm
that
allocates
a
fixed
limited
set
of
resources
between
competing
alternative
choices
so
as
to
maximize
the
expected
gain,
when
each
choice's
properties
are
only
partially
known
at
the
time
of
allocation,
and
may
become
better
understood
as
time
passes
or
allocations
are
made.
日本語の意味
限られた資源を複数の選択肢(腕)に配分し、各選択肢の特性が部分的にしか把握されていない状況で、期待される収益を最大化するためのアルゴリズムである。 / 実験と活用のバランスをとるため、各選択肢の性質が順次明らかになる中で、最適な選択を導くための手法と解釈できる。
意味(1)
(probability
theory,
machine
learning)
An
algorithm
that
allocates
a
fixed
limited
set
of
resources
between
competing
alternative
choices
so
as
to
maximize
the
expected
gain,
when
each
choice's
properties
are
only
partially
known
at
the
time
of
allocation,
and
may
become
better
understood
as
time
passes
or
allocations
are
made.
( plural )