java.lang.Object
  edu.iastate.jrelm.demo.bandit.FiveArmedBandit
public class FiveArmedBandit
Payout mechanism. All you have to do is pull one of the arms and receive a payout. Each arm has a distribution of three different payouts that are delivered with different probabilities. The expected values for each arm are as follows:

arm 0 : 210
arm 1 : 510
arm 2 : 595
arm 3 : 147.5
arm 4 : 790

The GamblerAgent, using a RothErevLearner, should eventually learn that the last arm (arm 4) is the best to pull. We expect the history of arm choices to converge to 4.
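The expected value of an arm is the probability-weighted sum of its three payouts. The sketch below shows that arithmetic only. The class name `ExpectedPayoutSketch`, the method `expectedValue`, and the payout and probability values are all invented for illustration; this page does not list the actual distributions used by FiveArmedBandit.

```java
class ExpectedPayoutSketch {
    // Expected value of a discrete payout distribution:
    // the sum over outcomes of payout * probability.
    static double expectedValue(double[] payouts, double[] probabilities) {
        if (payouts.length != probabilities.length) {
            throw new IllegalArgumentException(
                "payouts and probabilities must have the same length");
        }
        double sum = 0.0;
        for (int i = 0; i < payouts.length; i++) {
            sum += payouts[i] * probabilities[i];
        }
        return sum;
    }

    public static void main(String[] args) {
        // Invented three-payout arm; NOT the values used by FiveArmedBandit.
        double[] payouts = {100.0, 400.0, 800.0};
        double[] probabilities = {0.5, 0.25, 0.25};
        System.out.println(expectedValue(payouts, probabilities)); // prints 350.0
    }
}
```

Applying the same calculation to each arm's real distribution would presumably give the five expected values listed above.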
Constructor Summary | |
---|---|
FiveArmedBandit() |
Method Summary | |
---|---|
double | pullArm(int armChoice) |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public FiveArmedBandit()
Method Detail |
---|
public double pullArm(int armChoice)
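One way to check the claim that arm 4 pays best is to pull each arm many times and compare the average payouts. The following is a minimal sketch of that idea, not part of the JReLM API. `ArmSamplingSketch`, `averagePayout`, and `bestArm` are hypothetical names, and the stand-in bandit simply returns each arm's documented expected value on every pull so the example runs without the library. Because `pullArm(int)` returns a `double`, a real instance could likely be passed as `bandit::pullArm`.

```java
import java.util.function.IntToDoubleFunction;

class ArmSamplingSketch {
    // Average payout observed over `pulls` pulls of a single arm.
    static double averagePayout(IntToDoubleFunction pullArm, int arm, int pulls) {
        if (pulls <= 0) {
            throw new IllegalArgumentException("pulls must be positive");
        }
        double total = 0.0;
        for (int i = 0; i < pulls; i++) {
            total += pullArm.applyAsDouble(arm);
        }
        return total / pulls;
    }

    // Index of the arm with the highest observed average payout.
    static int bestArm(IntToDoubleFunction pullArm, int armCount, int pullsPerArm) {
        int best = 0;
        double bestAverage = averagePayout(pullArm, 0, pullsPerArm);
        for (int arm = 1; arm < armCount; arm++) {
            double average = averagePayout(pullArm, arm, pullsPerArm);
            if (average > bestAverage) {
                bestAverage = average;
                best = arm;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        // Stand-in for FiveArmedBandit.pullArm: deterministic payouts equal to
        // the expected values listed in the class description (an assumption
        // made so the sketch is self-contained; the real arms are random).
        double[] expected = {210.0, 510.0, 595.0, 147.5, 790.0};
        IntToDoubleFunction standIn = arm -> expected[arm];

        System.out.println(bestArm(standIn, expected.length, 10)); // prints 4
    }
}
```

With the real, randomized bandit the averages would only approximate the expected values, so a larger `pullsPerArm` would be needed before the ranking becomes stable.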