Checkout
Start free trial
Take Naologic for a spin today, no credit card needed and no obligations.
Start free trial
Question

Thompson Sampling - What is normal Thompson sampling?

Answer

Regular Thompson As a policy, sampling chooses an arm depending on the likelihood that it is the optimal decision. For each ˆθi(t) = (¯xi(t)),Si(t)), the posterior π(µi|ˆθi(t)) determines this probability separately.