Checkout
Personalized AI apps
Build multi-agent systems without code and automate document search, RAG and content generation
Start free trial
Question

Thompson Sampling - What is normal Thompson sampling?

Answer

Regular Thompson As a policy, sampling chooses an arm depending on the likelihood that it is the optimal decision. For each ˆθi(t) = (¯xi(t)),Si(t)), the posterior π(µi|ˆθi(t)) determines this probability separately.