reinforcement-learning 3 Tighter Time-Uniform Concentration Inequality for the UCB Algorithm Jan 3, 2024 Correct Proof for the UCB Algorithm Dec 26, 2023 Multi-Armed Bandit and UCB Algorithm Nov 1, 2023