[1]
А. Колногоров, A. Kolnogorov, А. Назин, A. Nazin, Д. Шиян, and D. Shiyan, “Two-armed bandit problem and batch version of the mirror descent algorithm”, mgta, vol. 13, no. 2, pp. 9-39, Oct. 2021.