The aim and goal of this study is to optimize the parameters used for reinforcement learning. Since the study is still in the preprint stage, the search for methods and method developments suitable for optimization techniques are still ongoing.