The AI uses a variation of the mini-max algorithm, known as the expectimax algorithm, which takes into account the probability of any given event. For each move, it evaluates the probability of every ...
A clean, well-documented reinforcement learning framework for training AI agents to play 2048. This project features multiple RL algorithms, comprehensive documentation, and professional code quality ...