Daniel Yekini - Applied Mathematics & Computing

Tic-Tac-Toe was my first hands-on project in adversarial AI. On the surface, it looks almost too trivial to be interesting. Yet, precisely because it is so simple, it became the perfect entry point into the world of decision-making algorithms. This project was my first encounter with the Minimax algorithm, and it taught me how recursive search can mimic intelligent behaviour in ways that appear surprisingly lifelike. Although the game itself is “solved” and every outcome can be predicted, working through the mechanics gave me a framework for thinking about games, strategies, and how agents anticipate each other’s actions.

Problem Statement / Motivation

I was introduced to AI concepts through Robert Miles’ discussions of AI safety, where examples like Tic-Tac-Toe are used to show how systems can be both simple and profound. I wanted to explore this duality: how a child’s game could illustrate such deep computational principles. My motivation was not to make the strongest Tic-Tac-Toe agent possible, but rather to understand why optimal play is guaranteed, how algorithms discover it, and what this says about adversarial reasoning more generally. Building this project gave me a tangible sense of how deterministic systems can create the illusion of adaptive, almost human-like decisions.

Features & Technical Details

Minimax Algorithm in Depth: The Minimax algorithm is built on the idea of anticipation. In a two-player, zero-sum environment, every move I make is assumed to be countered by the opponent’s best possible move. This forces the agent to simulate not only its own decisions but also the opponent’s responses. The recursive logic flows downward through the game tree, exploring all possible continuations, then flows upward as the algorithm assigns scores to each move based on the eventual outcomes.

V(s) = \begin{cases} +1 & \text{if state } s \text{ is a win for Max} \\\\ -1 & \text{if state } s \text{ is a win for Min} \\\\ 0 & \text{if state } s \text{ is a draw} \\\\ \max_{a \in A(s)} V(T(s,a)) & \text{if it’s Max’s turn} \\\\ \min_{a \in A(s)} V(T(s,a)) & \text{if it’s Min’s turn} \end{cases}

Where $T(s,a)$ is the new state reached by applying action $a$ in state $s$ .

In practical terms, the program generates the full game tree of Tic-Tac-Toe, evaluates terminal states (wins, losses, draws), and propagates those results upward to inform earlier moves. Because the state space is so small, the algorithm can search exhaustively with ease.

TicTacToe AI: Human vs AI Baseline

Problem Statement / Motivation

Features & Technical Details