The optimal state-value function is the maximum state-value function over all policies. In other words, is the resulting state-value function when the agent follows the optimal policy (treat as a parameter in the equation below).