Given learning agent A, sample frozen opponent B from set of canidates C with probability
Where is some weighting function
Choosing make PFSP focus on the hardest players where scales the distribution.
Choosing means that the agent prefers opponents around its own level, which is best for agents which are much weaker overall than the strongest agents.
In the Transitive Cyclic Decomposition paper, this method is criticized because it contracts the space of agents rather than expands it. More precisely, it lowers the Effective diversity of the strategies.
In particular, in the disk game, this method contracts the space of solutions instead of growing them (this method is version C in the diagram).
They instead propose Response to Rectified Nash to grow the landscape.