원문을 보면 AlphaDev를 만든 연구진들이 그런 의도를 갖고 학습시켰다는 걸로 보이네요.

We focused on improving sorting algorithms for shorter sequences of three to five elements. These algorithms are among the most widely used because they are often called many times as a part of larger sorting functions. Improving these algorithms can lead to an overall speedup for sorting any number of items.