ARM: support different levels of loop unrolling in bilinear scaler