SSE4.1 optimization of cv::Moments for CV_16U