use universal intrinsic for FP16
authorTomoaki Teshima <tomoaki.teshima@gmail.com>
Fri, 2 Sep 2016 12:57:46 +0000 (21:57 +0900)
committerTomoaki Teshima <tomoaki.teshima@gmail.com>
Sun, 4 Sep 2016 23:13:52 +0000 (08:13 +0900)
commit903789f7afc7fd88e28c9bc62cdbbe00e911b972
tree44c8b7efed1ee4e7aa93ae2d3ed7f497399c4961
parent28db4a220741b33ba59a30fac6dd8e470f444cf6
use universal intrinsic for FP16
  * use v_float16x4 (universal intrinsic) instead of raw SSE/NEON implementation
  * define v_load_f16/v_store_f16 since v_load can't be distinguished when short pointer passed
  * brush up implementation on old compiler (guard correctly)
  * add test for v_load_f16 and round trip conversion of v_float16x4
  * fix conversion error
modules/core/include/opencv2/core/hal/intrin_neon.hpp
modules/core/include/opencv2/core/hal/intrin_sse.hpp
modules/core/src/convert.cpp
modules/core/test/test_intrin.cpp