Multiply only for Transform Matrix + NEON comment clean up 36/288536/8
authorEunki Hong <eunkiki.hong@samsung.com>
Sat, 18 Feb 2023 07:05:50 +0000 (16:05 +0900)
committerEunki, Hong <eunkiki.hong@samsung.com>
Mon, 20 Feb 2023 12:22:05 +0000 (21:22 +0900)
commit240fbf98d20a5e2537c4664b25749e5e0f8a22cc
treeaa7953254dce87a3d84070d56f908e4d7514da05
parent9c6cd5cbe26eb355f2f0daeaae85a28a8d81a3b3
Multiply only for Transform Matrix + NEON comment clean up

If 4x4 matrix form as Transform, we can optimize matrix multiply function.
It will be reduce the time of Transform Update time.

Below are some test result.
1. VLD1.F32 each time is more faster than VLDM.
2. Transpose lhs -> multply -> transpose tmp is slower than current logic
3. "+r"(temp) at Output Operand is slower than "r"(temp) Intput Oprerand with "%r0"(why?)
 --> But when we make current Multiply with Output Operand as Input Operand, it makes slow down. (why?)

Change-Id: Ibc5e1c252ec200d356e649ed6448cd45b3a5d980
Signed-off-by: Eunki Hong <eunkiki.hong@samsung.com>
automated-tests/src/dali-internal/utc-Dali-Internal-MatrixUtils.cpp
dali/internal/common/matrix-utils.cpp
dali/internal/common/matrix-utils.h
dali/internal/event/actors/actor-coords.cpp
dali/internal/update/manager/render-instruction-processor.cpp
dali/internal/update/manager/transform-manager.cpp
dali/internal/update/render-tasks/scene-graph-camera.cpp