projects
/
platform
/
core
/
ml
/
nntrainer.git
/ shortlog
commit
grep
author
committer
pickaxe
?
search:
re
summary
| shortlog |
log
|
commit
|
commitdiff
|
tree
first
⋅
prev
⋅
next
platform/core/ml/nntrainer.git
2024-07-03
Donghak PARK
[FP16][Tensor] Remove unnecessary copy on save
commit
|
commitdiff
|
tree
|
snapshot
2024-07-03
Donghyeon Jeong
[Tensor] Remove NaN check for integer
commit
|
commitdiff
|
tree
|
snapshot
2024-07-02
Donghyeon Jeong
[CI] Add PR review from clang-format
commit
|
commitdiff
|
tree
|
snapshot
2024-07-02
heka1024
[Layer] Introduce `upsample2d` layer
commit
|
commitdiff
|
tree
|
snapshot
2024-07-02
Debadri Samaddar
[blas/OpenCL] Updated doxygen docs
commit
|
commitdiff
|
tree
|
snapshot
2024-07-02
Debadri Samaddar
[blas/OpenCL] Added multiply OpenCL kernel and unit...
commit
|
commitdiff
|
tree
|
snapshot
2024-07-01
Donghak PARK
[Trivial] Fix Typo
commit
|
commitdiff
|
tree
|
snapshot
2024-06-28
skykongkong8
[ hgemm/trivial ] Use aligned memory allocation in...
commit
|
commitdiff
|
tree
|
snapshot
2024-06-28
skykongkong8
[ BLAS ] Implement transpose case functions for K=1...
commit
|
commitdiff
|
tree
|
snapshot
2024-06-28
skykongkong8
[ hgemm ] Consider K=1 changes
commit
|
commitdiff
|
tree
|
snapshot
2024-06-28
Donghyeon Jeong
[Layer] Fix logic: SwiGLU Layer Training Incompatibility
commit
|
commitdiff
|
tree
|
snapshot
2024-06-28
skykongkong8
[ hgemm ] Use aligned memory allocation in transpose...
commit
|
commitdiff
|
tree
|
snapshot
2024-06-28
skykongkong8
[ hgemm ] Use zero padding in Non-8-divisible GEMM...
commit
|
commitdiff
|
tree
|
snapshot
2024-06-28
skykongkong8
[ hgemm/trivial ] Wrap multi-line expressions
commit
|
commitdiff
|
tree
|
snapshot
2024-06-27
skykongkong8
[ trivial ] Fix typo
commit
|
commitdiff
|
tree
|
snapshot
2024-06-25
Niket Agarwal
[GPU/OpenCL] Initial version of SwiGLU Layer with OpenC...
commit
|
commitdiff
|
tree
|
snapshot
2024-06-25
yash.singh
[GPU/OpenCL] Added fp16 support for Addition Layer...
commit
|
commitdiff
|
tree
|
snapshot
2024-06-25
yash.singh
[GPU/OpenCL] Addition Kernel added in reusable blas...
commit
|
commitdiff
|
tree
|
snapshot
2024-06-25
yash.singh
[GPU/OpenCL] Initial version of Addition Layer with...
commit
|
commitdiff
|
tree
|
snapshot
2024-06-25
Jubilee.Yang
[DOCS] Update README.md
commit
|
commitdiff
|
tree
|
snapshot
2024-06-25
Eunju Yang
[Docs/trivial] fix typo in main `README.md`
commit
|
commitdiff
|
tree
|
snapshot
2024-06-25
Eunju Yang
[Docs] add recent proceeding to main README.md
commit
|
commitdiff
|
tree
|
snapshot
2024-06-25
MyungJoo Ham
action/ubuntu: fp16 on/off handled by matrix
commit
|
commitdiff
|
tree
|
snapshot
2024-06-21
skykongkong8
[ layer ] Bugfix for enabling unittest_models on Android
commit
|
commitdiff
|
tree
|
snapshot
2024-06-21
Seungbaek Hong
[Application] Bug fix about meson setting
commit
|
commitdiff
|
tree
|
snapshot
2024-06-20
Debadri Samaddar
[refactor] Moved blas_kernels to tensor directory
commit
|
commitdiff
|
tree
|
snapshot
2024-06-20
Debadri Samaddar
[refactor] Removed experimental OpenCL kernel files
commit
|
commitdiff
|
tree
|
snapshot
2024-06-20
MyungJoo Ham
actions: gbs build test
commit
|
commitdiff
|
tree
|
snapshot
2024-06-20
MyungJoo Ham
action: Yocto devtool test
commit
|
commitdiff
|
tree
|
snapshot
2024-06-20
heka1024
[CI] Add fp-16 build in github action
commit
|
commitdiff
|
tree
|
snapshot
2024-06-20
MyungJoo Ham
action: add Ubuntu pdebuild
commit
|
commitdiff
|
tree
|
snapshot
2024-06-20
skykongkong8
[ layer ] Optimize LSTM fp16 computation
commit
|
commitdiff
|
tree
|
snapshot
2024-06-20
skykongkong8
[ Tensor ] Implement add_i_partial
commit
|
commitdiff
|
tree
|
snapshot
2024-06-19
heka1024
[Doc] Update activation function in `README.md`
commit
|
commitdiff
|
tree
|
snapshot
2024-06-19
MyungJoo Ham
android: consistant ML_API_COMMON macro
commit
|
commitdiff
|
tree
|
snapshot
2024-06-19
MyungJoo Ham
action: add Android build test
commit
|
commitdiff
|
tree
|
snapshot
2024-06-19
MyungJoo Ham
action: add check if rebuild required module
commit
|
commitdiff
|
tree
|
snapshot
2024-06-19
skykongkong8
[ hgemm ] Use hgemm kernel in transpose cases
commit
|
commitdiff
|
tree
|
snapshot
2024-06-19
Donghyeon Jeong
[trivial] fix typo error
commit
|
commitdiff
|
tree
|
snapshot
2024-06-13
wchang kim
Fixed the build error for gcc-14
commit
|
commitdiff
|
tree
|
snapshot
2024-06-10
skykongkong8
[ layer ] Enable mha gtest and match version
commit
|
commitdiff
|
tree
|
snapshot
2024-06-10
Debadri Samaddar
[bugfix/unittest] Using LayerSemanticsGpu for FC Layer...
commit
|
commitdiff
|
tree
|
snapshot
2024-06-10
skykongkong8
[ docs ] Add lldb-server debugger guide file
commit
|
commitdiff
|
tree
|
snapshot
2024-06-10
skykongkong8
[ neon/trivial ] Compare float scaling factors more...
commit
|
commitdiff
|
tree
|
snapshot
2024-06-10
skykongkong8
[ Trivial ] Fix typo and use better iterating index
commit
|
commitdiff
|
tree
|
snapshot
2024-06-10
skykongkong8
[ hgemm ] Support scaling factor beta in kernel-based...
commit
|
commitdiff
|
tree
|
snapshot
2024-06-10
skykongkong8
[ hgemm ] Consider tiny gemm case
commit
|
commitdiff
|
tree
|
snapshot
2024-06-08
hyeonseok
[acti_func] implement quick gelu
commit
|
commitdiff
|
tree
|
snapshot
2024-06-04
Debadri Samaddar
[blas/neon] isamax edge cases unit tests
commit
|
commitdiff
|
tree
|
snapshot
2024-06-04
Debadri Samaddar
[blas/neon] isamax improvement for larger input length
commit
|
commitdiff
|
tree
|
snapshot
2024-06-04
skykongkong8
[ trivial ] Add doxygen tags in matrix transpose functions
commit
|
commitdiff
|
tree
|
snapshot
2024-06-04
skykongkong8
[ BLAS ] Support non-4-divisible case in matrix transpose
commit
|
commitdiff
|
tree
|
snapshot
2024-06-04
skykongkong8
[ Tensor ] Use SIMD accelerated transpose if possible
commit
|
commitdiff
|
tree
|
snapshot
2024-06-04
skykongkong8
[ blas ] Add transpose_matrix function
commit
|
commitdiff
|
tree
|
snapshot
2024-06-04
skykongkong8
[ matrix_transpose_neon ] Implement NEON-accelereated...
commit
|
commitdiff
|
tree
|
snapshot
2024-06-04
Debadri Samaddar
[GPU/OpenCL] Added fp16 support for FC layer on GPU
commit
|
commitdiff
|
tree
|
snapshot
2024-06-04
Debadri Samaddar
[unittest/gpu] Added LayerSemanticsGpu test suite
commit
|
commitdiff
|
tree
|
snapshot
2024-06-03
Seungbaek Hong
[Application] yolo v2 bug fix
commit
|
commitdiff
|
tree
|
snapshot
2024-06-01
Debadri Samaddar
[GPU/OpenCL] fp16(half) support
commit
|
commitdiff
|
tree
|
snapshot
2024-05-24
Debadri Samaddar
[goldendata] Added script to generate Swiglu data
commit
|
commitdiff
|
tree
|
snapshot
2024-05-23
Seungbaek Hong
[Application] update yolo v2 modeling
commit
|
commitdiff
|
tree
|
snapshot
2024-05-23
Debadri Samaddar
[bugfix/refactor] OpenCL buffer creation fix and optimi...
commit
|
commitdiff
|
tree
|
snapshot
2024-05-23
Debadri Samaddar
[bugfix] Used global memmory for result in dot_cl kernel
commit
|
commitdiff
|
tree
|
snapshot
2024-05-23
Debadri Samaddar
[bugfix] Renamed variables in unittest of FC Layer
commit
|
commitdiff
|
tree
|
snapshot
2024-05-23
Debadri Samaddar
[GPU/OpenCL] Resuable blas OpenCL kernels
commit
|
commitdiff
|
tree
|
snapshot
2024-05-23
Debadri Samaddar
[unittest] Added test for incremental forwarding for...
commit
|
commitdiff
|
tree
|
snapshot
2024-05-23
Debadri Samaddar
[GPU/OpenCL] Initial version of FC Layer with OpenCL ops
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
skykongkong8
[ Trivial ] Remove redundant comments and format
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
skykongkong8
[ hgemm ] Refactor kernel init process
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
skykongkong8
[ hgemm/bugfix ] Adaptive macro kernel usage in 4x4...
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
skykongkong8
[ hgemm ] Apply acc16 partial sum strategy and adaptive...
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
skykongkong8
[ hgemm ] Apply ACC16 partial sum strategy & adaptive...
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
skykongkong8
[ hgemm ] Apply macro kernel in 4x4 noTrans
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
skykongkong8
[ hgemm ] Add 4x4 kernel-using f16-f32 hgemm_noTrans
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
skykongkong8
[ hgemm ] Implement 4x4 f16-f32 kernel
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
Udit Jain
Edited build instructions for Resnet18 test
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
Seungbaek Hong
[Trivial] Update gitignore file
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
Donghyeon Jeong
[coverity] fix coverity issue
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
Donghyeon Jeong
[bugfix] Fix LoRA indices array size in the FC layer
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
Seungbaek Hong
[Application] update yolo v2 python for building pre...
commit
|
commitdiff
|
tree
|
snapshot
2024-05-22
hyunil park
[Nnstreamer-subplugin] Add save_path to setProperty
commit
|
commitdiff
|
tree
|
snapshot
2024-05-20
Seungbaek Hong
[Application] cuda support for example of pytorch yolo v2
commit
|
commitdiff
|
tree
|
snapshot
2024-05-16
Seungbaek Hong
[Application] Rename yolo -> yolo v2
commit
|
commitdiff
|
tree
|
snapshot
2024-05-13
Debadri Samaddar
[hgemm] Optimizing dimension checks using bitmask
commit
|
commitdiff
|
tree
|
snapshot
2024-05-13
Debadri Samaddar
[hgemm] Added K divisible condition for 1x8 and 1x4...
commit
|
commitdiff
|
tree
|
snapshot
2024-05-13
Debadri Samaddar
[hgemm] Interchanged hgemm_noTrans_1x8 and hgemm_noTran...
commit
|
commitdiff
|
tree
|
snapshot
2024-05-10
skykongkong8
[ hdot ] Use precision-enhanced hdot
commit
|
commitdiff
|
tree
|
snapshot
2024-05-09
Donghak PARK
[Trivial] Removing unnecessary files from the repo...
commit
|
commitdiff
|
tree
|
snapshot
2024-05-09
Donghak PARK
[CI] Remove Pylinter in CI
commit
|
commitdiff
|
tree
|
snapshot
2024-05-07
Seungbaek Hong
[Application] fix LLaMA application example error
commit
|
commitdiff
|
tree
|
snapshot
2024-05-06
Seungbaek Hong
[Application] Update weights_converter
commit
|
commitdiff
|
tree
|
snapshot
2024-05-04
jijoong.moon
[ NEURALNET ] change the loss scale property to Rigid...
commit
|
commitdiff
|
tree
|
snapshot
2024-05-04
jijoong.moon
[ Weight ] split variable dim and grad dim to set separ...
commit
|
commitdiff
|
tree
|
snapshot
2024-05-04
jijoong.moon
[ Weight ] Add Loss Scale factor in Weight
commit
|
commitdiff
|
tree
|
snapshot
2024-05-04
Jiho Chu
[Property] Add loss scale property
commit
|
commitdiff
|
tree
|
snapshot
2024-05-03
MyungJoo Ham
meson: fix fp16 support conditions for arm/aarch64
commit
|
commitdiff
|
tree
|
snapshot
2024-05-02
Seungbaek Hong
[Wait for #2536][application] add generate_multiple_tok...
commit
|
commitdiff
|
tree
|
snapshot
2024-05-01
kimhan0515
Add SELU activation function
commit
|
commitdiff
|
tree
|
snapshot
2024-05-01
skykongkong8
[ hnrm2 ] Use precision-enhanced hscal
commit
|
commitdiff
|
tree
|
snapshot
2024-05-01
Jaeyun Jung
[Build] dependency to api
commit
|
commitdiff
|
tree
|
snapshot
next