unicode: reduce the size of utf8data[]
authorOlaf Weber <olaf@sgi.com>
Thu, 25 Apr 2019 17:49:18 +0000 (13:49 -0400)
committerTheodore Ts'o <tytso@mit.edu>
Thu, 25 Apr 2019 17:49:18 +0000 (13:49 -0400)
commita8384c68797ee022f5fd7bcef5f4cc57863d4042
tree5cab2a7a1e9aa4a284354236868aed6eccfb5d5c
parent44594c2fbf42528001dfb1597d26adb40ba6d178
unicode: reduce the size of utf8data[]

Remove the Hangul decompositions from the utf8data trie, and do
algorithmic decomposition to calculate them on the fly. To store the
decomposition the caller of utf8lookup()/utf8nlookup() must provide a
12-byte buffer, which is used to synthesize a leaf with the
decomposition. This significantly reduces the size of the utf8data[]
array.

Changes made by Gabriel:
  Rebase to mainline
  Fix checkpatch errors
  Extract robustness fixes and merge back to original mkutf8data.c patch
  Regenerate utf8data.h

Signed-off-by: Olaf Weber <olaf@sgi.com>
Signed-off-by: Gabriel Krisman Bertazi <krisman@collabora.co.uk>
Signed-off-by: Theodore Ts'o <tytso@mit.edu>
fs/unicode/README.utf8data
fs/unicode/utf8-norm.c
fs/unicode/utf8data.h
fs/unicode/utf8n.h
scripts/mkutf8data.c