
Reference data for conversion tests
===================================

SPDX-FileType: DOCUMENTATION
SPDX-FileCopyrightText: NONE
SPDX-License-Identifier: CC0-1.0


Input files ("*.raw")
---------------------
Input data that is feeded into the library by the test suite.

"US-ASCII.raw"   : All octets from US-ASCII subset of Unicode
"UTF-8.raw"      : Contains codepoint beyond the BMP without surrogate-pair
                   Should be accepted for error tolerance
"CESU-8.raw"     : Contains surrogate-pair as defined for CESU-8
"NUL_*.raw"      : Contain NUL control characters at different positions
                   NUL should terminate conversion (if not ignored on request)
                   Incomplete sequences at the end should not be consumed
"NONIDENT_*.raw" : Contain invalid characters at different positions
                   Nonidentical conversions should be executed on request
"UTF-7_plus.raw" : Contains an UTF-7 "+-" sequence for literal '+'
"UTF-7_shift.raw": Contains UTF-7 shift sequences mixed with direct encoding
"UTF-7_eof.raw"  : Contains an UTF-7 shift sequence that is terminated by EOF
                   The shift mark '+' is the first octet.
"UTF-7_plus.raw" : Contains an UTF-7 shift sequence with a surrogate-pair.
"UTF-7_empty.raw": Contains the empty UTF-7 shift sequence "+ ".
"UTF-7_NI_*.raw" : Contain invalid characters at different positions
                   Nonidentical conversions should be executed on request


Reference files ("*.utf8")
--------------------------
UTF-8 reference files. Used to compare with the output of the library.
These files contain '�' (Unicode codepoint U+FFFD) for non-identical
conversions.
