Homec4science

Reject nonminmal representations of UTF8 at the beginning of the 3-byte BMP…

Authored by epriestley <git@epriestley.com> on Feb 24 2014, 01:20.

Description

Reject nonminmal representations of UTF8 at the beginning of the 3-byte BMP range

Summary: Ref T1191. These byte ranges (\xE0\x80\x80 through \xE0\x9F\xBF) are alternate representations of characters with a different preferred minimal representation. MySQL and mbstring both reject them, and we should too.

Test Plan: Ran unit tests.

Reviewers: btrahan, arice

Reviewed By: arice

CC: chad, arice, aran

Maniphest Tasks: T1191

Differential Revision: https://secure.phabricator.com/D8313

Details

Committed
epriestley <git@epriestley.com>Feb 24 2014, 01:20
Pushed
aubortMar 17 2017, 12:03
Parents
rPHUba77b3501a15: Implement phutil_is_utf8_with_only_bmp_characters() without segfaulting
Branches
Unknown
Tags
Unknown

Event Timeline

epriestley <git@epriestley.com> committed rPHU563e600eba85: Reject nonminmal representations of UTF8 at the beginning of the 3-byte BMP… (authored by epriestley <git@epriestley.com>).Feb 24 2014, 01:20