Module core::str::validations
source · Expand description
Operations related to UTF-8 validation.
Constants
- Mask of the value bits of a continuation byte.
Functions
- Returns
trueif any byte in the wordxis nonascii (>= 128). - next_code_point⚠ExperimentalReads the next code point out of a byte iterator (assuming a UTF-8-like encoding).
- Reads the last code point out of a byte iterator (assuming a UTF-8-like encoding).
- Walks through
vchecking that it’s a valid UTF-8 sequence, returningOk(())in that case, or, if it is invalid,Err(err). - Returns the value of
chupdated with continuation bytebyte. - utf8_char_widthExperimentalGiven a first byte, determines how many bytes are in this UTF-8 character.
- Returns the initial codepoint accumulator for the first byte. The first byte is special, only want bottom 5 bits for width 2, 4 bits for width 3, and 3 bits for width 4.
- Checks whether the byte is a UTF-8 continuation byte (i.e., starts with the bits
10).