pub struct Utf8Chunk<'a> {
valid: &'a str,
invalid: &'a [u8],
}
🔬This is a nightly-only experimental API. (
utf8_chunks
#99543)Expand description
An item returned by the Utf8Chunks
iterator.
A Utf8Chunk
stores a sequence of u8
up to the first broken character
when decoding a UTF-8 string.
Examples
#![feature(utf8_chunks)]
use std::str::Utf8Chunks;
// An invalid UTF-8 string
let bytes = b"foo\xF1\x80bar";
// Decode the first `Utf8Chunk`
let chunk = Utf8Chunks::new(bytes).next().unwrap();
// The first three characters are valid UTF-8
assert_eq!("foo", chunk.valid());
// The fourth character is broken
assert_eq!(b"\xF1\x80", chunk.invalid());
RunFields§
§valid: &'a str
🔬This is a nightly-only experimental API. (
§utf8_chunks
#99543)invalid: &'a [u8]
🔬This is a nightly-only experimental API. (
utf8_chunks
#99543)Implementations§
source§impl<'a> Utf8Chunk<'a>
impl<'a> Utf8Chunk<'a>
sourcepub fn valid(&self) -> &'a str
🔬This is a nightly-only experimental API. (utf8_chunks
#99543)
pub fn valid(&self) -> &'a str
utf8_chunks
#99543)Returns the next validated UTF-8 substring.
This substring can be empty at the start of the string or between broken UTF-8 characters.
sourcepub fn invalid(&self) -> &'a [u8] ⓘ
🔬This is a nightly-only experimental API. (utf8_chunks
#99543)
pub fn invalid(&self) -> &'a [u8] ⓘ
utf8_chunks
#99543)Returns the invalid sequence that caused a failure.
The returned slice will have a maximum length of 3 and starts after the
substring given by valid
. Decoding will resume after this sequence.
If empty, this is the last chunk in the string. If non-empty, an unexpected byte was encountered or the end of the input was reached unexpectedly.
Lossy decoding would replace this sequence with U+FFFD REPLACEMENT CHARACTER
.
Trait Implementations§
source§impl<'a> PartialEq<Utf8Chunk<'a>> for Utf8Chunk<'a>
impl<'a> PartialEq<Utf8Chunk<'a>> for Utf8Chunk<'a>
impl<'a> StructuralEq for Utf8Chunk<'a>
impl<'a> StructuralPartialEq for Utf8Chunk<'a>
Auto Trait Implementations§
impl<'a> RefUnwindSafe for Utf8Chunk<'a>
impl<'a> Send for Utf8Chunk<'a>
impl<'a> Sync for Utf8Chunk<'a>
impl<'a> Unpin for Utf8Chunk<'a>
impl<'a> UnwindSafe for Utf8Chunk<'a>
Blanket Implementations§
source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere T: ?Sized,
source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
Mutably borrows from an owned value. Read more