Tags: simdutf/simdutf
Tags
Version 7.7.0 (release candidate) (#863) * Return more information from utf8_length_from_utf16_with_replacement (#860) The next step after utf8_length_from_utf16_with_replacement is almost always going to be to allocate a UTF-8 buffer and then convert the string. Sadly, we have to insert a third pass, to_well_formed_utf16, which converts the unpaired surrogates. Since surrogates are relatively rare, and the _with_replacement functions have already scanned the input, we could skip the conversion if we were given this information along with the utf-8 length. In my measurements on Icelake this doesn't slow down utf8_length_from_utf16_with_replacement at all. * lint * better documentation. * version bump. * [no-ci] minor simplification * correct macro name. (!!!) * removing silly space --------- Co-authored-by: Erik Corry <erik@arbat.com> Co-authored-by: Daniel Lemire <dlemire@lemire.me>
PreviousNext