Optimize short string parsing #76

chriso · 2021-03-22T04:41:03Z

We use the same SWAR technique used elsewhere in the repo to check 8 bytes for the end quote in a string, before falling back to bytes.IndexByte:

name           old time/op    new time/op    delta
CodeDecoder-4    5.30ms ± 3%    5.08ms ± 2%  -4.19%  (p=0.000 n=9+10)

name           old speed      new speed      delta
CodeDecoder-4   366MB/s ± 2%   382MB/s ± 2%  +4.36%  (p=0.000 n=9+10)

IndexByte is pretty quick, but it still has to load the input byte into an XMM register and shuffle it around before starting the routine, and then it also has various length and CPU checks to see whether to engage AVX2 or SSE. You also have the function call overhead to consider.

I think it's worth always checking the first chunk of bytes and paying the minor penalty each time you parse a string — it's common to see short strings in object keys, for example.

chriso added 2 commits March 22, 2021 14:32

Optimize parsing of short strings

8835ad0

Merge branch 'master' into short-string-parsing

f2cfd4a

chriso requested a review from achille-roussel March 22, 2021 04:49

achille-roussel approved these changes Mar 22, 2021

View reviewed changes

chriso merged commit 1e9a692 into master Mar 22, 2021

chriso deleted the short-string-parsing branch March 22, 2021 04:58

chriso mentioned this pull request Dec 8, 2021

Optimize short string parsing (part 2) #113

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize short string parsing #76

Optimize short string parsing #76

chriso commented Mar 22, 2021 •

edited

Loading

Optimize short string parsing #76

Optimize short string parsing #76

Conversation

chriso commented Mar 22, 2021 • edited Loading

chriso commented Mar 22, 2021 •

edited

Loading