Monday, October 06, 2014

[ooqhkwzv] UTF-8 large integers

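UTF-8 could be extended to encode arbitrarily large positive integers, but it would be awkward.  UTF-8 gives the length of a multi-byte sequence in unary, as the run of 1 bits at the start of the first byte.  For a long enough encoding, that unary length would spill across many leading bytes, not just one.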
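To put rough numbers on the awkwardness, here is a small sketch under one assumed layout, which is only a guess at how such an extension might look: a k-byte sequence starts with k one-bits and a terminating zero bit, and every byte after the header is an ordinary 10xxxxxx continuation byte carrying 6 payload bits. The names header_bytes and payload_bits are made up for illustration, and the sketch ignores whether spilled header bytes could be mistaken for ASCII or continuation bytes.

```python
def header_bytes(k: int) -> int:
    """Leading bytes needed to hold the unary length prefix of a k-byte sequence."""
    if k < 2:
        raise ValueError("multi-byte sequences need k >= 2")
    prefix_bits = k + 1          # k one-bits, then a terminating zero bit
    return -(-prefix_bits // 8)  # ceiling division by 8


def payload_bits(k: int) -> int:
    """Payload capacity of a k-byte sequence under the assumed layout."""
    h = header_bytes(k)
    spare = 8 * h - (k + 1)      # bits left over in the header after the prefix
    return spare + 6 * (k - h)   # plus 6 bits per continuation byte


if __name__ == "__main__":
    for k in (2, 3, 4, 7, 8, 16, 64):
        print(k, header_bytes(k), payload_bits(k))
```

For k = 2, 3, 4 this matches real UTF-8 (11, 16, and 21 payload bits).  At k = 7 the first byte has no payload bits left, and from k = 8 on the prefix no longer fits in one byte.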

It would be better to convert the integer to base 64 and encode each digit as a single-byte character.
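A minimal sketch of the base 64 idea follows.  The digit alphabet (the RFC 4648 one: A-Z, a-z, 0-9, +, /) and the most-significant-digit-first order are assumptions made here, not something fixed above, and the function names are hypothetical.  Note that this treats the integer as a number in base 64, which is not the same as running its bytes through a standard Base64 codec.

```python
import string

# Assumed digit alphabet: every digit is one ASCII (single-byte) character.
ALPHABET = string.ascii_uppercase + string.ascii_lowercase + string.digits + "+/"


def encode_base64_int(n: int) -> str:
    """Write a non-negative integer in base 64, most significant digit first."""
    if n < 0:
        raise ValueError("only non-negative integers are supported")
    if n == 0:
        return ALPHABET[0]
    digits = []
    while n > 0:
        n, r = divmod(n, 64)
        digits.append(ALPHABET[r])
    return "".join(reversed(digits))


def decode_base64_int(s: str) -> int:
    """Inverse of encode_base64_int; raises ValueError on an unknown digit."""
    n = 0
    for ch in s:
        n = n * 64 + ALPHABET.index(ch)
    return n


if __name__ == "__main__":
    big = 2**200 + 12345
    text = encode_base64_int(big)
    print(text, decode_base64_int(text) == big)
```

The encoding has no length prefix at all, so it needs some outside delimiter (whitespace, end of line) to mark where the number ends.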

Previous thoughts on encoding numbers: decimal-line-lengths, standard-base-and-width
