Rizin
unix-like reverse engineering framework and cli tools
vli.h File Reference

Variable-length integer handling. More...

Go to the source code of this file.

Macros

#define LZMA_VLI_MAX   (UINT64_MAX / 2)
 Maximum supported value of a variable-length integer. More...
 
#define LZMA_VLI_UNKNOWN   UINT64_MAX
 VLI value to denote that the value is unknown. More...
 
#define LZMA_VLI_BYTES_MAX   9
 Maximum supported encoded length of variable length integers. More...
 
#define LZMA_VLI_C(n)   UINT64_C(n)
 VLI constant suffix. More...
 
#define lzma_vli_is_valid(vli)    ((vli) <= LZMA_VLI_MAX || (vli) == LZMA_VLI_UNKNOWN)
 Validate a variable-length integer. More...
 

Typedefs

typedef uint64_t lzma_vli
 Variable-length integer type. More...
 

Functions

 LZMA_API (lzma_ret) lzma_vli_encode(lzma_vli vli
 Encode a variable-length integer. More...
 
 LZMA_API (uint32_t) lzma_vli_size(lzma_vli vli) lzma_nothrow lzma_attr_pure
 Get the number of bytes required to encode a VLI. More...
 

Variables

size_tvli_pos
 
size_t uint8_tout
 
size_t uint8_t size_tout_pos
 
size_t uint8_t size_t size_t out_size lzma_nothrow
 
size_t const uint8_tin
 
size_t const uint8_t size_tin_pos
 

Detailed Description

Variable-length integer handling.

In the .xz format, most integers are encoded in a variable-length representation, which is sometimes called little endian base-128 encoding. This saves space when smaller values are more likely than bigger values.

The encoding scheme encodes seven bits to every byte, using minimum number of bytes required to represent the given value. Encodings that use non-minimum number of bytes are invalid, thus every integer has exactly one encoded representation. The maximum number of bits in a VLI is 63, thus the vli argument must be less than or equal to UINT64_MAX / 2. You should use LZMA_VLI_MAX for clarity.

Definition in file vli.h.

Macro Definition Documentation

◆ LZMA_VLI_BYTES_MAX

#define LZMA_VLI_BYTES_MAX   9

Maximum supported encoded length of variable length integers.

Definition at line 44 of file vli.h.

◆ LZMA_VLI_C

#define LZMA_VLI_C (   n)    UINT64_C(n)

VLI constant suffix.

Definition at line 49 of file vli.h.

◆ lzma_vli_is_valid

#define lzma_vli_is_valid (   vli)     ((vli) <= LZMA_VLI_MAX || (vli) == LZMA_VLI_UNKNOWN)

Validate a variable-length integer.

This is useful to test that application has given acceptable values for example in the uncompressed_size and compressed_size variables.

Returns
True if the integer is representable as VLI or if it indicates unknown value.

Definition at line 75 of file vli.h.

◆ LZMA_VLI_MAX

#define LZMA_VLI_MAX   (UINT64_MAX / 2)

Maximum supported value of a variable-length integer.

Definition at line 34 of file vli.h.

◆ LZMA_VLI_UNKNOWN

#define LZMA_VLI_UNKNOWN   UINT64_MAX

VLI value to denote that the value is unknown.

Definition at line 39 of file vli.h.

Typedef Documentation

◆ lzma_vli

typedef uint64_t lzma_vli

Variable-length integer type.

Valid VLI values are in the range [0, LZMA_VLI_MAX]. Unknown value is indicated with LZMA_VLI_UNKNOWN, which is the maximum value of the underlying integer type.

lzma_vli will be uint64_t for the foreseeable future. If a bigger size is needed in the future, it is guaranteed that 2 * LZMA_VLI_MAX will not overflow lzma_vli. This simplifies integer overflow detection.

Definition at line 63 of file vli.h.

Function Documentation

◆ LZMA_API() [1/2]

LZMA_API ( lzma_ret  )

Encode a variable-length integer.

Decode a variable-length integer.

This function has two modes: single-call and multi-call. Single-call mode encodes the whole integer at once; it is an error if the output buffer is too small. Multi-call mode saves the position in *vli_pos, and thus it is possible to continue encoding if the buffer becomes full before the whole integer has been encoded.

Parameters
vliInteger to be encoded
vli_posHow many VLI-encoded bytes have already been written out. When starting to encode a new integer in multi-call mode, *vli_pos must be set to zero. To use single-call encoding, set vli_pos to NULL.
outBeginning of the output buffer
out_posThe next byte will be written to out[*out_pos].
out_sizeSize of the out buffer; the first byte into which no data is written to is out[out_size].
Returns
Slightly different return values are used in multi-call and single-call modes.

Single-call (vli_pos == NULL):

  • LZMA_OK: Integer successfully encoded.
  • LZMA_PROG_ERROR: Arguments are not sane. This can be due to too little output space; single-call mode doesn't use LZMA_BUF_ERROR, since the application should have checked the encoded size with lzma_vli_size().

Multi-call (vli_pos != NULL):

  • LZMA_OK: So far all OK, but the integer is not completely written out yet.
  • LZMA_STREAM_END: Integer successfully encoded.
  • LZMA_BUF_ERROR: No output space was provided.
  • LZMA_PROG_ERROR: Arguments are not sane.

Like lzma_vli_encode(), this function has single-call and multi-call modes.

Parameters
vliPointer to decoded integer. The decoder will initialize it to zero when *vli_pos == 0, so application isn't required to initialize *vli.
vli_posHow many bytes have already been decoded. When starting to decode a new integer in multi-call mode, *vli_pos must be initialized to zero. To use single-call decoding, set vli_pos to NULL.
inBeginning of the input buffer
in_posThe next byte will be read from in[*in_pos].
in_sizeSize of the input buffer; the first byte that won't be read is in[in_size].
Returns
Slightly different return values are used in multi-call and single-call modes.

Single-call (vli_pos == NULL):

  • LZMA_OK: Integer successfully decoded.
  • LZMA_DATA_ERROR: Integer is corrupt. This includes hitting the end of the input buffer before the whole integer was decoded; providing no input at all will use LZMA_DATA_ERROR.
  • LZMA_PROG_ERROR: Arguments are not sane.

Multi-call (vli_pos != NULL):

  • LZMA_OK: So far all OK, but the integer is not completely decoded yet.
  • LZMA_STREAM_END: Integer successfully decoded.
  • LZMA_DATA_ERROR: Integer is corrupt.
  • LZMA_BUF_ERROR: No input was provided.
  • LZMA_PROG_ERROR: Arguments are not sane.

◆ LZMA_API() [2/2]

LZMA_API ( uint32_t  ) const

Get the number of bytes required to encode a VLI.

Returns
Number of bytes on success (1-9). If vli isn't valid, zero is returned.

Get the number of bytes required to encode a VLI.

Run-time version number as an integer.

Get the number of processor cores or threads.

Calculate CRC32.

Although not all Check IDs have a check algorithm associated, the size of every Check is already frozen. This function returns the size (in bytes) of the Check field with the specified Check ID. The values are: { 0, 4, 4, 4, 8, 8, 8, 16, 16, 16, 32, 32, 32, 64, 64, 64 }

If the argument is not in the range [0, 15], UINT32_MAX is returned.

Calculate CRC32 using the polynomial from the IEEE 802.3 standard.

Parameters
bufPointer to the input buffer
sizeSize of the input buffer
crcPreviously returned CRC value. This is used to calculate the CRC of a big buffer in smaller chunks. Set to zero when starting a new calculation.
Returns
Updated CRC value, which can be passed to this function again to continue CRC calculation.

Variable Documentation

◆ in

Definition at line 155 of file vli.h.

◆ in_pos

Definition at line 155 of file vli.h.

◆ lzma_nothrow

Definition at line 116 of file vli.h.

◆ out

Definition at line 116 of file vli.h.

◆ out_pos

size_t uint8_t size_t* out_pos

Definition at line 116 of file vli.h.

◆ vli_pos

size_t * vli_pos

Definition at line 115 of file vli.h.

Referenced by LZMA_API().