pandas.util.hash_array

pandas.util.hash_array(vals, encoding='utf8', hash_key='0123456789123456', categorize=True)[source]

Given a 1d array, return an array of deterministic integers.

Parameters:
valsndarray or ExtensionArray
encodingstr, default ‘utf8’

Encoding for data & key when strings.

hash_keystr, default _default_hash_key

Hash_key for string key to encode.

categorizebool, default True

Whether to first categorize object arrays before hashing. This is more efficient when the array contains duplicate values.

Returns:
ndarray[np.uint64, ndim=1]

Hashed values, same length as the vals.