Skip to content

Releases: Bergvca/string_grouper

v0.7.1

26 Feb 12:57

Choose a tag to compare

Changed

  • Code wise, nothing changed. However, the version number is cleaned up in pyproject.toml
  • Cleaned up documentation and readme.md. Most documentation is moved from the readme.md to:
    https://bergvca.github.io/string_grouper.

v0.7.0

28 Jan 20:03

Choose a tag to compare

  • sparse_dot_topn_for_blocks and topn dependencies are removed and replaced by sparse_dot_topn official library from ING Bank, this is a big change: it may have impacts from old code using string_grouper
  • n_blocks None is now the default value for n_blocks and optimal numbers of blocks will be guessed based on empirical observation to split data into smaller chunks (based on input data size)
  • sparse_dot_topn now integrates a block/chunk strategy. This strategy is used in string_grouper.

Added

  • a new parameter normalize_to_ascii to normalize unicode character to ascii ones
  • loguru dependency is introduced to print messages to user

v0.6.0

15 Oct 18:55

Choose a tag to compare

Added matrix-blocking/splitting as a performance-enhancer

v0.5.0

02 Jul 13:32

Choose a tag to compare

Updated version number