Releases: jftuga/deidentification
v1.3.2
v1.3.1
What's Changed
- Use single quotes inside a double-quoted f-string by @danielmlow in #14
- add support for 'her' object pronoun by @jftuga in #15
New Contributors
- @danielmlow made their first contribution in #14
Full Changelog: v1.3.0...v1.3.1
v1.3.0
add --exclude option
Add ability to exclude entities from de-identification with -x, --exclude.
- This uses a comma as the delimiter to allow for multiple entities.
- The comma can be overridden by setting the `DEIDENTIFY_EXCLUDE_DELIM` environment variable.
The Python API can also use this option by setting `DeidentificationConfig.excluded_entities` to a Python set.
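The delimiter behaviour described above can be sketched roughly as follows. This is not the package's own code: `parse_excluded_entities` is a hypothetical helper, and trimming whitespace around each entity is an assumption. Only the `DEIDENTIFY_EXCLUDE_DELIM` variable name, the comma default, the lower-casing, and the use of a set come from these notes.

```python
import os


def parse_excluded_entities(raw: str) -> set[str]:
    """Split an --exclude style value into a lower-cased set of entities.

    Hypothetical helper for illustration; not part of the deidentification API.
    """
    # The variable name comes from the release notes; an unset or empty
    # value falls back to the documented comma default.
    delimiter = os.environ.get("DEIDENTIFY_EXCLUDE_DELIM") or ","
    # Assumption: surrounding whitespace is trimmed and empty items are dropped.
    return {part.strip().lower() for part in raw.split(delimiter) if part.strip()}


# Example: build the set that could be assigned to excluded_entities.
excluded = parse_excluded_entities("John Smith, Acme")
print(sorted(excluded))
```

Overriding the delimiter is useful when an entity itself contains a comma, for example a name written as "Smith, John".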
Improve Python API
- reset all internal variables at the beginning of the `deidentify` method
- lower-case all `config.excluded_entities`
- added API testing with `api_test.py`
v1.2.1
prepare for PyPI deployment
- create and/or update files for PyPI
- Created `Makefile` and `get_project_name.py` to deploy to `test` and `prod` PyPI servers
- updated install instructions in `README.md`
- set minimum Python version to `3.10`
allow for multiple languages
- allow for multiple languages in the future by making `GENDER_PRONOUNS` a dict which uses the `DeidentificationLanguages` Enum-style class as keys
- moved helper classes to `deidentification_constants.py` to avoid a circular dependency
- `DeidentificationLanguages` now maps the default `DeidentificationConfig.replacement` word to a language-specific noun, such as `PERSON`
v1.2.0
v1.1.2
Small Bug Fixes
- `get_identified_elements()` will now always return pronouns
  - Previously, if multiple passes were needed in `deidentify()`, `get_identified_elements()` would not return any pronouns.
- use `self.text` instead of `self.replaced_text` in `get_identified_elements()`
- Include small refinements to `README.md`
v1.1.0
CLI Improvements
- added third-party `VeryPrettyTable` module as a dependency
- documented the CLI program, `deidentify`, in `README.md`
- added `-t` to the CLI to save detected entities to a JSON file
- added `-d` to the CLI for debug mode
- use the third-party `chardet` module to detect file character encodings for input files
- updated `Deidentification` class to accommodate these CLI options
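As a rough illustration of the encoding detection mentioned above, the sketch below decodes raw bytes using the encoding that `chardet.detect()` guesses. The `decode_bytes` helper and its UTF-8 fallback are assumptions for illustration, not the CLI's actual code; the optional import only keeps the sketch runnable where `chardet` is not installed.

```python
try:
    import chardet  # third-party module named in the release notes
except ImportError:  # assumption: degrade gracefully without the dependency
    chardet = None


def decode_bytes(raw: bytes, fallback: str = "utf-8") -> str:
    """Decode raw file contents using a detected encoding.

    Hypothetical helper; the real CLI may handle this differently.
    """
    encoding = None
    if chardet is not None:
        # chardet.detect returns a dict whose "encoding" value can be None
        # when no guess is possible (for example, on empty input).
        encoding = chardet.detect(raw)["encoding"]
    # Undecodable bytes are replaced rather than raising an error.
    return raw.decode(encoding or fallback, errors="replace")


print(decode_bytes(b"Alice met Bob in Paris."))
```

Reading the file in binary mode and decoding afterwards avoids guessing an encoding up front, which matters for input files of unknown origin.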