Releases · jftuga/deidentification

10 Jan 21:06

jftuga

v1.3.0

72a5a7d

v1.3.0 Latest

Latest

add --exclude option

Add ability to exclude entities from de-identification with -x, --exclude.

This uses a comma as the delimiter to allow for multiple entities.
Comma can be overridden by setting the DEIDENTIFY_EXCLUDE_DELIM environment variable.

The Python API can also use this option be setting a DeidentificationConfig.excluded_entities option to a Python set data type.

Improve Python API

reset all internal variables at the beginning of the deidentify method
lower-case all config.excluded_entities
added API testing with api_test.py

Assets 2

04 Jan 22:55

jftuga

v1.2.1

1a3fd96

v1.2.1

prepare for PiPY deployment

create and/or update files for PyPI
Created Makefile and get_project_name.py to deploy to test and prod PyPI servers
updated install instructions in README.md
set minimum Python version to 3.10

allow for multiple languages

allow for multiple languages in the future by making GENDER_PRONOUNS a dict which uses the DeidentificationLanguages Enum-style class as keys
moved helper classes to deidentification_constants.py to avoid a circular dependency
DeidentificationLanguages now maps the default DeidentificationConfig.replacement word to a language-specific noun, such as PERSON

Assets 2

03 Jan 22:13

jftuga

v1.2.0

26c0fa7

v1.2.0

Model Download

When a spaCy model has not been downloaded, advise the user on how to manually download it.

Assets 2

03 Jan 03:19

jftuga

v1.1.2

afb0cd1

v1.1.2

Small Bug Fixes

get_identified_elements() will now always return pronouns
- If multiple passes were needed in deidentify(), then get_identified_elements() would not have returned any pronouns.
use self.text instead of self.replaced_text in get_identified_elements()
Include small refinements to README.md

Assets 2

02 Jan 13:41

jftuga

v1.1.0

b2d8f1e

v1.1.0

CLI Improvements

added third-party VeryPrettyTable module as a dependency
documented the CLI program, deidentify in README.md
added -t to save detected entities to a JSON file to the CLI
added -d for debug mode to the CLI
use the third-party chardet module to detect file character encodings for input files
updated Deidentification class to accommodate these CLI options

Assets 2

02 Jan 01:40

jftuga

v1.0.0

1c10533

v1.0.0

1.0.0

Assets 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add --exclude option

Improve Python API

prepare for PiPY deployment

allow for multiple languages

Model Download

Small Bug Fixes

CLI Improvements

Releases: jftuga/deidentification

v1.3.0

add --exclude option

Improve Python API

v1.2.1

prepare for PiPY deployment

allow for multiple languages

v1.2.0

Model Download

v1.1.2

Small Bug Fixes

v1.1.0

CLI Improvements

v1.0.0