Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
seanpedrickcase
/
document_redaction
like
5
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
0c2987b
document_redaction
/
tools
Ctrl+K
Ctrl+K
3 contributors
History:
78 commits
seanpedrickcase
Corrected image resizing method for instances where the image is very large.
0c2987b
4 months ago
__init__.py
Safe
0 Bytes
Initial commit
about 1 year ago
auth.py
Safe
2.65 kB
You can now have output redaction boxes in grey according to an environment variable. Review files are now saved every time page is changed.
4 months ago
aws_functions.py
Safe
7.37 kB
Fixed issue where redactions were sometimes not removing text underneath boxes. You can now redact in different colours from review page
5 months ago
aws_textract.py
Safe
11.8 kB
Fix bug to identify all handwriting labels. Now only concatenates entity_type boxes if they have different labels.
5 months ago
cli_redact.py
Safe
4.73 kB
Allowed for overwriting of default output folder in choose_and_run_redactor function.
6 months ago
custom_csvlogger.py
Safe
6.65 kB
Created custom csvlogger to try to overcome AWS Lambda's incompatibility with multithread locks
6 months ago
custom_image_analyser_engine.py
Safe
39.6 kB
App should now resize images that are too large before sending to Textract. Textract now more robust to failure. Improved reliability of json conversion to review dataframe
4 months ago
data_anonymise.py
Safe
20.9 kB
Added support for AWS Comprehend for PII identification. OCR and detection results now written to main output
7 months ago
file_conversion.py
Safe
36.1 kB
Corrected image resizing method for instances where the image is very large.
4 months ago
file_redaction.py
Safe
102 kB
Corrected image resizing method for instances where the image is very large.
4 months ago
helper_functions.py
Safe
11.8 kB
App should now resize images that are too large before sending to Textract. Textract now more robust to failure. Improved reliability of json conversion to review dataframe
4 months ago
load_spacy_model_custom_recognisers.py
Safe
6.69 kB
Greatly improved regex for direct matching with custom entities
4 months ago
presidio_analyzer_custom.py
Safe
4.94 kB
Added support for AWS Comprehend for PII identification. OCR and detection results now written to main output
7 months ago
redaction_review.py
14.3 kB
Corrected image resizing method for instances where the image is very large.
4 months ago