data_text_search / requirements.txt

Commit History

Cognito authorisation option added to app, some other minor changes.
759001a

seanpedrickcase commited on

Changed embedding model to MiniLM-L6 as faster. Compressed embeddings are now int8. General improvements to API mode
ea0dd40

seanpedrickcase commited on

General code improvements and refinements.
a95ef9f

seanpedrickcase commited on

Some package updates and minor changes
2754a2b

seanpedrickcase commited on

Fixed cleaning for semantic search. Handles text with backslashes in (if cleaned). Updated packages. requirements file for only keyword search added.
8466e45

seanpedrickcase commited on

Gradio 4.21. Limitations on file size and creating embeddings. Added AWS integration
e0fe055

seanpedrickcase commited on

Updated to latest Gradio version that hopefully solved root_url issues. Added ssl_verify=False to launch
813f823

Sean-Case commited on

set gradio version to 4.18.0, before root path issue reappears
8ba34be

Sean-Case commited on

Added highlight search term functionality to keyword search output
36a404e

seanpedrickcase commited on

Updated to Gradio 4.16.0. Now works correctly with BGE embeddings
2bcd818

seanpedrickcase commited on

Upgraded to Gradio 4.16.0. Added Spacy fuzzy search functionality.
4ce2224

Sean-Case commited on

Changed requirements to download correct spacy model version
4e77f14

Sean-Case commited on

Cut out semantic search temporarily while issues with Jina gated model resolved. Improved error/progress tracking and messaging. Placeholder for Spacy fuzzy search.
739b386

seanpedrickcase commited on

Switch back to Gradio version 3.50.2 as 4 is not working with pyinstaller app creation yet
c6dc87d

Sean-Case commited on

Better error checking. Doesn't load in embeddings file twice now.
63049fe

Sean-Case commited on

Added accelerate to requirements. Minor doc changes
f2db299

seanpedrickcase commited on

Many changes to code organisation. More efficient searches from using intermediate outputs. Version 0.1
99d6fba

seanpedrickcase commited on

Faster embedding with GPU, fast document split, writes to chromadb file correctly. No longer needs FAISS or langchain
2cb9977

seanpedrickcase commited on

Added to requirements: transformers
acfac99

Sean-Case commited on

Initial commit
a9c2120

Sean-Case commited on