topic_modelling / funcs /helper_functions.py

Commit History

Should now parse custom regex correctly. Will now wipe previously created embeddings if 'low resource mode' option switched.
0a543a0

Sean-Case commited on

Allowed for uploading custom regex for cleaning. Fixed calculate all probabilities, reduce outliers. Added text tree for hierarchical modelling.
381f959

Sonnyjim commited on

Upgraded to Gradio 4.16.0. Guide for converting to exe added.
0a177ca

Sonnyjim commited on

Fixed file load with files including capital letters
9c6425d

Sonnyjim commited on

Added clean data options, improved re-representation options and visualisation. General format changes
4effac0

Sonnyjim commited on

Model save now checks and makes a folder before writing the model
356791c

Sonnyjim commited on

Lots of general fixes. New visualisations, fixed hierarchical vis for zero shot. Added calc all probabilities.
b4510a6

Sonnyjim commited on

Split off LLM representation, visualisation, and reduce outliers from main function. Added hierarchical visualisation and logs
5d87c3c

Sonnyjim commited on

App should now check if embeddings are loaded before topic modelling. And will save only once.
9eeba1e

Sonnyjim commited on

Model export changed to safetensors. Improved representational model function. Got zero shot topic modelling working
4cfed8e

Sean-Case commited on

first commit
9dbf344

Sonnyjim commited on