Commit History
Rearranged functions for embeddings creation to be compatible with zero GPU space. Updated packages.
cc495e1
Added example of how to run function from command line. Updated packages. Embedding model default now smaller and at fp16.
34f1e83
App now retains original index following cleaning to allow for referring back to original data
90553eb
Can split passages into sentences. Improved embedding, LLM representation models, improved zero shot capabilities
55f0ce3
Updated packages. Improve hierarchy vis. Better models - mixedbread and phi3. Now option to split texts into sentences before modelling.
04a15c5
Hopefully now LLM download from hub should work
cdcd7af
Some text changes. Fixed a couple of TF-IDF embeddings issues
87306c7
Sean-Case
committed on
Fixed file load with files including capital letters
9c6425d
Lots of general fixes. New visualisations, fixed hierarchical vis for zero shot. Added calc all probabilities.
b4510a6
Switched embeddings model to BGE Small 1.5 as Jina seemed unable to do zero shot topic modelling properly
be094ee
Split off LLM representation, visualisation, and reduce outliers from main function. Added hierarchical visualisation and logs
5d87c3c
App should now check if embeddings are loaded before topic modelling. And will save only once.
9eeba1e
Returned TruncatedSVD components to 100 - higher values don't seem to help
43ac0d8
Sean-Case
committed on
Greatly increased low resource process dimensions for higher quality. Visualisations disabled by default to increase speed.
fac3624
Sean-Case
committed on
Greatly improved low resource mode speed (at cost of potential quality)
aa3df37
Sean-Case
committed on
Fixed llm_config, environmental variable, zero shot topic model errors with quick embeddings
ff32b4a
Sean-Case
committed on
Model export changed to safetensors. Improved representational model function. Got zero shot topic modelling working
4cfed8e
Sean-Case
committed on