github_search_visualizations / text_visualization.py
from typing import Any, Dict, Iterable

import wordcloud
from pydantic import BaseModel
from sklearn.feature_extraction.text import TfidfVectorizer
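class WordCloudExtractor(BaseModel):
    # Keyword arguments for TfidfVectorizer (assumed from the field name).
    tfidf_params: Dict[str, Any]

    def extract_from_corpus(self, texts: Iterable[str], n_words: int) -> wordcloud.WordCloud:
        # Stub: not implemented yet.
        pass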
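
The method above is only a stub, so here is one possible way to fill it in. This is a minimal sketch, not the author's implementation. It assumes that a term's weight is its TF-IDF score summed over all documents, and that `tfidf_params` is passed straight to `TfidfVectorizer`. The helper `top_terms` and the standalone function signature are hypothetical names introduced for illustration.

```python
from typing import Any, Dict, Iterable


def top_terms(term_scores: Dict[str, float], n_words: int) -> Dict[str, float]:
    """Keep the n_words highest-scoring terms (ties broken alphabetically)."""
    ranked = sorted(term_scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return dict(ranked[:n_words])


def extract_from_corpus(texts: Iterable[str], n_words: int, tfidf_params: Dict[str, Any]):
    # Third-party imports are kept local so top_terms stays usable without them.
    import numpy as np
    from sklearn.feature_extraction.text import TfidfVectorizer
    from wordcloud import WordCloud

    vectorizer = TfidfVectorizer(**tfidf_params)
    tfidf = vectorizer.fit_transform(list(texts))  # shape: (n_docs, n_terms)

    # Assumption: weight of a term = its TF-IDF score summed over the corpus.
    totals = np.asarray(tfidf.sum(axis=0)).ravel()
    scores = dict(zip(vectorizer.get_feature_names_out(), totals.tolist()))

    # generate_from_frequencies returns the WordCloud instance itself.
    return WordCloud(max_words=n_words).generate_from_frequencies(
        top_terms(scores, n_words)
    )
```

Summing scores over documents favors terms that are distinctive in many texts. Taking the mean or the maximum per term would be an equally reasonable choice, depending on what the cloud should emphasize.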