Statistical Methods for Text Data Analysis
This course teaches various statistical methods for modeling and analysing text data. Contents are planned to include models for representing text including vector space models and neural embedding models; document content processing stages such as lemmatization and keyphrase extraction; probabilistic models of content variation including n-grams and topic models; neural models of text; and methods for various text analysis tasks.
Bachelor’s level, Master’s level