Course in English
This course offers an introduction to automatic text processing, from how to represent text numerically to the basic machine learning algorithms developed for these representations. It should be taken in parallel with SD-TSIA 210, which introduces general machine learning methods. This course does not address deep learning for natural language processing, which is covered in SD-TSIA 203 in the following period. Rather, it provides a detailed tour of pre-deep-learning methods of natural language processing and will help contextualize the development of deep learning, since NLP is one of its main application domains. It is strongly recommended for students wishing to choose courses about NLP/LLMs in their third year.