Language and Computation Courses
Multiword Expressions in a Nutshell
Carlos Ramisch (Aix Marseille University, France), Agata Savary (Université François Rabelais Tours – IUT de Blois, France) and Aline Villavicencio (University of Essex, UK and Universidade Federal do Rio Grande do Sul, Brazil)
Much has been said and written about multiword expressions (MWEs). Although they are a “pain in the neck”, they have become a hot topic in computational linguistics, as the focus has moved from automatic discovery to in-context identification, parsing, semantic interpretation and machine translation. Nonetheless, the current treatment of MWEs in language technology is far from satisfactory, given their complex and heterogeneous behaviour. The goal of this hands-on course is to provide a broad introduction to MWEs, with a strong multilingual emphasis. It covers theoretical foundations, discussing the properties of MWEs and guidelines for their annotation, possible scenarios for their computational treatment, and techniques for idiomaticity prediction. Laboratory sessions provide students with an opportunity to use tools like FLAT for corpus annotation and the mwetoolkit for idiomaticity prediction. This course is addressed to students and researchers in computational linguistics who wish to analyse MWEs and integrate them into their computational tools and linguistic studies.