Linguistics for Language Technology#
You are looking at course notes for a course called ‘Linguistics for Language Technology’, the 2023 (very first!) edition. This is a first-year course in the BA program ‘Information Science’ at the University of Groningen. I am Lisa Bylinina, I designed this course and I am teaching it.
Why does this course exist? Here’s the motivating paragraph from the official syllabus, because I spent some time formulating it and it’s kind of nice:
The way information science is currently developing calls for future practitioners in the area of language technology to be truly equipped with linguistic knowledge specifically tailored to the challenges and research typical of work in Natural Language Processing (NLP). This requires relevant working knowledge of the basic concepts of linguistic theory: words, morphology, syntax, cross-linguistic variation, semantics (word meaning, sentence meaning), discourse and pragmatics.
So, the course tries to convey two things at the same time:
Some basics of linguistics: fundamental notions, tools, distinctions, facts;
The relevance of these basics for language technology: how the basics of linguistics can help in NLP practice, providing the language needed to look at NLP systems and data analytically.
Here is the current week-by-week schedule:
Week |
Topic |
---|---|
Week 1 |
Intro |
Weeks 2-3 |
Transmitting and capturing language |
Week 4 |
Grammar I |
Week 5 |
Grammar II |
Week 6 |
Meaning |
Week 7 |
Beyond human and natural languages |
Let’s go!
Acknowledgements
I want to thank everybody who helped me shape this course [names will be added later!]. Special thanks go to the Telegram group ‘ling for langtech playground’!