Linguistics for Language Technology#
You are looking at course notes for a course called ‘Linguistics for Language Technology’, the 2023 (very first!) edition. This is a first-year course in the BA program ‘Information Science’ at the University of Groningen. I am Lisa Bylinina, I designed this course and I am teaching it.
Why does this course exist? Here’s the motivating paragraph from the official syllabus, because I spent some time formulating it and it’s kind of nice:
The way information science is currently developing calls for future practitioners in the area of language technology to be truly equipped with linguistic knowledge specifically tailored to the challenges and research typical of work in Natural Language Processing (NLP). This requires relevant working knowledge of the basic concepts of linguistic theory: words, morphology, syntax, cross-linguistic variation, semantics (word meaning, sentence meaning), discourse and pragmatics.
So, the course tries to convey two things at the same time:
Some basics of linguistics: fundamental notions, tools, distinctions, facts;
The relevance of these basics for language technology: how the basics of linguistics can help in NLP practice, providing the language needed to look at NLP systems and data analytically.
Here is the current week-by-week schedule:
Week |
Topic |
---|---|
Week 1 |
Intro |
Weeks 2-3 |
Transmitting and capturing language |
Week 4 |
Grammar I |
Week 5 |
Grammar II |
Week 6 |
Meaning |
Week 7 |
Outro |
Let’s go!
Acknowledgements
I want to thank everybody who helped me shape this course: my colleages at GroNLP; special thanks to the Telegram group ‘ling for langtech playground’, with a spotlight mention of David Dalé, Gabriele Sarti, Alexey Tikhonov, Natalia Perkova and Dmitry Jemerov!
Use and citation
Feel free to use my notes for your courses or whatever you feel like using them for. Happy to help! If you cite these notes, I’ll be very glad, I’m vain this way.
@misc{Bylinina2023,
author = {Bylinina, Lisa},
title = {Lecture notes: Linguistics for Language Technology},
year = {2023},
howpublished = {\url{https://bylinina.github.io/ling_course/}},
place = {Groningen, Netherlands}
}