
Advanced Language Technologies

ECTS Credits: 5

Lecturers
  • doc. dr. Senja Pollak
Programmes
  • None

Goals

Language technologies comprise methods and applications for the computer processing of natural language. Students will gain a basic theoretical understanding of, and practical experience with, language technologies and computational linguistics, which is a prerequisite for effective work on the computer processing of language data. The course objectives are to (a) introduce the basics of language technologies, (b) present the coding and annotation of language resources, and (c) present selected methodologies and techniques used in language technologies. The course focuses on processing the Slovene language and on cross-lingual methods. Students will master the basics of language technologies and be able to apply selected methods and tools in practice.

Curriculum

  • Introduction: development of linguistics and computational linguistics, complexity of language, levels of linguistic analysis, overview of applications and methods.
  • Text analysis with machine learning methods: relevant methods of machine learning; use cases: automatic morphological, syntactic and semantic annotation.
  • Encoding standards: history of standardisation, coding of characters, XML, Text Encoding Initiative, ISO, evaluation methods.
  • Research infrastructures for linguistics: open science, digital humanities, ethical and legal considerations of dealing with language data, CLARIN research infrastructure.

Obligations

Completed second-cycle studies in information or communication technologies, or completed second-cycle studies in another field together with knowledge of the fundamentals of this course's field. Basic knowledge of mathematics, computer science and informatics is also required.

Examination

Literature and references
