Multilingual Lens: Investigating Large Text Corpora from Different Methodological Perspectives

January 16, 2026

January 16, 2026, C247

UNCE – Multilingual lens: Investigating large text corpora from different methodological perspectives

10:00-10:25 Federica Gamba Bootstrapping UMRs from UD for scalable multilingual annotation
10:25-10:50 Abishek Stephen Automated quality control for language documentation: Detecting phonotactic inconsistencies in a Kokborok wordlist
10:50-11:15 Vojtěch John Cross-linguistic statistical patterns in morphologically annotated corpora

Break

11:30-11:55 Klára Pivoňková When data meet tools: Using the monitor corpus for the analysis of language development
11:55-12:20 Michal Olbrich Assembling a large diachronic corpus of Czech books (1850–1950)
12:20-12:45 Jan Henyš Exploring Register Diversity in Czech Internet Language
12:20-12:45 Konstantin Sulimenko Enhancing corpus-assisted discourse studies with sentiment analysis

Break

13:00-13:25 Khatia Buskivadze Elaborative discourse markers in Georgian conditional constructions
13:25-13:50 Hana Hledíková Lexicalized valency alternations occurring with prefixation in Czech
13:50-14:15 Martin Sedláček Arrival in Czech and English: A holistic spatial semantics analysis

Lunch & discussion