Jörg Tiedemann


Description of research and teaching


Curriculum Vitae

I work as professor of language technology at the Department of Digital Humanities at the University of Helsinki. My main research interest is in cross-lingual NLP and machine translation.

  • Since August 2015: Professor of Language Technology at the Department of Digital Humanities / HELDIG (formerly at the Department of Modern Languages), University of Helsinki
  • September 2014 – July 2015: Senior Researcher at the Department of Linguistics and Philology, Uppsala University
  • September 2009 – August 2014: Visiting Professor at the Department of Linguistics and Philology, Uppsala University
  • September 2004 – August 2009: PostDoc researcher at the Department of Information Science/Humanities Computing (Informatiekunde), University of Groningen
  • January 2004 – August 2004: Lecturer in computational linguistics and coordinator for the language technology programme, Department of Linguistics and Philology, Uppsala University
  • 2000 – 2003: Ph.D. research at the Department of Linguistics, Uppsala University
  • 2001 – 2002: Visiting Ph.D. student, Division of Informatics, Edinburgh University, UK
  • 1997 – 1999: Research assistent, Department of Linguistics, Uppsala University
  • 1991 – 1997: Masters in Computer Science (Diplom für Informatik), “Otto-von-Guericke” University, Magdeburg, Germany

Recent Projects

Resources and Tools

  • OPUS – a collection of freely available parallel corpora and tools
  • fiskmö translator – a translation demo for the Nordic languages
  • efmaral and eflomal – tools for efficient word alignment
  • WMT en-fi 20162017: official MT test sets for Finnish-English
  • HNMT – the Helsinki Neural Machine Translation system
  • Lingua::Align – a toolbox for tree-to-tree alignment
  • Uplug – a toolbox for processing parallel corpora
  • Lingua::Ident::Blacklists – language identifier for related languages
  • Docent – a document-level SMT decoder
  • pdf2xml – a converter for PDF documents
  • subalign – tools for converting and aligning movie subtitles
  • Helsinki-NLP at github and bitbucket

Active PhD Students

Former PhD Students

Highlighted publications

  1. Emerging Language Spaces Learned From Massively Multilingual Corpora

    Tiedemann, J., 2018, Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference (DHN 2018). Mäkelä, E., Tolonen, M. & Tuominen, J. (eds.). Helsinki: CEUR Workshop Proceedings, Vol. 2084. p. 188-197 (CEUR Workshop Proceedings).

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

  2. Synthetic Treebanking for Cross-Lingual Dependency Parsing

    Tiedemann, J. & Agi, Z., Jan 2016, In : Journal of Artificial Intelligence Research. 55, p. 209-248 40 p.

    Research output: Contribution to journalArticleScientificpeer-review

  3. Efficient word alignment with Markov Chain Monte Carlo

    Östling, R. & Tiedemann, J., 1 Oct 2016, In : The Prague Bulletin of Mathematical Linguistics. 106, p. 125-146 22 p.

    Research output: Contribution to journalArticleScientificpeer-review

  4. Parallel Data, Tools and Interfaces in OPUS

    Tiedemann, J., 1 May 2012, Unknown host publication. Calzolari, N., Choukri, K., Declerck, T., Dogan, M. U., Maegaard, B., Mariani, J., Odijk, J. & Piperidis, S. (eds.). p. 2214-2218 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

  5. Bitext Alignment

    Tiedemann, J. & Hirst, G. (ed.), 2011, San Rafael, California (USA): Morgan & Claypool. (Synthesis Lectures on Human Language Technologies)

    Research output: Book/ReportBookScientificpeer-review

View all (99) »

Highlighted activities

  1. What is the meaning of life? I found it in translation ...

    Jörg Tiedemann (Speaker)
    12 Dec 2018

    Activity: Talk or presentation typesOral presentation

  2. FoTran 2018: Found in translation

    Jörg Tiedemann (Scientific Committee Chair), Hande Celikkanat (Member of organizing committee), Juan Raul Vazquez Carrillo (Member of organizing committee)
    28 Sep 2018

    Activity: Participating in or organising an event typesOrganisation and participation in conferences, workshops, courses, seminars

View all (92) »

Highlighted projects

  1. MeMAD: Methods for Managing Audiovisual Data: Combining Automatic Efficiency with Human Accuracy

    Hirvonen, M., Tiedemann, J., Tiittula, L., Sulubacak, U. & Vazquez , R.

    European Commission / Horizon 2020


    Project: Research project_Research

  2. fiskmö: Creation of a parallel corpus of translated documents and machine translation for Finnish and Swedish

    Tiedemann, J., Ginter, F., Papula, N., Aulamo, M., Nieminen, T. & Kanerva, J.

    Svenska kulturfonden


    Project: Research project

  3. NLPL: Nordic Language Processing Laboratory

    Tiedemann, J. & Scherrer, Y.


    Project: Other project

View all (12) »

ID: 53542946