Jörg Tiedemann

Research and teaching information

Curriculum vitae

I work as a professor of language technology at the Department of Digital Humanities at the University of Helsinki. My main research interests are cross-lingual NLP and machine translation.

  • Since August 2015: Professor of Language Technology at the Department of Digital Humanities / HELDIG (formerly at the Department of Modern Languages), University of Helsinki
  • September 2014 – July 2015: Senior Researcher at the Department of Linguistics and Philology, Uppsala University
  • September 2009 – August 2014: Visiting Professor at the Department of Linguistics and Philology, Uppsala University
  • September 2004 – August 2009: PostDoc researcher at the Department of Information Science/Humanities Computing (Informatiekunde), University of Groningen
  • January 2004 – August 2004: Lecturer in computational linguistics and coordinator for the language technology programme, Department of Linguistics and Philology, Uppsala University
  • 2000 – 2003: Ph.D. research at the Department of Linguistics, Uppsala University
  • 2001 – 2002: Visiting Ph.D. student, Division of Informatics, Edinburgh University, UK
  • 1997 – 1999: Research assistant, Department of Linguistics, Uppsala University
  • 1991 – 1997: Master's in Computer Science (Diplom für Informatik), “Otto-von-Guericke” University, Magdeburg, Germany

Recent Projects

Resources and Tools

  • OPUS – a collection of freely available parallel corpora and tools
  • fiskmö translator – a translation demo for the Nordic languages
  • efmaral and eflomal – tools for efficient word alignment
  • WMT en-fi 2016 and 2017: official MT test sets for Finnish-English
  • HNMT – the Helsinki Neural Machine Translation system
  • Lingua::Align – a toolbox for tree-to-tree alignment
  • Uplug – a toolbox for processing parallel corpora
  • Lingua::Ident::Blacklists – language identifier for related languages
  • Docent – a document-level SMT decoder
  • pdf2xml – a converter for PDF documents
  • subalign – tools for converting and aligning movie subtitles
  • Helsinki-NLP on GitHub and Bitbucket

Active PhD Students

Former PhD Students

Selected publications

  1. Emerging Language Spaces Learned From Massively Multilingual Corpora

    Tiedemann, J., 2018, Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference (DHN 2018). Mäkelä, E., Tolonen, M. & Tuominen, J. (eds.). Helsinki: CEUR Workshop Proceedings, Vol. 2084. pp. 188-197 (CEUR Workshop Proceedings).

    Research output: Chapter in book/report/conference proceedings, conference contribution, scientific, peer-reviewed

  2. Synthetic Treebanking for Cross-Lingual Dependency Parsing

    Tiedemann, J. & Agić, Ž., Jan 2016, In: Journal of Artificial Intelligence Research. 55, pp. 209-248 (40 pp.).

    Research output: Journal contribution, article, scientific, peer-reviewed

  3. Efficient word alignment with Markov Chain Monte Carlo

    Östling, R. & Tiedemann, J., 1 Oct 2016, In: The Prague Bulletin of Mathematical Linguistics. 106, pp. 125-146 (22 pp.).

    Research output: Journal contribution, article, scientific, peer-reviewed

  4. Parallel Data, Tools and Interfaces in OPUS

    Tiedemann, J., 1 May 2012, Unknown host publication. Calzolari, N., Choukri, K., Declerck, T., Dogan, M. U., Maegaard, B., Mariani, J., Odijk, J. & Piperidis, S. (eds.). pp. 2214-2218 (5 pp.).

    Research output: Chapter in book/report/conference proceedings, conference contribution, scientific, peer-reviewed

  5. Bitext Alignment

    Tiedemann, J. & Hirst, G. (ed.), 2011, San Rafael, California (USA): Morgan & Claypool. (Synthesis Lectures on Human Language Technologies)

    Research output: Book/report, book, scientific, peer-reviewed

Highlighted activities

  1. What is the meaning of life? I found it in translation ...

    Jörg Tiedemann (Speaker)
    12 Dec 2018

    Activity: Talk or presentation, oral presentation

  2. FoTran 2018: Found in translation

    Jörg Tiedemann (Chair of the scientific committee), Hande Celikkanat (Member of the organising committee), Juan Raul Vazquez Carrillo (Member of the organising committee)
    28 Sep 2018

    Activity: Participation in or organisation of an event, organisation of and participation in a conference/workshop/course/seminar

Selected projects

  1. MeMAD: Methods for Managing Audiovisual Data: Combining Automatic Efficiency with Human Accuracy

    Hirvonen, M., Tiedemann, J., Tiittula, L., Sulubacak, U. & Vazquez, R.

    European Commission / Horizon 2020

    01/01/2018 – 31/12/2020

    Project: Research project

  2. fiskmö: Creation of a parallel corpus of translated documents and machine translation for Finnish and Swedish

    Tiedemann, J., Ginter, F., Papula, N., Aulamo, M., Nieminen, T. & Kanerva, J.

    Svenska kulturfonden

    01/05/2018 – 31/05/2020

    Project: Research project

  3. NLPL: Nordic Language Processing Laboratory

    Tiedemann, J. & Scherrer, Y.

    01/01/2017 – 31/12/2019

    Project: Other project
