This repository has moved to https://github.com/bplank/danish_ner_transfer (March 5, 2020)
Repository accompanying the NER (Named Entity Recognition) transfer paper (NoDaLiDa 2019):
@inproceedings{plank-2019-neural,
title = "Neural Cross-Lingual Transfer and Limited Annotated Data for Named Entity Recognition in {D}anish",
author = "Plank, Barbara",
booktitle = "Proceedings of the 22nd Nordic Conference on Computational Linguistics",
month = "30 " # sep # " {--} 2 " # oct,
year = "2019",
address = "Turku, Finland",
publisher = {Link{\"o}ping University Electronic Press},
url = "https://www.aclweb.org/anthology/W19-6143",
pages = "370--375",
abstract = "Named Entity Recognition (NER) has greatly advanced by the introduction of deep neural architectures. However, the success of these methods depends on large amounts of training data. The scarcity of publicly-available human-labeled datasets has resulted in limited evaluation of existing NER systems, as is the case for Danish. This paper studies the effectiveness of cross-lingual transfer for Danish, evaluates its complementarity to limited gold data, and sheds light on performance of Danish NER.",
}
References for the UD (Universal Dependencies) conversion of the Danish Dependency Treebank:
- Johannsen, Anders, Héctor Martínez Alonso, and Barbara Plank. "Universal Dependencies for Danish". TLT14, 2015.
- Buch-Kromann, Matthias T., Line Mikkelsen, and Stine Kern Lynge. "Danish Dependency Treebank". TLT, 2003.
- Keson, Britt. "Documentation of The Danish Morpho-syntactically Tagged PAROLE Corpus". Technical report, DSL, 1998.