Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Adding Conll format with POS tag and Lemmatization #9

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

alabrashJr
Copy link

BIO format has been transformed to BILOU format within conll format using (bio2biluo.py)[https://github.com/ufal/acl2019_nested_ner/blob/master/bio2bilou.py].

Lemmatization and POS tags have been done using(UDPipe)
(http://ufal.mff.cuni.cz/udpipe)

The columns are as follows.

  • FORM: Word form or punctuation symbol.
  • LEMMA: Lemma or stem of word form.
  • XPOS: Language-specific part-of-speech tag; underscore if not available.
  • Labels joined with bar

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant