Never forget to glick the save-icon before closing the annotation-we-page! Otherwise You have to do all the annotations again.
If you want to do a brake, hit the save icon and close the web-browser. If you open your annotation-web-page again, it will continue where you left.
Our KBC-concept recognition pipeline works like this:
So we have 2 annotation phases:
First, we have a NER (Named Entity Recognition) Phase, where we simply search entities (medical terms) in the texts. It is a binary NER model which simply marks sections in the texts, which might be a medical concept.
Each sentence of a report is presented to the annotator one after the other. Each presented sentence is already pre-annotated by a simple NER model.
The annotator has to check the pre-annotated sentence and adjust the yellow highlighted sections, if necessary.
Make sure that all medical terms in the sentence are highlighted correctly at the end. You can also remove or create new highlighted sections using your mouse.
- Do not highlight numbers, but please highlight codes like ICD's or tumor stagings.
- Please highlight each medical term, also the ones which are not present in the KBC or SNOMED or any other terminology.
- prefer longer, combined medical terms over shorter ones.
- e.g.prefer [scarred globally] = one entity over [scarred] [globally] = 2 entities.
- reason: The chance that the KBC-concept-linker finds a fitting KBC concept is higher for longer entities.
These are the buttons of the web interface:
Left button: Accept, right button Undo.
As soon as you have marked all medical terms in the text, click on the green "accept" button. After that, the next sentence is presented to you.
If you want to re-edit previously annotated sentence, click on the undo button to drop the current annotated sentence and go back to the previous one. You can only go back as long as you did not press the save button!
Second, we have a NEL (Named Entity Linking) or EL (Entity Linking) Phase. In this phase, we select one concept from the given terminology (in this case: KBC) for each highlighted medical term/entity.
For each marked text excerpt, the KBC terminology is searched for KBC concepts that could match to it. The KBC concepts that might fit best (according to GPT) are offered in the web interface.
if there is a well-fitting KBC concept, but it is not listed in the web interface, the concept must be entered manually, in the text field below.
If the terminology does not contain any concept that fits to the given text excerpt, click on accept without selecting one of the suggested concepts. (or click on reject, it doesn't really matter?).