Show morphological breakdown structure (inflectional morpheme boundaries first) #397

aarppe · 2020-04-23T22:37:04Z

EDIT (29.3.2022): removed exception for morpheme boundaries in conjunction with hyphens -.

Subtasks (added July 20):

1. implement marking of inflectional morpheme boundaries in FST output (now indicated with < and > marks for inflectional boundaries, and / for derivational boundaries): nôhkom+N+A+D+Px1Pl+Pl --> ni4<ohkom>i2nân>ak --> n<ôhkom>inân>ak
2. implement back-end interpretation and representation of morpheme boundaries, looking up FST morpheme boundaries (which could be done upon importation of dictionary content and the dynamic generation of the paradigm content) and communicating that appropriately to the front-end.
3. implement front-end representation of morpheme boundaries (e.g. with middle-dot). For now, this could even be implemented as not showing anything, until we decide how to best represent morpheme boundaries.
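As a rough illustration of subtask 2, here is a minimal Python sketch of how the back-end might turn the boundary-marked FST output into a list of morphemes and a display string. The function names (`split_morphemes`, `plain_form`, `display_form`) are hypothetical and not part of the existing code base. The only assumption taken from this issue is that `<` and `>` mark inflectional boundaries and `/` marks derivational ones.

```python
import re
from typing import List

# Boundary symbols as described in subtask 1 (assumed to be the only ones used).
INFLECTIONAL_MARKS = "<>"
DERIVATIONAL_MARK = "/"


def split_morphemes(fst_form: str, include_derivational: bool = False) -> List[str]:
    """Split a boundary-marked surface form into its morphemes.

    By default only inflectional boundaries are treated as split points;
    derivational markers are simply dropped from the string.
    """
    marks = INFLECTIONAL_MARKS
    if include_derivational:
        marks += DERIVATIONAL_MARK
    else:
        fst_form = fst_form.replace(DERIVATIONAL_MARK, "")
    pieces = re.split("[" + re.escape(marks) + "]", fst_form)
    # Drop empty pieces caused by adjacent or leading/trailing markers.
    return [piece for piece in pieces if piece]


def plain_form(fst_form: str) -> str:
    """Return the word-form with every boundary marker removed."""
    return "".join(split_morphemes(fst_form, include_derivational=True))


def display_form(fst_form: str, separator: str = "\u00b7") -> str:
    """Return the word-form with inflectional boundaries shown as a middle dot."""
    return separator.join(split_morphemes(fst_form))
```

With the example from subtask 1, `display_form("n<ôhkom>inân>ak")` gives `n·ôhkom·inân·ak` and `plain_form` gives `nôhkominânak`. Hyphens are left untouched, so a form such as `ni<kî-<wâpam>âw>ak` (assuming that is how the FST marks the boundary after a hyphen) would come out as `ni·kî-·wâpam·âw·ak` without any special casing.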

Since our FST already outputs morpheme boundaries (primarily inflectional ones), there would be many circumstances in which it would be advantageous to show those, both in the standardized version of the search string and in the generated inflectional paradigms.

One way to achieve this would be to represent the inflectional morpheme boundaries that the FST outputs as < and > with a middle-dot ·, something like the following:

Ideally, that middle dot (or any other character) would not be copyable, so that when one selects and copies a wordform, one gets only the actual characters.
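One possible way to get a non-copyable separator is to keep the dot out of the text entirely and let CSS draw it as generated content, which browsers generally do not include in copied text. The sketch below assumes the back-end renders the HTML. The class names `wordform` and `morpheme` and the function `render_wordform` are made up for illustration, and the copy behaviour would need to be checked in the browsers we support.

```python
from html import escape
from typing import List

# Stylesheet fragment (hypothetical class names): the middle dot is drawn
# before every morpheme except the first, so it never appears in the markup.
BOUNDARY_CSS = """
.wordform .morpheme + .morpheme::before {
  content: "\\00B7";
}
"""


def render_wordform(morphemes: List[str]) -> str:
    """Wrap each morpheme in its own span; the separator is left to CSS."""
    spans = "".join(
        '<span class="morpheme">' + escape(morpheme) + "</span>"
        for morpheme in morphemes
    )
    return '<span class="wordform">' + spans + "</span>"
```

Because each morpheme gets its own element, the same markup could later carry the colouring, hover, or pop-up ideas discussed in this issue without changing the back-end output.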

Alternatives could be using different colors or shading to differentiate the morphemes, or some visual animation effects such as slight magnification when hovering over individual morphemes. On the other hand, having the morpheme boundaries immediately but non-intrusively available might be the simpler solution - or one might have the morpheme-boundary display as an output setting that can be toggled, similar to the selection of orthography. Also, we might want to keep magnification or pop-ups till later for giving the plain-language definition of each morpheme. Finally, one might provide such a breakdown explicitly when going after the full paradigms.

First, we could implement this for inflectional morpheme boundaries, and later on for derivational boundaries as well.


aarppe · 2020-04-24T08:04:02Z

And here's a draft visualization for providing further information about inflectional morphemes as pop-ups:

aarppe · 2020-04-30T05:23:45Z

@kobexamoh Note - I updated the mock-up visualizations above.
The first (i) refers to the linguistic breakdown of the inflected word form.
The second (i) refers to inflectional subcategory information for the dictionary entry.

These two are different types of information - keeping them separate clarifies things, as well as moving the (Verb/Noun - ...) information next to the dictionary entry rather than the word form.

aarppe · 2020-07-31T19:44:09Z

As discussed in our meeting this last week of July, we might want to have multiple forms of information available for each paradigm layout cell. For instance, the following:

V+TA+Ind+1Sg+2SgO:	kiwâpamitin : (1) surface word-form without morpheme boundaries
			ki<wâpam>iti>n : (2) surface word-form with morpheme boundaries
			kit2<wâpam>i2ti>n : (3) underlying word-form with original morphemes and boundaries
			I see you, I witness you : (4) generated English translation of cell word-form
			4: (5) corpus-frequency
			(6) human recording
			(7) generated robot recording
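To make the list above concrete, here is a hedged sketch of what a per-cell record could look like on the back-end. The class `ParadigmCell` and all of its field names are hypothetical. Only the seven kinds of information come from the list above, and everything beyond the plain word-form is optional because not every cell will have it.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ParadigmCell:
    """One paradigm layout cell; field names are illustrative only."""

    analysis: str                                # e.g. "V+TA+Ind+1Sg+2SgO"
    wordform: str                                # (1) surface form, no boundaries
    segmented: Optional[str] = None              # (2) surface form with boundaries
    underlying: Optional[str] = None             # (3) underlying form with boundaries
    translation: Optional[str] = None            # (4) generated English translation
    corpus_frequency: Optional[int] = None       # (5) corpus frequency
    human_recording: Optional[str] = None        # (6) reference to a human recording
    synthesized_recording: Optional[str] = None  # (7) reference to a generated recording

    def is_consistent(self) -> bool:
        """Check that (2) reduces to (1) once boundary markers are removed."""
        if self.segmented is None:
            return True
        stripped = self.segmented
        for mark in "<>/":
            stripped = stripped.replace(mark, "")
        return stripped == self.wordform
```

A check like `is_consistent` could catch cells where the segmented and plain forms have drifted apart, for instance if they were generated at different times.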

nienna73 · 2022-03-29T17:46:45Z

I found a demo of how this currently works, here's how it looks:

Is this more or less the expected behaviour on the main page?

Here's another with multiple morphemes:

If this is visually what we're going for, then I can work on adding the option to see morpheme boundaries to the settings page, as well as showing morpheme boundaries within paradigm layouts.

aarppe · 2022-03-29T17:55:14Z

@nienna73 Yes, it looks like what we were expecting visually. I think we thought that the middle-dot would be a good way to indicate the morpheme boundaries. I think we might want to show the middle-dot also in conjunction with hyphens (to the right of the hyphen, where the FST outputs the prefix boundary marker <), to indicate that there's a morpheme boundary there as well, i.e. ni·kî-·wâpam·âw·ak (I realize I'm changing my mind from what I had written earlier above). Further inspirations can be found in #505.

nienna73 · 2022-03-29T20:14:52Z

I added morpheme boundaries to the settings:

This is what they look like in the paradigm:

The one place I can't seem to get them to show up is here:

aarppe · 2022-03-29T21:07:23Z

Great progress! Looks good! The reason for the latter case, i.e. wâpamêw, is that the word-form comes out of the lexical database, which is statically defined and doesn't contain morpheme boundaries. We'd have to add those as a separate computational step -- generally not too difficult, but I would not be surprised by edge cases that cause some extra headaches.

aarppe · 2022-05-30T22:50:18Z

Next step of showing information on individual morphemes moved over to #1093, and for individual word-forms in paradigm cells to #1094.

aarppe added the feature and Improvement (Expansion or improvement of a current functionality that does already work and meets previous specs) labels Apr 23, 2020

aarppe changed the title ~~Show morphological breakdown structure (inflectional first)~~ Show morphological breakdown structure (inflectional morpheme boundaries first) Apr 25, 2020

This was referenced Apr 25, 2020

Settings: implement toggling morpheme boundaries #398

Closed

Overview with meta-issues for second milestone version #346

Closed

aarppe added the requires-backend-work (Requires work to Python, scripts, automation, etc.) and requires-frontend-work (Work needs to be done on HTML, CSS, and/or JavaScript) labels Jul 20, 2020

This was referenced May 30, 2022

Show information for individual morphemes in breakdown #1093

Open

Show information about a word-form in pop-ups per each cell in paradigm #1094

Open

aarppe closed this as completed May 30, 2022

fbanados added this to Second release Aug 2, 2024

fbanados moved this to Done in Second release Aug 2, 2024
