Sketch how to present observed vs. unobserved forms vs. lacunae + morpheme boundaries + recordings #505

Open
aarppe opened this issue Jul 11, 2020 · 7 comments
Labels
Improvement Expansion or improvement of a current functionality that does already work and meets previous specs

Comments

@aarppe
Contributor

aarppe commented Jul 11, 2020

@kobexamoh Here's a sketch of the various layouts combining 1) observed vs. 2) unobserved forms vs. 3) lacunae + 4) morpheme boundaries + 5) recordings.

image

crk-itwêwina-paradigm-layout-mockups.docx

@aarppe aarppe added the Improvement Expansion or improvement of a current functionality that does already work and meets previous specs label Jul 11, 2020
@aarppe
Contributor Author

aarppe commented Jul 28, 2020

For recordings, we should also be able to distinguish, in the inflectional paradigms, between a) a recording spoken by a person (like: 🧑🏽🔈), and b) a recording generated by a speech synthesizer (like: 🤖🔈)

@aarppe
Contributor Author

aarppe commented Jul 30, 2020

Here's the visualization we discussed today (UPDATE: removed squiggly lines due to English spell-checking):

image

[below with English translation pop-up:]
image

We have robo-speech available in this case for all inflected word-forms, indicated by 🤖🔈, except the few word-forms for which recording(s) spoken by a person exist, indicated by 🧑🏽🔈. Note also that sometimes a word-form might exist in a corpus (i.e. it has been observed), but we do not have a spoken recording of that word, and thus only a robo-snippet can be made available.
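
A minimal sketch of the indicator choice described above, in Python. The function and parameter names are hypothetical and not taken from the itwêwina code base; the only assumption is that each word-form knows whether a human and/or a synthesized recording exists for it.

```python
from typing import Optional

HUMAN_ICON = "🧑🏽🔈"  # recording spoken by a person
SYNTH_ICON = "🤖🔈"   # recording generated by a speech synthesizer


def recording_indicator(has_human: bool, has_synth: bool) -> Optional[str]:
    """Pick the icon for one word-form (hypothetical helper).

    A human recording takes precedence over a synthesized one;
    None means no recording is available for this form.
    """
    if has_human:
        return HUMAN_ICON
    if has_synth:
        return SYNTH_ICON
    return None
```

Whether a form has been observed in a corpus is deliberately not an input here: an observed form without a human recording still only gets the robot icon.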

Note also that I'm using 1) an em-dash to indicate a lacuna (a word-form that doesn't exist for this particular lemma but is possible for other members of its word-class), in contrast to 2) grey color to indicate a cell for which no word-form can exist in this paradigm. Such a grey cell can be seen as an artifact of how the table is organized for this particular word-class: it deliberately and explicitly shows that a form/feature combination is impossible, and may reflect features available for other, similar word-classes but not for this one.
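
The distinction between forms, lacunae, and impossible cells could be modelled roughly as follows. This is only a sketch: the type names, field names, and CSS class names are all made up for illustration and do not come from the existing code.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class CellKind(Enum):
    FORM = "form"              # a word-form exists and can be shown
    LACUNA = "lacuna"          # possible for the word-class, missing for this lemma
    IMPOSSIBLE = "impossible"  # no form can exist in this cell at all


@dataclass(frozen=True)
class Cell:
    kind: CellKind
    text: Optional[str] = None  # only meaningful for CellKind.FORM
    observed: bool = False      # attested in a corpus?


def render_cell(cell: Cell) -> dict:
    """Return display text and a (hypothetical) CSS class for one cell."""
    if cell.kind is CellKind.IMPOSSIBLE:
        return {"text": "", "css_class": "cell--impossible"}  # greyed out
    if cell.kind is CellKind.LACUNA:
        return {"text": "\u2014", "css_class": "cell--lacuna"}  # em-dash
    css_class = "cell--observed" if cell.observed else "cell--unobserved"
    return {"text": cell.text or "", "css_class": css_class}
```

Keeping the lacuna and the impossible cell as separate kinds, rather than both as "empty", is what lets the table show an em-dash for one and a grey cell for the other.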

Since, for reasons of storage space, we might not be able to provide a generated recording for each and every form of every paradigm, there could be a substantial number of lemmas with word-forms that have neither recording indicator. Also, we might implement an option to reduce clutter, where one can choose to see which word-forms have recordings (the above symbols) or not (no symbols, but also no access to the recordings).

Besides all the above, we might even have pop-ups for the various cells, providing a generated English translation (an attempt above for kôhkomak --> your grand-mothers; your respected female elders). Also, we can show the frequency of a word-form in corpora (already imported and included in the underlying data structure, but not shown). And we might also have pop-ups indicating what the various morphemes mean in the word-forms in the paradigms - though where we would be able to squeeze that remains to be seen/explored.
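
As a rough sketch of what such a cell pop-up might collect, assuming the translation, corpus frequency, and morpheme glosses are available per word-form (all names below are hypothetical, and the frequency value in the usage example is a placeholder, not real corpus data):

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class CellDetails:
    wordform: str
    translation: Optional[str] = None   # generated English translation, if any
    corpus_frequency: int = 0           # 0 means the form is unobserved
    morpheme_glosses: List[str] = field(default_factory=list)


def popup_lines(details: CellDetails) -> List[str]:
    """Build the lines of text to show in the pop-up for one cell."""
    lines = [details.wordform]
    if details.translation:
        lines.append(details.translation)
    lines.append(f"Corpus frequency: {details.corpus_frequency}")
    lines.extend(details.morpheme_glosses)
    return lines


example = CellDetails(
    wordform="kôhkomak",
    translation="your grandmothers; your respected female elders",
    corpus_frequency=3,  # placeholder value for illustration only
)
print("\n".join(popup_lines(example)))
```

Where the morpheme glosses would fit visually is still open, as noted above; the sketch just appends them as extra lines.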

@aarppe
Contributor Author

aarppe commented Jul 31, 2020

@kobexamoh In the above comment, I hope to have sketched out all the possible combinations of features we might want to be able to show. As discussed earlier, we might have an option to see, or not to see, word-forms for which we have a human or robot recording - the former resulting in more clutter than the latter, so it's up to user preferences.

@aarppe
Contributor Author

aarppe commented Aug 19, 2020

Some further mock-ups. These represent two strategies: 1) using toggles to add across-the-board information to a basic paradigm; and 2) showing the full extent of available information as pop-ups for individual cells. Both strategies could be deployed at the same time.

a. Initial set-up: show paradigm-wide only whether a word-form has been observed or not

image

b. Also show morpheme boundaries paradigm-wide

image

c. Also show recordings paradigm-wide

image

d. Also show both morpheme boundaries and recordings paradigm-wide

image

image

e. Cell-wise pop-up

image
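
The toggle strategy in mock-ups a. to d. could be sketched like this. Everything here is an assumption for illustration: the option names, the hyphen as boundary marker, and the segmentation of kôhkomak, which is not linguistically verified.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class DisplayOptions:
    show_morpheme_boundaries: bool = False  # mock-ups b. and d.
    show_recordings: bool = False           # mock-ups c. and d.


def format_wordform(
    morphemes: Sequence[str],
    recording_icon: Optional[str],
    options: DisplayOptions,
    boundary: str = "-",
) -> str:
    """Render one paradigm cell according to the paradigm-wide toggles."""
    separator = boundary if options.show_morpheme_boundaries else ""
    text = separator.join(morphemes)
    if options.show_recordings and recording_icon:
        text = f"{text} {recording_icon}"
    return text


# Illustrative segmentation only; not a verified morphological analysis.
segments = ["k", "ôhkom", "ak"]
both = DisplayOptions(show_morpheme_boundaries=True, show_recordings=True)
print(format_wordform(segments, "🤖🔈", DisplayOptions()))  # kôhkomak
print(format_wordform(segments, "🤖🔈", both))              # k-ôhkom-ak 🤖🔈
```

Observed vs. unobserved styling is left out of this sketch, since it is always on in the initial set-up. The cell-wise pop-up in e. would sit alongside these toggles rather than replace them.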

@aarppe
Contributor Author

aarppe commented Mar 26, 2022

@nienna73 When getting to the implementation of showing morpheme boundaries (and other things), the sketches above may be worthwhile to review.

@nienna73
Contributor

nienna73 commented Apr 8, 2022

I added the robot and person emojis. How is this looking so far?

I did choose the "person with curly hair" emoji, but it always comes off a little masculine to me.
Screen Shot 2022-04-08 at 4 00 17 PM

@aarppe
Contributor Author

aarppe commented Apr 8, 2022

Looks nice. Eddie was using the gender-neutral generic person emoji 🧑 (with a darker skin/hair tone). But with actual people speaking, we could probably use the gender-specific emojis, since we should know that information. On a lighter note, we could have speakers choose the emoji they'd prefer, perhaps later on.

Projects
Status: To do

2 participants