Skip to content
This repository has been archived by the owner on Oct 12, 2023. It is now read-only.

Is bing spell check for integration with luis helpful with the dispatch pattern and is it useful or harmful to entity assurance? #23

Open
xtianus79 opened this issue Sep 20, 2019 · 1 comment

Comments

@xtianus79
Copy link

An article here gave some caution to using the Bing spellchecker api for washing all of your user utterances into your bot channel.

Link to article here

Point being the entity could become unravled when the spellchecker is turned on? So if you have an obscure word style entity starblazingIT this might be reconstructed to star blazing IT. Thus harming the entity.

With that said the article was written in 2016 so perhaps this concern has been addressed but I can't find any documentation on it.

The documentation I did find on it gave an outlet for ignoring spelling mistakes and simply label utterances and the LUIS would learn proper spelling as well as typos... That is a loaded statement. What does that mean exactly and is luis really learning proper spelling or just adding the bad spellings in there as an alternative answer?

Because if that is the case then one would have to watch misspelled utterances and add in variations all of the time. Or would Luis overtime training learn that a particular word can have variations and be misspelled and to add that in with the correct intent model of utterances?

Here is a link to the article and quote below:

Ignore spelling mistakes
If you don't want to use the Bing Spell Check API v7 service, you can label utterances that have spelling mistakes so that LUIS can learn proper spelling as well as typos. This option requires more labeling effort than using a spell checker.

Overall what is the best practice of word misspelling correction for your luis chatbot application?

@diberry
Copy link
Contributor

diberry commented May 14, 2020

@aliciaep Can you answer this one?

# for free to subscribe to this conversation on GitHub. Already have an account? #.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants