Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

test Google search for document retrieval #248

Open
alexissmirnov opened this issue Mar 22, 2020 · 1 comment
Open

test Google search for document retrieval #248

alexissmirnov opened this issue Mar 22, 2020 · 1 comment

Comments

@alexissmirnov
Copy link
Contributor

No description provided.

@breuleux
Copy link

I've compiled some results using the custom search Sacha made, applied on the thousand or so questions we have:

I've associated each question to the number of results, the snippet given by Google, an estimate of the relevant sections (I searched for each snippet in the scraped data), and the link. These columns are associated to the first result given by Google (I see little point looking at the others).

The results are quite mediocre. It answers the French question "Quels sont les symptômes" properly, but the snippet given also contains "Quels sont les services d'enseignement visés par les fermetures?" and there's no way to automatically prioritize the correct section.

It does not answer "What are the symptoms" in English properly, however. It provides this link which lists symptoms related to, stress, anxiety and depression. That's not what the user expects, obviously. Elasticsearch and other solutions will probably have the same problem if this section is included in the search.

The code is messy and contains an API key, but I can try to clean it up and push it if necessary.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Development

No branches or pull requests

2 participants