
Develop Benchmark User Journey #11

Open
cbfrance opened this issue Sep 5, 2023 · 4 comments
@cbfrance (Contributor) commented Sep 5, 2023

Request from Paul

I only have one research question: it'd be great if we could develop a benchmark user journey with a sequence of questions and the accompanying thought process (perhaps by filling out Decision Workflow.md). cc/

@cbfrance (Contributor, Author) commented Sep 5, 2023

Two phases, perhaps:

  • I think a good first test would be to ask questions of a single PDF, right at the start, almost as part of kickoff (a feature-centric exploration of what the new benchmark is; a qualitative/vibe check).
  • Then, during the "Immersion" phase, we will answer the question more fully: what are they actually doing at the microscope, who do they talk to, what are their outputs?

This second question, which I think is the actual "benchmark user journey" this ticket represents, is likely a deliverable at the end of Immersion, after three weeks of research.

@cbfrance (Contributor, Author) commented Sep 5, 2023

Related to user journey mapping, I think there is also the question of how to do a "SonoEval": we need a list of questions that we can automatically evaluate, where each question has associated acceptance criteria.

So this could be in phases too:

  • An initial evaluation could be just a few questions about the single-PDF scenario (a minimal sketch follows below).
  • A later evaluation set could be more comprehensive, covering more types of supported interactions.
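
The thread doesn't specify a format for these questions, but here is a minimal sketch of what the initial single-PDF evaluation set could look like, assuming a simple phrase-matching acceptance check. The `EvalQuestion` class, the example questions, and the `run_eval` helper are all hypothetical, not part of any existing SonoEval code.

```python
from dataclasses import dataclass, field

@dataclass
class EvalQuestion:
    """One automatically evaluable question paired with its acceptance criteria."""
    question: str
    # Acceptance criteria: phrases the answer must contain. A deliberately
    # simple stand-in for whatever scoring the real eval would use.
    must_mention: list[str] = field(default_factory=list)

    def check(self, answer: str) -> bool:
        """Pass if every required phrase appears in the answer."""
        answer_lower = answer.lower()
        return all(phrase.lower() in answer_lower for phrase in self.must_mention)

# Phase 1: a few questions about the single-PDF scenario (placeholder content).
SINGLE_PDF_EVAL = [
    EvalQuestion(
        question="What is the main finding of this paper?",
        must_mention=["finding"],
    ),
    EvalQuestion(
        question="Which methods does the paper use?",
        must_mention=["method"],
    ),
]

def run_eval(questions: list[EvalQuestion], answer_fn) -> float:
    """Return the fraction of questions whose answers pass their criteria.

    answer_fn is whatever produces an answer string for a question, e.g. a
    call into the system under test.
    """
    passed = sum(q.check(answer_fn(q.question)) for q in questions)
    return passed / len(questions)
```

In practice the acceptance criteria would likely be richer than substring checks (rubrics, model-graded comparisons), but a structure like this keeps each question paired with its criteria so the evaluation can run automatically and grow into the more comprehensive later phase.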

@gutelius (Member) commented Sep 6, 2023

We could also just ask to inspect the final deliverable from the consulting company.

@gutelius (Member) commented Sep 6, 2023

I'm interested in documenting the catalog of questions a team of researchers might potentially ask of any corpus. It's clearly a spectrum. It's also partially dependent on the state of the art today (single researchers, scanning, current IR) versus what might be possible with a new product: e.g. TEAMs of experts actively creating collaborative inquiry experiences together that might themselves create new incremental knowledge artifacts, which could be used as inputs into generative-powered exploration and creation.

I'm less interested in making a slightly more clever IR engine than in looking toward creating something that's fundamentally different from, and better than, generative-powered search. It might be a while until we can fully realize some of this, but I don't want to lose the longer-term disruptive vision we might build towards.
