Skip to content

add observe page evals #332

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Merged
merged 2 commits into from
Dec 23, 2024
Merged

add observe page evals #332

merged 2 commits into from
Dec 23, 2024

Conversation

kamath
Copy link
Member

@kamath kamath commented Dec 23, 2024

evals that build off of #329 on stagehand.page instead of stagehand for observe

Copy link

changeset-bot bot commented Dec 23, 2024

⚠️ No Changeset found

Latest commit: 9f785cf

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

@kamath kamath changed the base branch from main to v2 December 23, 2024 06:23
@kamath kamath changed the base branch from v2 to anirudh/add-extract-page-evals December 23, 2024 06:24
@kamath kamath force-pushed the anirudh/add-observe-page-evals branch from 2525746 to 935cbfe Compare December 23, 2024 06:38
@kamath kamath marked this pull request as ready for review December 23, 2024 06:39
@kamath kamath requested a review from seanmcguire12 December 23, 2024 12:48
@kamath kamath changed the base branch from anirudh/add-extract-page-evals to v2 December 23, 2024 12:48
@kamath kamath force-pushed the anirudh/add-observe-page-evals branch from 455943a to 9f785cf Compare December 23, 2024 12:49
@kamath kamath merged commit 9a3e789 into v2 Dec 23, 2024
9 checks passed
kamath added a commit that referenced this pull request Dec 24, 2024
* Use CI on v2 branch

* branch

* add docs, move scoring functions to scoring.ts, move experiment naming to utils.ts

* add initStagehand.ts

* break up index.evals.ts and utils into smaller files

* export LogLineEval

* typing

* follow StagehandConfig pattern

* choose api key based on model name

* stagehand.act -> page.act (#326)

* need to actually move to act to page now

* move act -> page

* fix e2e

* fix tests

* readme

* changeset

* package json and changeset

* don't fail on combo evals

* Add act evals on `stagehand.page` (#328)

* move act evals to stagehand.page

* add basic act and make act necessary in type

* move extract and observe to page (#329)

* move act evals to stagehand.page

* add basic act and make act necessary in type

* move extract and observe

* example

* changeset

* More playwright tests (#330)

* add docs, move scoring functions to scoring.ts, move experiment naming to utils.ts

* add initStagehand.ts

* break up index.evals.ts and utils into smaller files

* export LogLineEval

* typing

* follow StagehandConfig pattern

* choose api key based on model name

* Use CI on v2 branch

* branch

* stagehand.page tests

* dont run on BB

* prettier

* pls dont fail

* headless

---------

Co-authored-by: Anirudh Kamath <github@kamath.io>

* add extract evals for stagehand.page (#331)

* add extract evals for stagehand.page

* fix typign

* smh i didn't actually run extract

* add observe page evals (#332)

* change stagehand.observe to stagehand.page.observe in evals

* changeset

* Browsercontext playwright tests (#334)

* add docs, move scoring functions to scoring.ts, move experiment naming to utils.ts

* add initStagehand.ts

* break up index.evals.ts and utils into smaller files

* export LogLineEval

* typing

* follow StagehandConfig pattern

* choose api key based on model name

* Use CI on v2 branch

* branch

* BrowserContext tests

* file path

---------

Co-authored-by: Anirudh Kamath <github@kamath.io>

* changeset minor

* ci yml

---------

Co-authored-by: seanmcguire12 <seanmcguire1@outlook.com>
Co-authored-by: Sean McGuire <75873287+seanmcguire12@users.noreply.github.com>
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants