Costing for `caseList` and `caseData` #6929

kwxm · 2025-03-09T21:58:35Z

This is an initial attempt to cost the caseList and caseData builtins, but it's turning out to be a little tricky. The problem is that these functions really return a function and a list of arguments for that function and then the evaluator has to carry on and do some more work to turn this into a real application and evaluate it. I used λx.() and λx.λy.() in the benchmarks and assumed that the cost of evaluating these would be minimal, but that may not be true.

Extensive benchmarking shows that both functions are constant time (or at least approximately: see later), but the CPU costs that get inferred from the benchmark results are as follows.

caseList: 2297053
caseData: 4121410

In contrast, the costs for chooseList and chooseData are

chooseList: 132994
chooseData: 94375

(These costs are based on some quite old benchmark results, but I re-ran the benchmarks and got numbers that were pretty similar).

The cost of caseList exceeds that of chooseList by 2164059 units, equivalent to about 135 CEK steps, and the cost of caseData exceeds that of chooseData by 4027035 units, or about 251 CEK steps. If we really use these numbers then it'll probably be cheaper to use the choose functions (and do some extra work) than it is to use the case functions, which kind of defeats the point of adding the builtins.
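The arithmetic behind these figures can be checked with a short sketch. It assumes that one CEK machine step costs 16000 CPU units; that value is not stated in this thread and is an assumption here, chosen because it is consistent with the step counts quoted above.

```python
# Inferred CPU costs quoted above, in cost-model units.
costs = {
    "caseList": 2297053,
    "caseData": 4121410,
    "chooseList": 132994,
    "chooseData": 94375,
}

# Assumption: one CEK machine step costs 16000 CPU units.
CEK_STEP_COST = 16000


def excess(case_name, choose_name):
    """Return (extra units, equivalent CEK steps) of a case builtin
    over the corresponding choose builtin."""
    extra = costs[case_name] - costs[choose_name]
    return extra, extra / CEK_STEP_COST


list_extra, list_steps = excess("caseList", "chooseList")
data_extra, data_steps = excess("caseData", "chooseData")

print(f"caseList - chooseList: {list_extra} units, {list_steps:.1f} steps")
print(f"caseData - chooseData: {data_extra} units, {data_steps:.1f} steps")
```

Under that assumption, a script that replaces a case builtin with the choose builtin plus fewer than that many extra machine steps would come out cheaper, which is the concern raised above.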

Why is this happening? The mean times for the raw benchmark results are

chooseList: 1.101 µs
caseList:      3.255 µs

chooseData: 1.4257 µs
caseData:      5.4173 µs

so the case functions are more expensive than the choose ones, but only by a factor of 3 or 4, whereas the inferred costs for the case functions are 17 and 43 times more expensive than for the choose ones. The reason for this is that we try to account for the time taken for the machine to load the function argument in the benchmarks and just cost the time required for actual execution. There are some Nop functions that do nothing except load some arguments and return a constant result, and we subtract these from the raw benchmark results and fit some modelling function to the adjusted data. The time for the two-argument Nop is 0.838µs and for the six-argument Nop it's 1.331 µs, and these are quite close to the raw times for the choose functions, so the adjusted time is pretty small. However, the Nop times are much smaller than the times for the case functions so the adjusted times remain quite large.
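A rough sketch of why subtracting the Nop time inflates the ratio, using only the mean times quoted above. Pairing the list functions with the two-argument Nop and the data functions with the six-argument Nop is an assumption made for illustration. The real costing code fits a model to adjusted data from many benchmark runs (and the choose costs come from older results), so these ratios are not expected to reproduce the inferred costs exactly.

```python
# Mean raw benchmark times quoted above, in microseconds.
raw = {
    "chooseList": 1.101,
    "caseList": 3.255,
    "chooseData": 1.4257,
    "caseData": 5.4173,
}

# Nop times quoted above, in microseconds.
NOP_2_ARGS = 0.838
NOP_6_ARGS = 1.331

# Assumption: which Nop overhead is subtracted from which builtin.
overhead = {
    "chooseList": NOP_2_ARGS,
    "caseList": NOP_2_ARGS,
    "chooseData": NOP_6_ARGS,
    "caseData": NOP_6_ARGS,
}

# Adjusted time = raw time minus the argument-loading overhead.
adjusted = {name: t - overhead[name] for name, t in raw.items()}

raw_list_ratio = raw["caseList"] / raw["chooseList"]
raw_data_ratio = raw["caseData"] / raw["chooseData"]
adj_list_ratio = adjusted["caseList"] / adjusted["chooseList"]
adj_data_ratio = adjusted["caseData"] / adjusted["chooseData"]

print(f"raw ratios:      list {raw_list_ratio:.2f}, data {raw_data_ratio:.2f}")
print(f"adjusted ratios: list {adj_list_ratio:.2f}, data {adj_data_ratio:.2f}")
```

Because the choose times sit only slightly above the Nop times, the subtraction leaves a small denominator, and the case/choose ratio grows well beyond the raw factor of 3 or 4.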

I think that we really will have to account for the extra work done by the evaluator after the case functions return, and it now occurs to me that maybe subtracting the Nop times twice would help because the Nop builtins are very similar to the functions that I've supplied to the case builtins in the costing benchmarks. I'll try that and see what happens. [UPDATE: I've tried that and it reduces the cost of caseList from 2297053 to 1328893 and the cost of caseData from 4121410 to 2790214, so it's not that effective.]
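To see how far the second subtraction gets, the sketch below compares the updated case costs with the choose costs. It takes the initially inferred costs listed at the top of this comment as the starting point and does nothing beyond arithmetic on the quoted numbers.

```python
# Costs in cost-model units, all quoted in this comment.
choose_cost = {"List": 132994, "Data": 94375}
case_initial = {"List": 2297053, "Data": 4121410}  # initially inferred costs
case_updated = {"List": 1328893, "Data": 2790214}  # after subtracting Nop times twice

saved = {}
remaining_ratio = {}
for kind in ("List", "Data"):
    # Units removed by the second subtraction.
    saved[kind] = case_initial[kind] - case_updated[kind]
    # How many times more expensive the case builtin still is.
    remaining_ratio[kind] = case_updated[kind] / choose_cost[kind]
    print(f"case{kind}: saved {saved[kind]} units, "
          f"still {remaining_ratio[kind]:.1f}x choose{kind}")
```

Even after the second subtraction, the case builtins remain roughly 10 and 30 times more expensive than the corresponding choose builtins.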

It would also be useful to have some realistic benchmarks that make a lot of use of the choose and case builtins so that we can see how the cost model predictions compare to actual execution times.

kwxm · 2025-03-09T22:06:00Z

There's also a slight peculiarity in the benchmark results for caseData. The results for caseList are pretty uniform, but for caseData they look like this:

The benchmark first applies caseData to 30 Constr objects, then 30 Map objects, then the same for List, I, and B. The vertical red lines separate these and it appears that Map uniformly takes longer than the average time and Constr uniformly takes less than the average (and if anything you'd expect it to take longer, since it returns a deferred application with two arguments and all of the rest only have one). The difference is actually too small to worry about, but I'm mildly curious as to why this might happen (and I've observed a similar pattern over multiple benchmark runs). Maybe it's a sign that I got something wrong ...


kwxm added 18 commits March 8, 2025 00:26

Intial costing benchmarks for CaseList

2e184d8

Intial costing benchmarks for CaseData

8060b3e

Remove one level of nesting

ab6fb6b

Order constructors

3d742df

Move CaseList

3c71e88

Preliminay costing code

3042ebc

WIP

2d924dc

WIP

77dceaf

Trying to resolve conflicts deriving from 566a319

7ff1f08

WIP: add stubs to resolve array costing problem

f29ba52

WIP

224aff0

Almost finished

899b489

Almost finished

b837688

Reduce number of samples for ChooseList

84c80fd

Force

6bb0455

Initial cost models

2b416dd

Force term arguments

31a99d8

Update cost models

ae0cc62

kwxm temporarily deployed to github-pages March 9, 2025 21:58 — with GitHub Actions Inactive

kwxm added the No Changelog Required, Builtins, and Costing labels Mar 9, 2025

kwxm requested a review from effectfully March 9, 2025 22:06

kwxm marked this pull request as draft March 9, 2025 22:09

Experiment: remove more overhead

f318136

kwxm commented Mar 11, 2025

plutus-core/cost-model/create-cost-model/CreateBuiltinCostModel.hs (outdated)

Update plutus-core/cost-model/create-cost-model/CreateBuiltinCostMode…

001bd3b

…l.hs

kwxm temporarily deployed to github-pages March 11, 2025 13:54 — with GitHub Actions Inactive

Merge branch 'master' into kwxm/costing/case-list-case-data

c0c4aab

Fix conflict

3a905ff

kwxm temporarily deployed to github-pages March 13, 2025 15:12 — with GitHub Actions Inactive

Slightly more informative errors

7d9bc24

kwxm temporarily deployed to github-pages March 13, 2025 18:30 — with GitHub Actions Inactive

WIP

72e614c

kwxm temporarily deployed to github-pages March 13, 2025 21:06 — with GitHub Actions Inactive

Remove accidentally-committed file

a521963

kwxm temporarily deployed to github-pages March 13, 2025 21:23 — with GitHub Actions Inactive

kwxm added 3 commits March 13, 2025 21:24

WIP

924e27f

Make errors clearer

e682316

Make plutus-ledger-api-test work

ad295b2

kwxm temporarily deployed to github-pages March 13, 2025 22:21 — with GitHub Actions Inactive

kwxm added 2 commits March 16, 2025 16:35

Merge branch 'master' into kwxm/costing/case-list-case-data

0b3a719

Nops for functions returning deferred applications

cdcd659

kwxm temporarily deployed to github-pages March 16, 2025 17:03 — with GitHub Actions Inactive

kwxm added 2 commits March 17, 2025 22:10

WIP

3971e59

Try better benchmarks for nops returning deferred applications

693311e

kwxm temporarily deployed to github-pages March 18, 2025 23:27 — with GitHub Actions Inactive

Fixes

e46cd35

kwxm temporarily deployed to github-pages March 19, 2025 00:40 — with GitHub Actions Inactive

Wrong type in Nop4r

20b8293

kwxm temporarily deployed to github-pages March 19, 2025 01:01 — with GitHub Actions Inactive

More experiments

e767e0d

kwxm temporarily deployed to github-pages March 19, 2025 12:34 — with GitHub Actions Inactive

Back to using last argument

a67849e

kwxm temporarily deployed to github-pages March 19, 2025 15:14 — with GitHub Actions Inactive

Return unit

611f21a

kwxm temporarily deployed to github-pages March 20, 2025 10:45 — with GitHub Actions Inactive

Return unit

be10180

kwxm temporarily deployed to github-pages March 20, 2025 11:59 — with GitHub Actions Inactive
