[Proposal] Differentiate cold and hot contracts #97

maxzaver · 2020-07-20T21:00:28Z

Contract compilation is expensive. We have introduced caching for compilation, but unfortunately we currently cannot have different fees for contracts that are in the cache versus that are not. This means that contract calls are priced based on the worst case scenario -- when every call leads to a compilation. Unfortunately, we cannot predict when contract will be compiled or not, because different nodes that implement the protocol can have different cache settings. However, we can enforce it:

We can require that each node for each shards keep track of the top-200 invoked contracts (by recency) in the 1-epoch long moving window;
Contracts in top-200 list are considered to be "hot" and when they are invoked we do not apply contract_compile_base and contract_compile_bytes fees, https://github.com/nearprotocol/nearcore/blob/master/neard/res/genesis_config.json#L137 . Requiring the node to store the compiled contract in cache.

We would need to introduce 2 parameters for the runtime config:

The size of the hotness list;
The size of the moving window.

We would need to store the list of the 200 hottest contracts in the trie the way we store delayed receipts. Note, they don't need to be ordered, we just need to store the entries: (code hash, number of times the code was called in the moving window).

The text was updated successfully, but these errors were encountered:

evgenykuzyakov · 2020-07-20T21:34:38Z

On the side note, the idea to target 1 second of compute may not be correct. There should be extra time to propagate receipts and execution outcome before starting the next chunk of compute. If we target 1 second block time, then it has to be taken into consideration.

maxzaver · 2020-07-20T21:36:56Z

@evgenykuzyakov Good point. However, we are implicitly targeting 1/2 second because blocks that are more than 1/2 full will lead to a steady gas price increase.

maxzaver · 2020-07-20T21:47:14Z

For the record, here are the formulas to compute our current TPS.

Transfer TPS:
min(gas_limit /2/(action_receipt_creation_config. send_not_sir + transfer_cost.send_no_sir), gas_limit /2/(action_receipt_creation_config. execution + transfer_cost. execution))
Which is 2.2k TPS as of 2020-07-20

Contract call TPS for 300KiB contracts:
min(gas_limit /2/(action_receipt_creation_config. send_not_sir + function_call_cost.send_no_sir), gas_limit /2/(action_receipt_creation_config. execution + function_call_cost. execution + contract_compile_base+ contract_compile_bytes*300*1024))
Which is 200 TPS as of 2020-07-20

evgenykuzyakov · 2020-08-13T19:35:48Z

We would need to store the list of the 200 hottest contracts in the trie the way we store delayed receipts. Note, they don't need to be ordered, we just need to store the entries: (code hash, number of times the code was called in the moving window).

We don't need hottest contracts in the trie, but we need all contracts calls ordered by block-number in a queue. Then we can order them in memory using BTreeSet<(num_calls, hash)> + HashMap<hash, num_calls>. The first for ordering, the second is for lookup and updates.

Once a node syncs the trie for a shard, it has to parse the moving window and reconstruct the cache of the contracts. Once the cache is constructed, the node has to pre-compile contracts to avoid delaying blocks.

@bowenwang1996 Is there a callback when the sync is complete to finalize in-memory operations or do we update them on the fly? If we update them on the fly, then the node has to compile contracts on the fly from the moving window.

bowenwang1996 · 2020-08-13T19:58:33Z

Is there a callback when the sync is complete to finalize in-memory operations

There is no such thing. We can do this operation when we finalize state sync. How do you want to store the cache information in state?

evgenykuzyakov · 2020-08-13T20:15:28Z

We need to have a history of successful calls in a trie similar to delayed receipts for a tracking window.

evgenykuzyakov · 2020-08-13T21:40:55Z

To simplify everything we can store a singleton key-value record that keeps Vec<CryptoHash> in LRU order without duplicates. At the beginning of the block you read it, at the end of the block you commit it.

No need for state sync changes. The in-memory cache doesn't need changes, but it has to be at least the size of the persistent cache.

The persistent cache will only be used for charging compilation fees

evgenykuzyakov · 2020-08-13T22:20:54Z

@bowenwang1996 pointed out that it's too easily abusable by having 200 smaller contract and calling/compiling them.

Another suggestion is to create a time-based cache (based on block height), but don't have expiration for the previous calls. Instead let them decay by half every epoch.

A simple version is to increase weight per contract hash call by

coef**block_height

Now we need to maintain top200 based on weight.

The issue is smaller contracts (143 bytes) can kick out contracts from the top200 much cheaper than putting 300Kb contracts into it. Which makes it abusable.

We can switch top200 cache into 128Mb weighted cache based on the input contract size. But this requires us to properly maintain this cache in the store

evgenykuzyakov · 2020-08-14T21:04:21Z

@bowenwang1996 and @mikhailOK suggested another option.
Before we thought the compilation was fast relative to disk read/write we relied on in-memory cache. But looking at our contract sizes and the time it takes for a single-pass to compile a contract, we should consider an alternative to always keep the compiled version locally. Instead of dropping it from the in-memory cache we can rely on the disk cache to have pre-compiled version.

We can do this at deploy time and increase the cost of deploy operation. It will be a one-off event and will not affect future function calls. Function calls will assume the contract is already pre-compiled and pre-processed. So the only extra cost is to read the cached version from disk. This assumes you've tracked the shard node from the beginning of times, but obviously it might not be the case. The disk cache can be shared cross-shards, so if you tracked a shard then when it splits you still have all of them pre-compiled. But when you sync to a shard, you have to start pre-compiling all contracts that you don't have or try to do this on demand.

Pros:

Base for a function call will be dirty cheap. On the order of a regular transfer.
No complicated persistent-cache logic on the protocol level.
No DevX effects, contracts can still do cheap cross-contract calls without worrying about the cold cache compilation fee.
Fairly simple to implement with the existing crates and solutions.
Allows to use LLVM for contracts calls without protocol changes (so long as node does it in parallel).

Caveats:

Extra disk cost for contract cache. When a contract is deleted, the cache may not be deleted. There are no protocol for on-disk cache, so the node can be implemented either way. The protocol just say, that you better have it ready to avoid delaying the block.
Potential disruptions and new vectors of attack due to cold cache after node shard sync.

bowenwang1996 · 2020-08-15T15:30:29Z

Potential disruptions and new vectors of attack due to cold cache after node shard sync.

I suggest that we not consider state sync done until the contracts are compiled to avoid potential cold cache attacks. We can spawn several thread to parallelize the process. In fact I don't think this is a concern for validators because they start catching up in the previous epoch and I think one epoch is for sure enough time for them to compile all the contracts.

evgenykuzyakov · 2020-08-15T16:26:00Z

We can spawn several thread to parallelize the process. In fact I don't think this is a concern for validators because they start catching up in the previous epoch and I think one epoch is for sure enough time for them to compile all the contracts.

But it means you have to inspect all accounts and extract code that you need to prepare and compile. Some code compilation will fail, but we still need to cache the result.

bowenwang1996 · 2020-08-15T16:29:51Z

But it means you have to inspect all accounts and extract code that you need to prepare and compile

We can store the hashes of contracts in state so that it is easier to look them up.

Some code compilation will fail, but we still need to cache the result.

Do you mean that people maliciously submit binary that cannot be compiled? If so why do we need to cache the result?

evgenykuzyakov · 2020-08-15T16:34:03Z

We can store the hashes of contracts in state so that it is easier to look them up.

We already have it on every account. Otherwise we have to do ref-counting per contract hash, but it's complicated during resharding.

Do you mean that people maliciously submit binary that cannot be compiled?

Yes, you need to remember the attempt, so you don't retry

ilblackdragon · 2020-08-17T05:24:50Z

What is the current speed difference between the best WASM interpreter and executing compiled code?

Also can we save compiled code somewhere on disk?
What are the difference in time loading from disk compiled code vs loading WASM + compiling?

Ideally we should compile on deployment and charging gas for that on deployment time (during state sync would also need to recompile, but there is time for that) and store already compiled code in a separate storage location.

SkidanovAlex · 2020-08-17T05:27:33Z

What are the difference in time loading from disk compiled code vs loading WASM + compiling?

WASM is also loaded from disk. We can measure precisely, but generally the speed of reading from a random location doesn't depend that much on the side that is being read.

Also can we saved compiled code somewhere on the disk?

That is the current plan I believe.

ilblackdragon · 2020-08-17T05:28:17Z

That is the current plan I believe.

Not based on the proposal outlined in this issue, as far as I understand

SkidanovAlex · 2020-08-17T05:33:54Z

#97 (comment)

Before we thought the compilation was fast relative to disk read/write we relied on in-memory cache. But looking at our contract sizes and the time it takes for a single-pass to compile a contract, we should consider an alternative to always keep the compiled version locally. Instead of dropping it from the in-memory cache we can rely on the disk cache to have pre-compiled version.

maxzaver · 2020-08-17T18:30:33Z

FYI @bowenwang1996 and @mikhailOK 's proposal is still a protocol-level change that will also affect Applayer, because now contract calls cannot return preparation/compilation errors.

maxzaver · 2020-08-17T21:06:17Z

Discussed it with @evgenykuzyakov . I agree that the modified proposals would work. I did quick computation. Compiling 100 200KiB contracts takes approximately 8 seconds.

@evgenykuzyakov also has a good idea how to retrofit it with error messages without breaking our protocol too much.

maxzaver assigned ilblackdragon, SkidanovAlex, olonho, willemneal and evgenykuzyakov Jul 20, 2020

maxzaver unassigned ilblackdragon, SkidanovAlex, evgenykuzyakov, willemneal and olonho Jul 20, 2020

maxzaver changed the title ~~[Proposal] Differentiate cold and hot contracts~~ [Deprioritized. Proposal] Differentiate cold and hot contracts Jul 20, 2020

maxzaver mentioned this issue Aug 13, 2020

Fix contract call fee issue near/nearcore#3158

Closed

maxzaver changed the title ~~[Deprioritized. Proposal] Differentiate cold and hot contracts~~ [Proposal] Differentiate cold and hot contracts Aug 13, 2020

maxzaver assigned evgenykuzyakov Aug 13, 2020

evgenykuzyakov mentioned this issue Aug 14, 2020

Disable refunds and burn all prepaid gas #107

Open

thisisjoshford added this to DevRel Nov 22, 2024

github-project-automation bot moved this to NEW❗ in DevRel Nov 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Proposal] Differentiate cold and hot contracts #97

[Proposal] Differentiate cold and hot contracts #97

maxzaver commented Jul 20, 2020 •

edited

Loading

evgenykuzyakov commented Jul 20, 2020

maxzaver commented Jul 20, 2020

maxzaver commented Jul 20, 2020

evgenykuzyakov commented Aug 13, 2020

bowenwang1996 commented Aug 13, 2020

evgenykuzyakov commented Aug 13, 2020

evgenykuzyakov commented Aug 13, 2020

evgenykuzyakov commented Aug 13, 2020

evgenykuzyakov commented Aug 14, 2020

bowenwang1996 commented Aug 15, 2020

evgenykuzyakov commented Aug 15, 2020

bowenwang1996 commented Aug 15, 2020

evgenykuzyakov commented Aug 15, 2020 •

edited

Loading

ilblackdragon commented Aug 17, 2020 •

edited

Loading

SkidanovAlex commented Aug 17, 2020

ilblackdragon commented Aug 17, 2020 •

edited

Loading

SkidanovAlex commented Aug 17, 2020

maxzaver commented Aug 17, 2020

maxzaver commented Aug 17, 2020 •

edited

Loading

[Proposal] Differentiate cold and hot contracts #97

[Proposal] Differentiate cold and hot contracts #97

Comments

maxzaver commented Jul 20, 2020 • edited Loading

evgenykuzyakov commented Jul 20, 2020

maxzaver commented Jul 20, 2020

maxzaver commented Jul 20, 2020

evgenykuzyakov commented Aug 13, 2020

bowenwang1996 commented Aug 13, 2020

evgenykuzyakov commented Aug 13, 2020

evgenykuzyakov commented Aug 13, 2020

evgenykuzyakov commented Aug 13, 2020

evgenykuzyakov commented Aug 14, 2020

bowenwang1996 commented Aug 15, 2020

evgenykuzyakov commented Aug 15, 2020

bowenwang1996 commented Aug 15, 2020

evgenykuzyakov commented Aug 15, 2020 • edited Loading

ilblackdragon commented Aug 17, 2020 • edited Loading

SkidanovAlex commented Aug 17, 2020

ilblackdragon commented Aug 17, 2020 • edited Loading

SkidanovAlex commented Aug 17, 2020

maxzaver commented Aug 17, 2020

maxzaver commented Aug 17, 2020 • edited Loading

maxzaver commented Jul 20, 2020 •

edited

Loading

evgenykuzyakov commented Aug 15, 2020 •

edited

Loading

ilblackdragon commented Aug 17, 2020 •

edited

Loading

ilblackdragon commented Aug 17, 2020 •

edited

Loading

maxzaver commented Aug 17, 2020 •

edited

Loading