Lotus: Provide a way to lookup messages by EthTxID #1029

Stebalien · 2022-10-26T00:35:33Z

No description provided.

raulk · 2022-12-17T18:06:25Z

This is a blocker for Hyperspace, unfortunately. It's a blocking usability/compatibility issue.

jennijuju · 2022-12-20T19:41:21Z

After implementing this filecoin-project/lotus#9839, we can easily expose an API for it.

raulk · 2023-01-04T18:52:17Z

It might not be very unreasonable to add support for calculating transaction hashes to popular libraries as most of them are user-extensible. For example:

See here for a wrapping Provider solution for ethers.js.
A more simple solution for ethers.js too: https://github.com/ethers-io/ancillary-exchain/blob/11fe26fb6c558c71ec203991e7c835f2c659eaf6/src.ts/formatter.ts#L26

Assuming there are similar solutions for web3.js, Foundry, and other libraries.

@scotthconner committed to reviewing the state-of-the-art libraries and checking if we can plug in our own transaction formatter here: https://filecoinproject.slack.com/archives/CP50PPW2X/p1671741786941649?thread_ts=1671733951.808009&cid=CP50PPW2X

Adding him as an assignee here.

scotthconner · 2023-01-04T20:29:20Z

The root cause is workflows that assume they can counter-factually determine the transaction hash before submission to the network. This acts as a security feature and prevents signers from having to actively trust RPC end-points. In the case for all EVM chains and tools, this is a hash of the transaction inputs, including things like nonce, gas, function selector, call data, etc, but is not a function of the network or blockchain state (something that can only be computed once included in a block).

Web3js, ethers, hardhat plugins, etc, all operate on the ability to check the transaction hash against expectations after submission. Foundry, a huge up-and-comer in EVM development space compared to hard hat - has had discussion on how they want to handle (and avoid) similar workflows and vendor additions here: foundry-rs/foundry#2279

Arbitrum, Optimism, Polygon, Avalanche, and associated test-nets are all fully compatible with Hardhat, Foundry, ethers, web3js, wagmi, WalletConnect, etc. as EVM compatible chains. Their tutorials, examples, corpus of cross-deployed products, and commit contributions to open source toolchains suggest most successful EVM compatible networks use the same tools in the same way.

jennijuju · 2023-01-10T18:11:31Z

@scotthconner any AI needed from rpc api ep perspective from clients?

maciejwitowski · 2023-01-13T12:45:50Z

filecoin-project/lotus#9965

scotthconner · 2023-01-13T13:37:30Z

@scotthconner any AI needed from rpc api ep perspective from clients?

We just need to make sure that gateway owners have this indexing turned on.

maciejwitowski · 2023-01-13T17:09:15Z

ETA Monday

jennijuju · 2023-01-18T08:57:08Z

@geoff-vball can you comment with your PR, how can one lookup message by EthTxID here and close the issue with that comment?

raulk · 2023-01-18T19:00:54Z

I think we have a problem with the consistency of this index. We treat it as a cache (with GC and all), but we should focus on this index being strongly consistent with the chain and the mpool. Otherwise, if an entry drops we will being returning the wrong hash in a number of places.

For example:

On EthGetMessageCidByTransactionHash, we would interpret the supplied transaction hash as a message CID (thinking it's a native message), and would return the CID of a message that doesn't exist, even though in the past we knew it was an Eth tx -- we just forgot. Adding insult to injury, a user calling this API with the same input might get different results before and after a GC.
On eth_getTransactionReceipt, we would fail to return the receipt of a message whose receipt we previously returned (but then GC ran and erased the mapping).
On eth_getBlock, we might return the wrong transaction hashes.

Furthermore, this lookup table must be compulsorily enabled if the Eth endpoint is enabled. It should not be an independent option.

Sadly, it may also be time to phase out native message support from the Ethereum JSON-RPC API. That way, if an expected hash is not found, we can error due to a data consistency issue, instead of speculating that it might be a native message.

In other words, the transaction hash must be deterministically handled and returned.

Finally, because this needs to be strongly consistent, we will need to provide some way to reconcile chain/mpool data with the index.

scotthconner · 2023-01-18T19:12:49Z

For the instances we handle the data in there incorrectly:

The case where "we forgot" is when the eth transaction was never actually included into a block, either because the chain state causes a reversion by the time it tries to be included, the fee was too low, or it was otherwise replaced (like a higher nonce getting included first). So the cases where a bad message CID turns up is when someone is trying to request an eth hash that wasn't actually finalized? Is this correct?
For EVM compatibility, do they handle receipt permanence as part of chain state? Like, its useful to be able to see a transaction that was added to the mem-pool but never actually included?
for getBlock, what is the actual risk that we will return wrong data versus missing data?

I tend to agree that the option should be on by default, and not something that is manually set. I also tend to agree that trying to mix in native message support into the ETH API is likely more trouble than its worth, unless we can identify critical use cases, Deterministic behavior, much like Ethereum itself, is obviously desired.

The question I have is about the trade-offs. The current implementation enables clients to handle ethereum and FEVM transactions in the same manner, using all the same assumptions and toolchains. The current implementation however does not cover cases where transactions are not included into blocks in the same manner that Ethereum does. IMO, it seems worthwhile to gain happy-path parity with ethereum-based developer experience at the expense of failure cases behaving differently when transactions aren't included, as long as the results aren't dangerous or insecure.

geoff-vball · 2023-01-18T19:14:53Z

@raulk The first bullet is not correct. If we don't find the mapping in the database, we'll do the reversible conversion, and then look up that message to see if it exists. Otherwise we return nil.

I don't think the third point is correct either. If we have all the messages in the block, we recalculate the hashes from the messages.

We will never return the wrong hash. At every point we will either say "The correct hash is X" or "not found". This is how nodes work today for the majority of our data. Operators are given the option to store all mappings for as long as they want. I'm not sure why we would force all node operators to disable the entire Eth API if they don't want to store a massive index.

Stebalien · 2023-01-18T19:40:52Z

I think @geoff-vball is correct, there's a middle ground as long as we never blindly convert a tx hash into a CID. We can:

Index all inbound messages in the mpool, and all messages found in blocks.
Provide some form of garbage collection logic that clears out all old mappings after some period of time. At worst, we'll return "not found" to the user.

Some nodes will want to keep an index of every message ever seen, most will likely only care about a few days worth of history.

IMO, whether or not we want to continue to support "native" messages in the Ethereum API is an orthogonal question.

raulk · 2023-01-18T19:44:19Z

@geoff-vball

@raulk The first bullet is not correct. If we don't find the mapping in the database, we'll do the reversible conversion, and then look up that message to see if it exists. Otherwise we return nil.

👍

I don't think the third point is correct either. If we have all the messages in the block, we recalculate the hashes from the messages.

That's right. I checked the implementation and given that we have fetched the message from the store, we process the message to calculate the EthTxHash -- makes sense!

However, I do think the second point stands:

On eth_getTransactionReceipt, we would fail to return the receipt of a message whose receipt we previously returned (but then GC ran and erased the mapping).

And this applies to eth_getTransactionByHash too.

Taking this into account:

Are we OK failing to return transaction and receipt data that we successfully returned in the past?

If the answer is yes, then my concern is resolved as there is no ambiguity possible around transaction IDs. However, I would still advise to turn on the index automatically with the Eth API. Having nodes return different hashes depending on configuration is very confusing.

scotthconner · 2023-01-18T20:12:52Z

Fully aligned @raulk

geoff-vball · 2023-01-18T20:44:31Z

I think the best course of action is to have some top-level config variable that turns on all Eth API functionality including events and the tx hash lookup with no GC. We can then include finer configuration options if users want to turn one off, or set some policy for GC.

With the top level config, we can release new functionality that will work out of the box without users having to turn things on individually. This also saves us from having to set the default of the DB to "on" for all users, which is unnecessary for users that don't want to deal with FEVM.

geoff-vball · 2023-01-18T20:47:06Z

I also think there is currently ambiguity in EthGetTransactionHashByCid that I will fix promptly.

Stebalien added this to the M2.1 milestone Oct 26, 2022

maciejwitowski mentioned this issue Nov 7, 2022

FEVM Development Checklist #936

Closed

85 tasks

Stebalien modified the milestones: M2.1, M2.1 Post code-freeze Nov 8, 2022

Stebalien added P3 P3: Might get resolved Topic: Ethereum JSON-RPC labels Nov 8, 2022

jennijuju added the Lotus label Dec 17, 2022

raulk modified the milestones: M2.1 Post-Testnet, M2.1 (rr10) Carbonado Dec 17, 2022

Stebalien modified the milestones: M2.1 (rr10) Carbonado.1, M2.1 (rr11) Carbonado.2 Dec 20, 2022

Stebalien assigned geoff-vball Dec 20, 2022

snissn mentioned this issue Dec 22, 2022

yarn hardhat deploy fails with Transaction hash mismatch filecoin-project/fevm-hardhat-kit#36

Closed

maciejwitowski removed the P3 P3: Might get resolved label Jan 9, 2023

maciejwitowski modified the milestones: M2.1 (rr11) Carbonado.2, M2.1 (r12) Carbonado.3 Jan 16, 2023

jennijuju closed this as completed Jan 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lotus: Provide a way to lookup messages by EthTxID #1029

Lotus: Provide a way to lookup messages by EthTxID #1029

Stebalien commented Oct 26, 2022

raulk commented Dec 17, 2022

jennijuju commented Dec 20, 2022

raulk commented Jan 4, 2023

scotthconner commented Jan 4, 2023 •

edited

Loading

jennijuju commented Jan 10, 2023

maciejwitowski commented Jan 13, 2023

scotthconner commented Jan 13, 2023

maciejwitowski commented Jan 13, 2023

jennijuju commented Jan 18, 2023

raulk commented Jan 18, 2023 •

edited

Loading

scotthconner commented Jan 18, 2023

geoff-vball commented Jan 18, 2023 •

edited

Loading

Stebalien commented Jan 18, 2023

raulk commented Jan 18, 2023 •

edited

Loading

scotthconner commented Jan 18, 2023

geoff-vball commented Jan 18, 2023

geoff-vball commented Jan 18, 2023

Lotus: Provide a way to lookup messages by EthTxID #1029

Lotus: Provide a way to lookup messages by EthTxID #1029

Comments

Stebalien commented Oct 26, 2022

raulk commented Dec 17, 2022

jennijuju commented Dec 20, 2022

raulk commented Jan 4, 2023

scotthconner commented Jan 4, 2023 • edited Loading

jennijuju commented Jan 10, 2023

maciejwitowski commented Jan 13, 2023

scotthconner commented Jan 13, 2023

maciejwitowski commented Jan 13, 2023

jennijuju commented Jan 18, 2023

raulk commented Jan 18, 2023 • edited Loading

scotthconner commented Jan 18, 2023

geoff-vball commented Jan 18, 2023 • edited Loading

Stebalien commented Jan 18, 2023

raulk commented Jan 18, 2023 • edited Loading

scotthconner commented Jan 18, 2023

geoff-vball commented Jan 18, 2023

geoff-vball commented Jan 18, 2023

scotthconner commented Jan 4, 2023 •

edited

Loading

raulk commented Jan 18, 2023 •

edited

Loading

geoff-vball commented Jan 18, 2023 •

edited

Loading

raulk commented Jan 18, 2023 •

edited

Loading