Issue with passing trace_entity when using ThreadPoolExecutor in lambda #342

RossHammer · 2022-06-20T17:11:23Z

I followed the instructions in the readme for passing the trace entity in a ThreadPoolExecutor, but this does not appear to work properly in a Lambda. The first time a thread does some work, the trace_entity does not stick and the subsegment gets put under the root node, not the one it should be associated with. The next time a thread picks up a job everything seems to operate fine.

I dug into the code a bit and it seems like there is a custom context class for lambdas but it does not override set_trace_entity to work properly with the other changes in the class. To work around the issue I have found calling get_trace_entity before setting sets the internal state up properly.

Example workaround, adapted from the readme example:

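import requests
from aws_xray_sdk.core import xray_recorder

def load_url(url, trace_entity):
    # Workaround: calling get_trace_entity() first sets up this thread's internal state
    xray_recorder.get_trace_entity()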
    xray_recorder.set_trace_entity(trace_entity)

    resp = requests.get(url)

    xray_recorder.clear_trace_entities()
    return resp
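
For context, here is a rough sketch of how the workaround fits into the full fan-out, loosely following the pattern in the readme. The helper names `run_with_entity` and `fan_out` are made up for illustration. `recorder` is meant to be `xray_recorder` (only `get_trace_entity`, `set_trace_entity` and `clear_trace_entities` are called on it), and it is passed in as a parameter so the sketch can be tried with a stand-in object outside of Lambda.

```python
from concurrent.futures import ThreadPoolExecutor


def run_with_entity(recorder, trace_entity, work, *args):
    """Run work(*args) in a worker thread under the given trace entity."""
    recorder.get_trace_entity()  # Workaround: set up this thread's internal state first
    recorder.set_trace_entity(trace_entity)
    try:
        return work(*args)
    finally:
        # Always clear, so a reused pool thread does not keep a stale entity.
        recorder.clear_trace_entities()


def fan_out(recorder, work, items, max_workers=5):
    """Capture the active entity on the calling thread, then hand it to each job."""
    parent = recorder.get_trace_entity()
    with ThreadPoolExecutor(max_workers=max_workers) as executor:
        futures = [
            executor.submit(run_with_entity, recorder, parent, work, item)
            for item in items
        ]
        # Results come back in submission order.
        return [future.result() for future in futures]
```

In a handler this would be called as something like `fan_out(xray_recorder, requests.get, urls)`, with a segment or subsegment already open on the calling thread.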


NathanielRN · 2022-06-28T21:38:25Z

Hey there! I had some follow up questions:

the trace_entity does not stick

Can you explain this more please? Do you mean that when you follow our documentation on using ThreadPoolExecutor, the call to current_entity = xray_recorder.get_trace_entity() returns None, or returns something invalid?

The next time a thread picks up a job everything seems to operate fine.

Hm this is curious. Can you please add screenshots of what you're seeing? Especially before and after screenshots and an explanation of what you expect to see versus what you actually see?

When our documentation says # Get the current active segment or subsegment from the main thread. we expect one to be already open. If you share your code you can show us how you already set the segment that the subsequent line expects to find. Otherwise maybe your segments are missing a trace on the first run?

a custom context class for lambdas but it does not override set_trace_entity to work properly with the other changes in the class.

Which changes are missing to make it compatible with the other changes? Just curious in case you already know. Also please feel free to open a PR to add a fix if you have one and we'll be happy to answer!

To work around the issue I have found calling get_trace_entity before setting sets the internal state up properly.

That's awesome that you have a workaround! Is there a chance you missed the current_entity = xray_recorder.get_trace_entity() line in our documentation example that I linked above?

Perhaps your segment isn't already available when your load_url function executes, or maybe it always shows up late and that's why the first one doesn't work but the subsequent ones do?

The best way to help you would be to have you share your code, so we can see when the trace that get_trace_entity() fetches is created and confirm whether it will be available by the time this function is called.

Please let me know if you have any questions!

RossHammer · 2022-06-28T23:44:45Z

I can try to get a sample together in the next week or so, but it should be doable with what is posted in the readme. In the console the first segments recorded by a thread in the pool show up under the invoke of the lambda not the segment from the trace entity if it should be recording under another subsegment.

It looks like the issue is coming from here. It seems like the segment is not set in the local context and the set_trace_entity function does not set it. In turn, when recording the next segment it does not have the proper parent element. Calling get_trace_entity before setting it fixes everything because the segment will get set.
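
To make that sequence concrete, here is a small stand-alone model of what I think is happening. It is not the SDK's code: `ModelLambdaContext`, `parent_seen_by_job` and `run_two_jobs` are hypothetical names, the strings stand in for real segments, and it assumes that the first context refresh on a thread both creates the root segment and resets that thread's entity list.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class ModelLambdaContext:
    """Toy stand-in for the Lambda context; NOT the SDK's implementation."""

    def __init__(self):
        self._local = threading.local()

    def _refresh_context(self):
        # Assumption: the first refresh on a thread creates the root (facade)
        # segment and resets that thread's entity list.
        if getattr(self._local, "segment", None) is None:
            self._local.segment = "facade-segment"
            self._local.entities = []

    def get_trace_entity(self):
        self._refresh_context()
        entities = self._local.entities
        return entities[-1] if entities else self._local.segment

    def set_trace_entity(self, trace_entity):
        # No refresh here, so `segment` may still be unset on this thread.
        self._local.entities = [trace_entity]

    def clear_trace_entities(self):
        self._local.entities = []


def parent_seen_by_job(context, trace_entity, use_workaround):
    """Return the entity a new subsegment would be attached to."""
    if use_workaround:
        context.get_trace_entity()
    context.set_trace_entity(trace_entity)
    parent = context.get_trace_entity()  # the lookup made when recording a subsegment
    context.clear_trace_entities()
    return parent


def run_two_jobs(use_workaround):
    context = ModelLambdaContext()
    # One worker, so the second job reuses the thread that ran the first.
    with ThreadPoolExecutor(max_workers=1) as executor:
        return [
            executor.submit(
                parent_seen_by_job, context, "my-subsegment", use_workaround
            ).result()
            for _ in range(2)
        ]


print(run_two_jobs(use_workaround=False))  # ['facade-segment', 'my-subsegment']
print(run_two_jobs(use_workaround=True))   # ['my-subsegment', 'my-subsegment']
```

Without the extra call, the first job on a fresh thread loses the entity it was given, and the second job on the same thread keeps it, which matches what shows up in the console.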

NathanielRN · 2022-06-29T00:02:35Z

Thanks for your deep dive!

In the console the first segments recorded by a thread in the pool show up under the invoke of the lambda not the segment from the trace entity if it should be recording under another subsegment.

Why does it work on subsequent calls then? Is it because set_trace_entity calls setattr(self._local, 'entities', [trace_entity]) and so the subsegments fall under each other synchronously?

This is where a picture of what you see versus what you expect would be useful to clarify, but I think this is what you mean and would explain why it works on the 2nd time onward.

It seems like the segment is not set in the local context and the set_trace_entity function does not set it.

So are you saying we could fix the issue by overriding the set_trace_entity method in lambda_launcher.py so that it also calls _refresh_context, like get_trace_entity does?

def set_trace_entity(self, trace_entity):
    """
    Refresh the context before setting the trace entity so
    that the correct parent is set.
    """
    self._refresh_context()
    super().set_trace_entity(trace_entity)

That case would make sense to me, and we could move to a PR for this?
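
Before a PR, the idea could be sanity-checked with a toy model along these lines. This is only a sketch and not the SDK's code: `BaseContext`, `PatchedLambdaContext` and `first_lookup_in_new_thread` are hypothetical names, strings stand in for segments, and it assumes the first refresh on a thread installs the root segment and resets the entity list.

```python
import threading


class BaseContext:
    """Toy stand-in for the base context; NOT the SDK's implementation."""

    def __init__(self):
        self._local = threading.local()

    def set_trace_entity(self, trace_entity):
        self._local.entities = [trace_entity]


class PatchedLambdaContext(BaseContext):
    def _refresh_context(self):
        # Assumption: first use on a thread installs the root (facade)
        # segment and resets that thread's entity list.
        if getattr(self._local, "segment", None) is None:
            self._local.segment = "facade-segment"
            self._local.entities = []

    def get_trace_entity(self):
        self._refresh_context()
        entities = self._local.entities
        return entities[-1] if entities else self._local.segment

    def set_trace_entity(self, trace_entity):
        self._refresh_context()  # the proposed change
        super().set_trace_entity(trace_entity)


def first_lookup_in_new_thread(context, trace_entity):
    """Set the entity on a brand-new thread and report what a lookup returns."""
    seen = []

    def job():
        context.set_trace_entity(trace_entity)
        seen.append(context.get_trace_entity())

    worker = threading.Thread(target=job)
    worker.start()
    worker.join()
    return seen[0]
```

With the refresh in place, `first_lookup_in_new_thread(PatchedLambdaContext(), "my-subsegment")` should give back the entity that was passed in, rather than the root segment, even on the thread's very first job.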

By the way, we are also recommending that users try the OpenTelemetry Python SDK. It has tons of features and I found an example with ThreadPoolExecutor to ensure spans are parented correctly. You'll find lots of support there and AWS announced GA support for its traces 1 year ago.

RossHammer · 2022-08-02T17:39:14Z

We are using this library because it is what AWS Lambda Powertools uses under the covers. The only spot where we use this library directly is to make things work properly in the thread pool.
