Add record_exceptions options to with_span #622

albertored · 2023-09-01T10:40:50Z

Fixes #236

The Python API has been taken as a reference.

There is still something that does not fully convince me.

Elixir exceptions are recorded with exception.type equal to the name of the exception struct (see test).

Elixir maps standard Erlang errors to exception structs. So when Elixir code inside a trace raises with :erlang.error(type), the exception is recorded with the Erlang type (e.g. error:badarg), but the user gets an ArgumentError (see test).
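To make the mismatch concrete, here is a minimal sketch in plain Elixir, with no OpenTelemetry involved. It only shows the standard language behavior: `rescue` returns the normalized struct, while `catch` sees the raw Erlang class and reason. The variable names are illustrative and do not come from the PR.

```elixir
# Raise a raw Erlang error and rescue it the Elixir way.
rescued =
  try do
    :erlang.error(:badarg)
  rescue
    e -> e
  end

# `rescue` hands back the normalized Elixir exception struct.
IO.inspect(rescued.__struct__)

# `catch` sees the raw Erlang class and reason instead.
caught =
  try do
    :erlang.error(:badarg)
  catch
    kind, reason -> {kind, reason}
  end

IO.inspect(caught)
```

Code that records the exception from the Erlang side sees the second form, which is why the recorded type and the exception the Elixir user observes can differ.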

codecov · 2023-09-01T10:42:59Z

Codecov Report

Attention: 14 lines in your changes are missing coverage. Please review.

| Files | Coverage Δ |
|---|---|
| apps/opentelemetry/src/otel_tracer_default.erl | `94.73% <90.00%> (-5.27%)` ⬇️ |
| apps/opentelemetry_api/src/otel_tracer.erl | `69.56% <50.00%> (ø)` |
| apps/opentelemetry_api/lib/open_telemetry/span.ex | `22.72% <0.00%> (-2.28%)` ⬇️ |
| apps/opentelemetry_api/src/otel_span.erl | `72.63% <78.78%> (-0.23%)` ⬇️ |

... and 1 file with indirect coverage changes


tsloughter · 2023-09-01T11:03:51Z

apps/opentelemetry_api/src/otel_span.erl

 record_exception(_, _, _, _, _) ->
    false.

+exception_type(error, #{'__exception__' := true, '__struct__' := ElixirErrorStruct}) ->


Hm, is there any danger in doing this, as opposed to creating a new with_span in the Elixir macro rather than calling Erlang's with_span function?

I like to keep just one with_span function and hadn't considered we could do something like this to handle the exception case.

Replicating the whole with_span macro on the Elixir side is the fallback option, but I wanted to explore this solution first, and I am quite satisfied with the result.

This piece of code fails if someone does

erlang:error(#{'__exception__' => true, '__struct__' => foo}).

but that seems very unlikely. We can guard against such cases anyway and fall back to the normal type.

Also, the call to Elixir.Exception.message() should be modified so that it cannot fail under any circumstances.
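One way such a guard could look, written in Elixir for brevity even though the real check would live in the Erlang SDK. This is only a sketch: `ExceptionTypeSketch` and `classify/1` are hypothetical names, not code from the PR. The idea is to treat a reason as an Elixir exception only when its `__struct__` names a loadable module that exports `message/1`, and otherwise to fall back to the plain Erlang handling.

```elixir
defmodule ExceptionTypeSketch do
  # Hypothetical helper, not part of the PR.
  # Returns {:elixir, module} only for maps that look like real Elixir
  # exceptions; anything else is treated as a plain Erlang reason.
  def classify(%{__exception__: true, __struct__: mod}) when is_atom(mod) do
    if Code.ensure_loaded?(mod) and function_exported?(mod, :message, 1) do
      {:elixir, mod}
    else
      :erlang
    end
  end

  def classify(_reason), do: :erlang
end
```

With this shape, a hand-built map such as `%{__exception__: true, __struct__: :foo}` is classified as `:erlang` because `:foo` is not a loadable module, so the caller never attempts to call `message/1` on it.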

I was thinking about relying on the internal structure of Elixir's structs and exceptions. They are probably unlikely to change, but that was my initial hesitation.

Ah ok, I think that structure is not internal: Exception.t() is not an opaque type.

But we could ask someone more involved with Elixir development; I don't know whether there is a contributor here who can answer.

tsloughter · 2023-09-30T14:59:45Z

Sorry for the delay. I'm not sure what to do about badarg vs ArgumentError either. It's probably best if it could be ArgumentError. Is that possible to do?

albertored · 2023-10-02T07:08:18Z

The only ways I can think of are:

- duplicating the implementation of the feature on the Elixir side
- somehow adding metadata when using the Elixir version of with_span so that the Erlang code can handle this situation

I'm usually against code duplication, but in this case I'm not sure the complexity needed to avoid it is worth it.

tsloughter · 2023-10-02T09:17:20Z

The problem with duplication is that the logic is in the SDK, and there is only the Erlang SDK. We'd have to move the logic into the API to add it to Elixir.

Hm, couldn't this just convert Erlang atoms like badarg to ArgumentError?

albertored · 2023-10-02T09:46:31Z

The problem is that in both these cases

 %% ERROR
    ?assertException(error, badarg, otel_tracer:with_span(Tracer, <<"span-error">>, #{record_exception => true},
                                               fun(_SpanCtx) ->
                                                erlang:error(badarg)
                                               end)),

    receive
        {span, SpanError} ->
            ?assertEqual(<<"span-error">>, SpanError#span.name),
            ?assertEqual(undefined, SpanError#span.status),
            [#event{name=exception, attributes=A}] = otel_events:list(SpanError#span.events),
            ?assertMatch(#{'exception.type' := <<"error:badarg">>, 'exception.stacktrace' := _}, otel_attributes:map(A))

    after
        1000 ->
            ct:fail(timeout)
    end,

and

assert_raise ArgumentError, fn ->
        Tracer.with_span "span-1", record_exception: true do
          :erlang.error(:badarg)
        end
      end

      assert_receive {:span,
                      span(
                        name: "span-1",
                        events: {:events, _, _, _, _, [event]},
                        status: :undefined
                      )}

      assert event(name: :exception, attributes: {:attributes, _, _, _, received_attirbutes}) =
               event

      assert %{
               "exception.type": "error:badarg",
               "exception.stacktrace": _
             } = received_attirbutes

the with_span function receives an error:badarg and needs to know whether to keep it as is in exception.type or convert it to ArgumentError.

tsloughter · 2023-10-19T10:54:11Z

Yea, good point. I don't know what to do here. Maybe it has to stay as badarg.

@bryannaegele any thoughts or should we merge this?

bryannaegele · 2023-11-04T18:49:57Z

I don't know if it maps 1:1 to this case, but Phoenix normalizes Erlang exceptions into something Elixir can handle.

https://github.com/open-telemetry/opentelemetry-erlang-contrib/blob/main/instrumentation/opentelemetry_phoenix/lib/opentelemetry_phoenix/reason.ex

albertored · 2023-11-04T21:43:04Z

There is something similar in Elixir core as well, where Erlang exceptions are translated to Elixir ones.

This is exactly what is causing problems here: the logic for recording the exception is in the SDK (Erlang side), and we need to know whether with_span is called from Erlang or Elixir in order to decide whether the exception should be translated to Elixir before being recorded.

The only way I can think of to do so is passing an argument to the with_span macro.

albertored · 2023-11-05T15:22:39Z

I may have found a solution.

I take the first element of the stacktrace and use it to determine whether the exception was raised from Erlang or Elixir, so that I can decide whether the exception needs to be translated.

This also simplifies the custom record_exception that was defined in the Elixir API, because the SDK can now handle it without customizations on top. One change here is that exception.type for Elixir exceptions now lacks the Elixir. prefix; if that is a breaking change I can add it back.
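The comment does not spell out how the first stacktrace entry is inspected, so the following is only a sketch of one plausible check, not the PR's implementation. It assumes the decision rests on the module name of the first frame: Elixir modules are atoms whose string form starts with `Elixir.`, while Erlang modules are plain atoms. `StacktraceOriginSketch` and `origin/1` are hypothetical names.

```elixir
defmodule StacktraceOriginSketch do
  # Hypothetical helper, not from the PR: guesses where an exception was
  # raised by looking at the module of the first stacktrace frame.
  def origin([{mod, _fun, _arity_or_args, _location} | _rest]) when is_atom(mod) do
    if String.starts_with?(Atom.to_string(mod), "Elixir.") do
      :elixir
    else
      :erlang
    end
  end

  # Empty or unexpected stacktraces fall back to the Erlang handling.
  def origin(_stacktrace), do: :erlang
end
```

A frame such as `{MyApp.Worker, :run, 0, []}` would yield `:elixir`, and `{:lists, :nth, 2, []}` would yield `:erlang`. A heuristic of this kind can misclassify an Erlang error raised by an Erlang function that was called directly from Elixir code, since the first frame then belongs to the Erlang module.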

bforchhammer · 2024-03-11T16:45:05Z

test/otel_tests.exs

+                        status: status(code: :error)
+                      )}
+
+      assert event(name: :exception, attributes: {:attributes, _, _, _, received_attirbutes}) =


Minor spelling mistake: /s/received_attirbutes/received_attributes

albertored requested a review from a team September 1, 2023 10:40

github-actions bot added language-erlang scope-api scope-sdk labels Sep 1, 2023

tsloughter reviewed Sep 1, 2023


albertored force-pushed the record_exception branch 3 times, most recently from 0fb52b6 to a72c344 Compare September 1, 2023 13:33

Add record_exceptions options to with_span

826a5ce

albertored changed the title ~~Add record_exceptions options to with_span~~ Draft: Add record_exceptions options to with_span Nov 5, 2023

albertored changed the title ~~Draft: Add record_exceptions options to with_span~~ Add record_exceptions options to with_span Nov 5, 2023

albertored marked this pull request as draft November 5, 2023 13:59

improvements

7c01f87

albertored force-pushed the record_exception branch from a72c344 to 7c01f87 Compare November 5, 2023 15:15

github-actions bot added the language-elixir label Nov 5, 2023

xref

68f1705

albertored marked this pull request as ready for review November 5, 2023 15:23

bforchhammer reviewed Mar 11, 2024
