Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: read db operation name from telemetry_options #368

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

yordis
Copy link
Contributor

@yordis yordis commented Aug 20, 2024

Context

Ecto allows to pass :telemetry_options as a shared options https://github.com/elixir-ecto/ecto/blob/cba494787b8e98aa0038f6c35e2f44d3661f2620/lib/ecto/repo.ex#L93-L94

I use that to set the db.operation OTEL value.

Comment on lines 208 to 216
defp maybe_add_db_operation(attributes, nil) do
attributes
end

defp maybe_add_db_operation(attributes, db_operation) do
Map.put(attributes, Trace.db_operation(), db_operation)
end
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In general, should I care about guarding against nil, or is that something that downstream would filter out? Or should I even bother sending a nil value in case it is nil?

I see nil values in my traces, so I want to confirm the general rule around the topic.

Copy link
Collaborator

@bryannaegele bryannaegele left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This attribute is deprecated so we'll need to come back to implementing it after semconv 1.26 is introduced. The new attribute is experimental so it'll have to come in a later pass.

@@ -82,7 +83,7 @@ defmodule OpentelemetryEcto do
def handle_event(
event,
measurements,
%{query: query, source: source, result: query_result, repo: repo, type: type},
%{query: query, source: source, result: query_result, repo: repo, type: type, options: options},
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This presumes options is available. Is that accurate?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

end

defp maybe_add_db_operation(attributes, db_operation) do
Map.put(attributes, Trace.db_operation(), db_operation)
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is not a valid attribute any longer so this would need to be introduced with semconv 1.26. It's also experimental so it'll have to be opt-in.

For db.operation.name the value is supposed to be the command or operation being invoked (function name, SQL operation, etc), so I'm not sure it can be changed by the user.

[5]: It is RECOMMENDED to capture the value as provided by the application without attempting to do any case normalization. If the operation name is parsed from the query text, it SHOULD be the first operation name found in the query. For batch operations, if the individual operations are known to have the same operation name then that operation name SHOULD be used prepended by BATCH , otherwise db.operation.name SHOULD be BATCH or some other database system specific term if more applicable.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should I hard-code it for now without using Trace.*? I intended to use SemConv. I wasn't sure what to do about it.

For db.operation.name the value is supposed to be the command or operation being invoked (function name, SQL operation, etc), so I'm not sure it can be changed by the user.

I asked for direction in the Slack channel, the wording and examples are confusing me!

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@yordis you can look at opentelemetry_xandra where I use all hardcoded semcov names exactly because we are waiting for 1.26

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Either way it is an experimental attribute which requires part of the rewrite of accepted options and overall processing. I'm happy to work through what the values can be so that it can go in there but it will not be published in a release before then.

Copy link
Contributor Author

@yordis yordis Aug 20, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That works for me!

Based on the description, it could be anything from the app, so I think my intent is correct.

Maybe we should follow up with OTEL folks; regardless, I would love to have a mechanism to have an identity for the operation.

Maybe I should expand the feature to allow passing of any value into the span telemetry_options: [otel_attributes: [...]] (to make a point)

@yordis yordis force-pushed the add-operation-name branch from 1d9bc8f to 51ce033 Compare August 20, 2024 03:29
@yordis
Copy link
Contributor Author

yordis commented Aug 20, 2024

@bryannaegele, given that there is some key under the SemConv (do not get cut up on the one I used), should we wait until the SemConv changes are done, or could we manually add it and add the function later?

@yordis
Copy link
Contributor Author

yordis commented Aug 20, 2024

After reading https://opentelemetry.io/docs/specs/semconv/database/sql/ I am even more confused;

The intent is to add the identity of a given call to the database so that I can identify where the call may be coming from.

  • Why can you not use the table name, operation name, or query itself?

The query is dynamic, and it doesn't have a limit of characters, so systems like CloudWatch could not ingest the data as is. Other things like table name and operation name may tell me what happened, but not where it came from, so I can not observe the metric where the optimizations should be performed without hunting for where the query may be coming from.

So, I am not sure what the answer is here.

In the worst case, maybe allow a key to be passed in as "additional attributes."?!

@yordis yordis requested a review from bryannaegele August 20, 2024 03:48
@yordis
Copy link
Contributor Author

yordis commented Aug 20, 2024

From https://opentelemetry.io/docs/specs/semconv/database/database-metrics/

[5]: It is RECOMMENDED to capture the value as provided by the application without attempting to do any case normalization.

Wouldn't that imply that passing is OK, as I am doing it in the PR?

The second part is about "If the operation name is parsed from the query text"

@yordis yordis force-pushed the add-operation-name branch from 51ce033 to 307be3a Compare August 20, 2024 14:33
@yordis yordis requested a review from whatyouhide August 20, 2024 14:33
@bryannaegele
Copy link
Collaborator

The intent is to add the identity of a given call to the database so that I can identify where the call may be coming from.

Wouldn't the parent span identify the caller? Trying to understand what you're wanting to achieve just because I believe this field is only for the precise name of the operation being executed rather than the context of the caller. I'm guessing that is the case?

Other things like table name and operation name may tell me what happened, but not where it came from

Wouldn't that imply that passing is OK, as I am doing it in the PR?

That line just says we shouldn't modify what is parsed, e.g. select shouldn't be modified to SELECT

And just so we're on the same page, this library only does spans at this time, so this section is the relevant one.

Let's see what the WG has to say. A complicating factor here is this value is conditionally part of the span name so any solution has to take that into account for all use cases.

@yordis
Copy link
Contributor Author

yordis commented Aug 20, 2024

The tricky bit here, we are using exclusively :telemetry and mapping into OTEL 😭 😭 😭 😭 😭

@yordis
Copy link
Contributor Author

yordis commented Aug 23, 2024

@bryannaegele

Screenshot 2024-08-23 at 12 10 25 PM

@yordis
Copy link
Contributor Author

yordis commented Oct 16, 2024

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants