-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: expose prometheus metrics from operator #138
Conversation
Now the operator exposes a prometheus scrape endpoint. It does not try to push its metrics over OTLP. This operator is long lived so this should not be a problem. The runner continues to push its metrics over OTLP which is important since it is short lived. However I was not able to get both prometheus scrape and OTLP push working on the same process. So the operator only supports prometheus and the runner only support OTLP push.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Amazing, I'd just like to see some error metrics when things don't go as expected.
Added comments.
requests_status_codes_total.as_any(), | ||
requests_duration_percentiles.as_any(), | ||
simulation_min_peer_requests_per_second.as_any(), | ||
]; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
🤩
common/src/telemetry.rs
Outdated
Ok((meter, shutdown, join)) | ||
} | ||
|
||
// /metrics scrape endpoin |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
comment spelling, "endpoint" 😇
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
Now the operator exposes a prometheus scrape endpoint. It does not try to push its metrics over OTLP. This operator is long lived so this should not be a problem.
The runner continues to push its metrics over OTLP which is important since it is short lived.
However I was not able to get both prometheus scrape and OTLP push working on the same process. So the operator only supports prometheus and the runner only support OTLP push.