Dynamic namespace handling for ServiceMonitor serverName in operator metrics scraping #6846

Open · iblancasa opened this issue on Oct 18, 2024 · 1 comment
Labels: lifecycle/stale (Denotes an issue or PR has remained open with no activity and has become stale.)

iblancasa (Contributor) commented:
Feature Request

Describe the problem you need a feature to resolve.

We have the /metrics endpoint for the operator, and we create a ServiceMonitor to scrape those metrics. However, the ServiceMonitor's tlsConfig requires a CA and a serverName, and the correct serverName value depends on the namespace where the operator is installed.

If the user installs the operator in a namespace other than the default one, the hard-coded serverName no longer matches the service's serving certificate. TLS verification fails, and as a result the ServiceMonitor cannot scrape the metrics.

For example:

 tlsConfig:
   ca: {}
   caFile: /etc/prometheus/configmaps/serving-certs-ca-bundle/service-ca.crt
   cert: {}
   serverName: my-operator-controller-manager-metrics-service.my-namespace.svc
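
For reference, the tlsConfig above sits inside a ServiceMonitor roughly like the one below. This is only a sketch: the resource names and label selector assume the default kubebuilder/operator-sdk scaffolding, and "my-namespace" stands in for whatever namespace was baked in at build time, which is exactly the value that breaks when the operator is installed elsewhere.

    apiVersion: monitoring.coreos.com/v1
    kind: ServiceMonitor
    metadata:
      name: controller-manager-metrics-monitor
      namespace: my-namespace
    spec:
      selector:
        matchLabels:
          control-plane: controller-manager
      endpoints:
        - port: https
          scheme: https
          tlsConfig:
            caFile: /etc/prometheus/configmaps/serving-certs-ca-bundle/service-ca.crt
            # Hard-coded at build time; it only matches the serving certificate
            # if the operator is actually deployed to "my-namespace".
            serverName: my-operator-controller-manager-metrics-service.my-namespace.svc
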

The current workaround is to create the ServiceMonitor from the operator at runtime, but this is not ideal because it pushes OpenShift-specific logic into upstream operators.

Describe the solution you'd like.

We want the operator-sdk to dynamically handle the serverName configuration for the ServiceMonitor based on the namespace where the operator is installed. This would ensure that the correct serverName is used, regardless of the installation namespace, making the certificate valid and allowing the ServiceMonitor to scrape the metrics properly.
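
One possible shape for this, at least for kustomize-driven installs, is a kustomize replacements entry that copies the namespace of the metrics Service into the namespace segment of serverName at build time (kustomize replacements can split a field value on a delimiter and overwrite a single segment), much like the existing replacement the kubebuilder scaffold uses for cert-manager dnsNames. A minimal sketch, assuming the default scaffold names and that serverName is pre-populated in the "<service>.<namespace>.svc" form:

    # config/default/kustomization.yaml (excerpt, sketch)
    namespace: my-namespace

    resources:
      - ../prometheus

    replacements:
      # Copy the namespace the metrics Service lands in into the second
      # dot-separated segment of serverName, i.e. "<service>.<namespace>.svc".
      - source:
          kind: Service
          name: controller-manager-metrics-service
          fieldPath: metadata.namespace
        targets:
          - select:
              kind: ServiceMonitor
              name: controller-manager-metrics-monitor
            fieldPaths:
              - spec.endpoints.0.tlsConfig.serverName
            options:
              delimiter: "."
              index: 1

This would not help OLM-based installs, where the namespace is only known at deploy time, so handling it in operator-sdk itself (or at runtime) would still be the more general fix.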

Any alternative solution that covers this use case would also be welcome.

openshift-bot commented:

Issues go stale after 90d of inactivity.

Mark the issue as fresh by commenting /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.
Exclude this issue from closing by commenting /lifecycle frozen.

If this issue is safe to close now please do so with /close.

/lifecycle stale

The openshift-ci bot added the lifecycle/stale label on Jan 17, 2025.