
Add self-diagnostics example #1846

Open

wants to merge 23 commits into main

Conversation

lalitb
Member

@lalitb lalitb commented May 29, 2024

Changes

Example to demonstrate using tracing as a global error handler for errors generated by the OpenTelemetry Metrics SDK. In this example, measurements are recorded so that the cardinality limit is exceeded, which triggers an error to be logged. This error is then emitted to stdout via the opentelemetry-appender-tracing subscriber (see the sketch below).
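
For orientation, a minimal sketch of the measurement loop the description refers to, assuming a counter obtained from the global meter provider (the instrument name, attribute key, and loop bound are illustrative, and whether the builder finishes with .init() or .build() depends on the opentelemetry version used):

use opentelemetry::{global, KeyValue};

fn record_past_cardinality_limit() {
    let meter = global::meter("self-diagnostics-example");
    let counter = meter.u64_counter("requests").init();

    // Each distinct attribute set creates a new time series; emitting more
    // unique values than the SDK's cardinality limit makes the Metrics SDK
    // report an error, which the tracing-based handler then writes to stdout.
    for i in 0..3000 {
        counter.add(1, &[KeyValue::new("user_id", i.to_string())]);
    }
}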

Merge requirement checklist

  • CONTRIBUTING guidelines followed
  • Unit tests added/updated (if applicable)
  • Appropriate CHANGELOG.md files updated for non-trivial, user-facing changes
  • Changes in public API reviewed (if applicable)

@lalitb lalitb requested a review from a team as a code owner May 29, 2024 23:22
@lalitb lalitb marked this pull request as draft May 29, 2024 23:22
@lalitb lalitb changed the title Add self-diagnostics example [WIP] Add self-diagnostics example May 29, 2024

codecov bot commented May 29, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 74.6%. Comparing base (e0fb7fe) to head (3631c8f).
Report is 1 commit behind head on main.

Additional details and impacted files
@@           Coverage Diff           @@
##            main   #1846     +/-   ##
=======================================
- Coverage   74.6%   74.6%   -0.1%     
=======================================
  Files        122     122             
  Lines      19952   19952             
=======================================
- Hits       14902   14901      -1     
- Misses      5050    5051      +1     


@lalitb lalitb marked this pull request as ready for review May 30, 2024 02:31
@lalitb lalitb changed the title [WIP] Add self-diagnostics example [Add self-diagnostics example May 30, 2024
@lalitb lalitb changed the title [Add self-diagnostics example Add self-diagnostics example May 30, 2024
let provider: LoggerProvider = LoggerProvider::builder()
    .with_simple_exporter(exporter)
    .build();
let filter = EnvFilter::new("error"); // only push errors
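
For context, a hedged sketch of how a provider and filter like these are typically attached to tracing via opentelemetry-appender-tracing (continuing the snippet above; the layer and registry calls follow the crates' public APIs, not necessarily this PR's exact code):

use opentelemetry_appender_tracing::layer::OpenTelemetryTracingBridge;
use tracing_subscriber::{layer::SubscriberExt, util::SubscriberInitExt, Layer};

// Bridge tracing events into the LoggerProvider built above and keep only
// ERROR-level events, so SDK self-diagnostics emitted via tracing::error!
// reach the stdout log exporter without flooding it.
let otel_layer = OpenTelemetryTracingBridge::new(&provider).with_filter(filter);
tracing_subscriber::registry().with(otel_layer).init();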
Member

Can you add a README.md and explain how this works and how infinite logging is prevented here?
(I am not sure we actually prevent that, though. For example, if the OTLP endpoint is not reachable and we use this approach, would we be in an infinite loop?)

Member Author

@lalitb lalitb May 30, 2024

For example, if OTLP Endpoint is not reachable, and we use this approach - would we be in an infinite loop

It shouldn't be an infinite loop with the OTLP Metrics Exporter + stdout LogExporter. Should we update the example to use OTLP exporters for both logs and metrics, since that would need some filters to avoid an infinite loop when the collector is not running?

Member

Yes. That'd be the most common scenario (using OTLP), so showing exactly how to filter would be helpful for the majority of users.

Member Author

There is a potential risk of an infinite loop when using the OTLP Metrics Exporter together with the OTLP Log Exporter as a custom handler: if the OTLP Log Exporter itself generates an error, that error can re-enter the pipeline and trigger the loop. We can manage this in our code; currently, it is handled by the custom handler.
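
One way to break that loop, sketched below under the assumption that the OTLP exporters go through tonic/hyper internally, is to add per-crate directives to the EnvFilter so the exporter's own errors never re-enter the OTLP log pipeline (the exact target names are assumptions, not taken from this PR):

use tracing_subscriber::EnvFilter;

// Allow application-level errors through, but silence the HTTP/gRPC stack
// used by the OTLP exporters, so a failed export cannot produce new log
// events that are themselves exported (and fail) again.
let filter = EnvFilter::new("error")
    .add_directive("hyper=off".parse().unwrap())
    .add_directive("tonic=off".parse().unwrap())
    .add_directive("h2=off".parse().unwrap())
    .add_directive("reqwest=off".parse().unwrap());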

Comment on lines 13 to 22
fn custom_error_handler(err: OtelError) {
    match err {
        OtelError::Metric(err) => error!("OpenTelemetry metrics error occurred: {}", err),
        OtelError::Trace(err) => error!("OpenTelemetry trace error occurred: {}", err),
        OtelError::Log(err) => error!("OpenTelemetry log error occurred: {}", err),
        OtelError::Propagation(err) => error!("OpenTelemetry propagation error occurred: {}", err),
        OtelError::Other(err_msg) => error!("OpenTelemetry error occurred: {}", err_msg),
        _ => error!("An unknown OpenTelemetry error occurred"), // not expected to be reached
    }
}
Contributor

This is pretty much the same as the default handler, but using tracing::error! macros?

Member Author

@lalitb lalitb May 30, 2024

Yes, true. This is to demonstrate using the OpenTelemetry logging pipeline as a custom error handler for self-diagnostics.
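
For readers following along, registering the handler shown above would look roughly like this, assuming the opentelemetry::global::set_error_handler API available at the time of this PR:

use opentelemetry::global;

// Install the custom handler so every SDK-internal error is routed through
// tracing::error! instead of the default handler's stderr output.
global::set_error_handler(custom_error_handler)
    .expect("failed to install OpenTelemetry error handler");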

@ramgdev
Contributor

ramgdev commented May 30, 2024

Is there an easy way to catch from the error handler when spans are being dropped? Ideally, I'd like to use this technique to build a status provider for the export pipeline.

@lalitb
Member Author

lalitb commented May 31, 2024

Moving to draft to address some concerns.

@lalitb lalitb marked this pull request as draft May 31, 2024 16:21
@cijothomas
Member

Is there an easy way to catch from the error handler when spans are being dropped? Ideally, I'd like to use this technique to build a status provider for the export pipeline.

Can you elaborate more, or maybe create a new issue? This PR just shows how to override the error handler to route logs via tracing.

@lalitb lalitb marked this pull request as ready for review June 3, 2024 21:08
// Metrics are exported every 30 seconds by default when using the stdout exporter;
// shutting down the MeterProvider here flushes the metrics immediately
// instead of waiting for the 30-second interval.
meter_provider.shutdown()?;
Contributor

Use force_flush instead of shutdown?

Member Author

This was deliberate: calling shutdown twice triggers the "already shutdown" error.

Contributor

The collector logs added in the README file do not show anything for the "already shutdown" error, though; there is only an entry for exceeding the cardinality limit.

Member Author

Yes, the extra shutdown was meant to demonstrate that the error is triggered but not logged, since the pipeline is already closed by the first shutdown. But it seemed to be causing more confusion, so I have removed the second shutdown :)
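
For completeness, the alternative the reviewer suggested (flush instead of a second shutdown) would look roughly like this; force_flush follows the SdkMeterProvider API and is an assumption about the final example, which ended up keeping a single shutdown:

// Flush buffered metrics immediately instead of waiting for the periodic
// reader's interval, then shut the provider down exactly once.
meter_provider.force_flush()?;
meter_provider.shutdown()?;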
