Some articles I've been collecting for observability / distributed tracing:
- How to maximize span ingestion while limiting writes per second to Scylla with Jaeger - Jaeger primarily supports two backends: Cassandra and Elasticsearch. Here at Grafana Labs we use Scylla, an open source Cassandra-compatible backend. In this post we’ll look at how we run Scylla at scale and share some techniques to reduce load while ingesting even more spans. We’ll also share some internal metrics about Jaeger load and Scylla backend performance.
- The Cost of Containerization for Your Scylla - There is, however, a performance payoff for the operational convenience of using containers. This is to be expected because of the extra layer of abstraction (the container itself), relaxation on resource isolation, and increased context switches.
- Hacking your way to Observability — Part 3 - A distributed tracing quick start with Jaeger and OpenTelemetry
2021/09/26 - Two different links from two different lists talking about the same thing but differently:
- Five years evolution of open-source distributed tracing - provides temporal context for Hawkular, Zipkin, OpenTracing, Jaeger, OpenTelemetry, Hypertrace, Grafana Tempo and SigNoz
- Observability: The 5-Year Retrospective - how to find answers to unknown-unknowns.
2021/10/28 Hacker News - OpenTelemetry
- OpenTelemetry was OpenCensus, and before that it was OpenTracing
- OpenTelemetry is a collection of tools, APIs, and SDKs. Use it to instrument, generate, collect, and export telemetry data (metrics, logs, and traces) to help you analyze your software’s performance and behavior.
- OpenMetrics relates to the metrics part of OpenTelemetry, where the latter is currently in alpha/experimental phase. OpenTelemetry has a statement on being interoperable with OpenMetrics for both accepting, forwarding and generating in the OpenMetrics format [0]. So they are two separate CNCF projects where OpenMetrics has overlap with a part of OpenTelemetry.