parente.dev

A Software Observability Roundup

January 02, 2025

I spent some time recently catching up on my #to-read saves in Obsidian. More than a few of these were blog posts from 2024 about software observability. Talk of "redefining observability", "observability 2.0", and "try Honeycomb" had caught my eye in a few spaces, and so I had been hoarding links on the topic.

After spending a few days immersing myself in those articles and branching out to others, I decided to write this bullet-form roundup:

  1. for myself, as a way of solidifying my current understanding
  2. in public, as a way to invite corrections and improvements (drop a comment below or @parente.dev on Bluesky!)
  3. with my colleagues in mind, as a new way to approach and discuss an ever-green question:

As our issue space changes and grows, and our solutions adapt and scale in response, what (else) should we do today so that we can readily address unknown-unknowns tomorrow?


Overview

The seventeen references I surveyed offer perspectives on observability as it pertains both to software systems and organizations around them. They cover what observability is, what problems it solves, how it is and should be implemented. There's alignment from the authors on the state of affairs, learned best practices, and a direction in which the industry should head. Shared terminology and goals are works in progress.

Origins

Problems and Limitations

Better Practices

Looking Forward

References


  1. Observability (software). (2024, May 24). In Wikipedia

  2. Bourgon, P. (2017, February 21). Metrics, tracing, and logging. Peter Bourgon's Blog

  3. Parker, A. (2024, March 29). Re-Redefining Observability. Austin Parker's Blog

  4. Majors, C. (2024, August 7). Is It Time To Version Observability? (Signs Point To Yes). charity.wtf

  5. Sigelman, B. (2021, February 4). Debunking the 'Three Pillars of Observability' Myth. Software Engineering Daily

  6. Weakly, H. (2024, October 3). The 4 Evolutions of Your Observability Journey. The New Stack

  7. Sigelman, B. (2021, February 4). Observability Won’t Replace Monitoring (Because It Shouldn’t). The New Stack

  8. Majors, C. (2024, January 24). The Cost Crisis in Observability Tooling. Honeycomb Blog

  9. Tane, B. & Galbraith, K. (2024, December 6). Observing Serverless Applications (SVS212) [Conference presentation]. AWS re:Invent 2024 Las Vegas, Nevada, United States. 

  10. Majors, C. (2024, November 19). There Is Only One Key Difference Between Observability 1.0 and 2.0. Honeycomb Blog

  11. Tane, B. (2024, September 8). Observability Wide Events 101. Boris Tane's Blog

  12. Morrell, J. (2024, October 22). A Practitioner's Guide to Wide Events. Jeremy Morrell's Blog

  13. Majors, C., Fong-Jones, L., & Miranda, G. (2022, May 6). Observability Engineering: Achieving production excellence. O’Reilly Media, Inc. 

  14. Burmistrov, I. (2024, February 15). All you need is Wide Events, not "Metrics, Logs and Traces". A Song Of Bugs And Patches

  15. Weakly, H. (2024, March 15). Redefining Observability. Hazel Weakly's Blog

  16. Weakly, H. (2024, December 9). The Future of Observability: Observability 3.0. Hazel Weakly's Blog

  17. Majors, C. (2024, December 20). On Versioning Observabilities (1.0, 2.0, 3.0…10.0?!?). charity.wtf

Another Read: Create Obsidian Web Clip Summaries on MacOS with Firefox and Llama 3.2 »

I use Obsidian to capture all kinds of information. I set out to get the Obsidian web clipper Firefox extension configured on my Mac with a local Meta Llama 3.2 model to summarize content. This post explains the steps I took top-to-bottom.