Session + Live Q&A
An Observable Service with No Logs
After working with Honeycomb for a little while and starting to instrument our existing code with events, I’d become enamoured with the level of observability possible with that sort of telemetry. In particular, I was struck by how easy it became to interactively and visually explore how my systems were behaving in production.
With these experiences fresh in my mind, I found myself in the unusual position of starting development on a new service written in Go, at a time when Go was a new language for services at the company (previously it had only been used for CLIs). This meant that we couldn’t use any existing internal libraries, and had a bit of freedom to imagine something different.
So I proposed an experiment to my team:
What if we never included a logging library in the new service, and only used event tracing?
This is the story of how that went.
Speaker
Glen Mailer
Senior Software Engineer @Geckoboard
After spending a bunch of years as a contractor, Glen worked across a variety of roles at all levels of the stack: from infrastructure to frontend with a detour via databases - this has led to a very varied set of experiences to draw from. Most recently he’s worked on build infrastructure...
From the same track
Profiles, the Missing Pillar: Continuous Profiling in Practice
Thursday May 12 / 10:00AM EDT
With Continuous Profiling (CP) you capture resource usage (such as CPU, memory, I/O, etc.) over time, enabling you to pinpoint the (source) code that is slow or causes an issue. In recent times, CP has become mainstream and a number of open source projects such as Parca, Pyroscope, or CNCF...
Michael Hausenblas
Solution Engineering Lead @AWS
Chaos Engineering Observability with Visual Metaphors
Thursday May 12 / 01:30PM EDT
Observability is key in operating a system in production; it’s required during an incident, when an operator has to interrogate, inspect, and piece together what happened to avoid a similar event. In those scenarios, Chaos engineering and Observability are closely connected - providing...
Yury Niño Roa
Cloud Infrastructure Engineer @Google
Slack’s DNSSEC Rollout: Third Time’s the Outage
Thursday May 12 / 12:20PM EDT
We all have to manage DNS. DNS changes are inherently high-blast-radius and high-visibility. We present a case study of what happened when a large SaaS company enabled DNSSEC. We did significant planning and testing beforehand. The rollout went smoothly for most of our domains, but one...
Rafael de Elvira
Senior Software Engineer @Slack