Loading…
June 26, 2025
Learn more and Register to Attend

Please note: This schedule is automatically displayed in Mountain Daylight Time (UTC -5). The schedule is subject to change.
Type: Open Observability Summit clear filter
Thursday, June 26
 

10:25am MDT

Building Composable OTel Pipelines: CI/CD, Testing, Team-First, and Scalable Design - Anil Kuncham & Joe Canuel, DoorDash
Thursday June 26, 2025 10:25am - 10:50am MDT
Are you aiming to construct scalable, vendor-agnostic OpenTelemetry (OTel) pipelines? Do you want to leverage your organization's Git repository structure, CI/CD pipelines, and unit and integration testing frameworks? Are you looking to adopt industry-standard engineering best practices? Do you want to promote collaborative contributions from Observability team members to a unified codebase? Explore how the DoorDash observability team implements this approach in their technical talk. The DoorDash Observability team has developed 28 in-house OTel components to power pipelines for logs, traces, and metrics at scale, processing 100TB of log data a day and processing over 15 million spans per second. These pipelines incorporate a mix of different architectures, security tooling, and cost-efficient designs. Through running multiple pipelines, we continuously apply our learnings to improve overall performance. Join us to unlock the strategies for building scalable, maintainable, and collaborative OTel pipelines that grow with your needs!
Speakers
avatar for Anil Kuncham

Anil Kuncham

Software Engineer, DoorDash
Anil Kuncham is a software engineer on DoorDash’s Observability team, where he’s shaping a unified observability platform that bridges mobile clients and backend systems. He’s passionate about building efficient telemetry systems that tell a more accurate and actionable story... Read More →
avatar for Joe Canuel

Joe Canuel

Software Engineer, DoorDash
Software Engineer working on the Observability team at DoorDash. Been in the SRE and Observability space for 10 years, including at Spotify and a handful of startups. Located around Boston.
Thursday June 26, 2025 10:25am - 10:50am MDT
Bluebird Ballroom 1C (Level 1 - Terrace Level)

10:55am MDT

Telemetry Showdown: Fluent Bit Vs. OpenTelemetry Collector - A Comprehensive Benchmark Analysis - Henrik Rexed, Dynatrace
Thursday June 26, 2025 10:55am - 11:20am MDT
In a push to standardize observability practices, the cloud-native community has embraced OpenTelemetry, offering a unified framework for metrics, logs, and traces. Prior to this, log processing relied on agents like fluent, evolving into fluentbit. With fluentbit's recent expansion to support additional signals and the OpenTelemetry Collector's emergence, a pertinent question arises: Which is the superior choice for performance?

This session delves into:
- Unveiling the distinctions between Fluent Bit and the OpenTelemetry Collector.
- Sharing the findings derived from a series of benchmark tests.
- Providing valuable insights to empower the community in selecting the most fitting agent for their cloud-native environments.
Speakers
avatar for Henrik Rexed

Henrik Rexed

Cloud native advocate, Dynatrace
Henrik is a Cloud Native Advocate at Dynatrace and a CNCF Ambassador. Prior to Dynatrace, Henrik has worked more than 15 years, as Performance Engineer. Henrik Rexed Is Also one of the Organizer of the conferences named WOPR, KCD Austria and the owner of the Youtube Channel IsitO... Read More →
Thursday June 26, 2025 10:55am - 11:20am MDT
Bluebird Ballroom 1C (Level 1 - Terrace Level)

11:25am MDT

Correlating Application and Database Performance Using OpenTelemetry - Lin Lin & Tammy Baylis, SolarWinds
Thursday June 26, 2025 11:25am - 11:50am MDT
OpenTelemetry captures database operations from the client perspective, in the context of an originating resource, application and request. How can this insight be matched to observations about the database server? This talk focuses on the sqlcommenter feature in OTel which propagates trace context in SQL queries, so that client- and server-side observations can be correlated to answer questions like “is this request slow due to the state of the database” and “where are the expensive queries to this database coming from".

We’ll briefly describe the challenge of context propagation, give an overview of the current state of sqlcommenter implementation in OTel SDKs, demonstrate the enriched correlation that can be built from it, and highlight limitations and pitfalls found along the way in adapting it into our product.

This talk aims to give viewers a good understanding of the OTel sqlcommenter feature and the integrated performance insight that it makes possible.
Speakers
avatar for Tammy Baylis

Tammy Baylis

Staff Software Engineer, SolarWinds
Tammy is a developer at SolarWinds, specializing in instrumentation for APM. She is an active contributor to OpenTelemetry with a passion for Python.
avatar for Lin Lin

Lin Lin

Engineering Manager, SolarWinds
I manage a team that works on the APM product for SolarWinds, specifically OTel and instrumentation.
Thursday June 26, 2025 11:25am - 11:50am MDT
Bluebird Ballroom 1C (Level 1 - Terrace Level)

11:55am MDT

⚡ Lightning Talk: Faster Insights and Improved Accuracy: Spotify's Prometheus Upgrade - Lauren Roshore, Spotify
Thursday June 26, 2025 11:55am - 12:10pm MDT
Spotify replaced its in-house TSDB (Heroic) with VictoriaMetrics and Prometheus, achieving 10x faster query speeds and significant cost savings. The 2 year migration addressed stability, performance, and stakeholder dissatisfaction with Heroic's slow queries and bespoke nature. After evaluating alternatives, VictoriaMetrics was chosen for its speed, cost efficiency, Prometheus compatibility, and self-hosted flexibility. Challenges included migrating from Heroic's overly flexible data model and often misused HQL to Prometheus's standardized structure and best-practice PromQL, a necessary shift for improved data quality and query efficiency. The migration involved updating metric agents, adapting alert management, and a significant fleetshift. Benefits included improved dashboard loading times, faster alerting, more accurate data, and alignment with industry standards.
Speakers
avatar for Lauren Roshore

Lauren Roshore

Engineering Manager, Spotify
Spotify Engineering Manager in Core Infrastructure, leading the Observability and Reliability team. 9+ years in SRE/Engineering roles.
Thursday June 26, 2025 11:55am - 12:10pm MDT
Bluebird Ballroom 1C (Level 1 - Terrace Level)

12:15pm MDT

⚡ Lightning Talk: Weaving Legacy and OpenTelemetry: A Schema Strategy With Weaver - Andrew Wang, Comcast
Thursday June 26, 2025 12:15pm - 12:30pm MDT
Migrating from proprietary tracing systems to OpenTelemetry can be challenging, especially when applications rely on custom span and resource attributes that do not align with semantic conventions. Organizations need a strategy to unify legacy telemetry with OpenTelemetry standards without disrupting existing queries, visualizations, or alerts.

This session presents a real-world case study from Comcast, highlighting how the OpenTelemetry Weaver tool was used to create a unified schema combining semantic conventions with existing attributes. This approach allows teams to gradually adopt standard attributes while maintaining compatibility with current setups, ensuring continuity and reducing migration friction.

The session details how Weaver was configured using Jinja templates to generate outputs in multiple formats—JSON, YAML, and Markdown—supporting backend ingestion and schema documentation. This enabled consistency across pipelines and improved clarity for engineering teams reviewing trace attribute definitions. The talk also highlights how flexible namespacing integrated domain-specific attributes alongside standardized ones without sacrificing backend compatibility.
Speakers
avatar for Andrew Wang

Andrew Wang

Senior Principal Software Engineer, Comcast
Andrew Wang is a Software Engineer at Comcast, developing solutions in Observability. He focuses on tracing services and has worked on automating logging services, anomaly detection in metrics, and application monitoring via synthetics. His projects range from prototyping network... Read More →
Thursday June 26, 2025 12:15pm - 12:30pm MDT
Bluebird Ballroom 1C (Level 1 - Terrace Level)

1:30pm MDT

⚡ Lightning Talk: Observability-First DevSecOps: Building Resilient Multi-Cloud Pipelines With OpenTelemetry and GitOps - Ravindra Bhargava, UPS
Thursday June 26, 2025 1:30pm - 1:45pm MDT
In today’s cloud-native world, ensuring visibility, reliability, and security across multi-cloud environments is critical. As GitOps adoption grows, observability must evolve in parallel to support performance, compliance, and resilience.

In this session, I’ll share how we built an observability-first DevSecOps pipeline using OpenTelemetry, Prometheus, Grafana, Terraform, and ArgoCD across GCP, Azure, and AWS. I’ll cover how we embedded telemetry into CI/CD, cut MTTR by 35%, and optimized cloud performance and costs with real-world metrics.

We'll explore common anti-patterns when scaling observability in GitOps workflows—and how to overcome them. I’ll also show how we aligned observability data with business KPIs to drive better stakeholder decisions.

Attendees will leave with a practical blueprint for implementing open-source observability at scale in secure, automated pipelines.
Speakers
avatar for Ravindra Bhargava

Ravindra Bhargava

Lead Software Development Engineer, UPS
Leads DevSecOps for app modernization on Google Cloud. Strong infra & platform eng background at UPS.
Thursday June 26, 2025 1:30pm - 1:45pm MDT
Bluebird Ballroom 1C (Level 1 - Terrace Level)

2:10pm MDT

Telemetry Pipelines: Never Gonna Let You Down - Yuri Oliveira, OllyGarden & Alex Boten, Honeycomb
Thursday June 26, 2025 2:10pm - 2:35pm MDT
In cloud-native environments, telemetry pipelines are expected to be reliable, especially when everything else isn't. Unfortunately, it's exactly in these critical moments that your observability setup might decide to take an unexpected break.

In this session, we'll explore proven strategies for building robust telemetry pipelines that won't abandon you mid-incident. You'll learn practical techniques for configuring a continuous data ingestion, how to manage telemetry streams effectively, and be resilient enough to survive backend disruptions.

Join us to discover how to keep your observability data flowing smoothly—because your OpenTelemetry Collector should never give you up, never let you down, and certainly never run around and desert you when you need it most.
Speakers
avatar for Yuri Oliveira

Yuri Oliveira

Software Engineer, OllyGarden
Yuri Oliveira has dedicated his career to helping companies elevate their infrastructure automation to the next level. With 15 years of experience in critical environments as a SysAdmin, SRE, and DevOps Engineer, he has worked on designing resilient systems and streamlining operations... Read More →
avatar for Alex Boten

Alex Boten

Staff Software Engineer, Honeycomb
Alex Boten is a staff software engineer that has spent the last ten years helping organizations adapt to a cloud-native landscape by mashing keyboards. From building core network infrastructure to mobile client applications and everything in between, Alex has first-hand knowledge... Read More →
Thursday June 26, 2025 2:10pm - 2:35pm MDT
Bluebird Ballroom 1C (Level 1 - Terrace Level)

2:40pm MDT

No Dependencies. No Plugins. Just Native OpenTelemetry - Liudmila Molkova, Microsoft
Thursday June 26, 2025 2:40pm - 3:05pm MDT
The best telemetry starts at the source—inside the client libraries.
But in most cases, that means taking a dependency on the OpenTelemetry API from your library. And while it’s stable, minimal, reliable, and safely no-op unless configured—transitive dependencies are still the bane of any library developer’s existence, and most of us try to avoid them.

To work around this, people reach for abstractions, plugins, bridges, or even OTel forks that break context propagation. The result? A poor user experience. Users must find the right plugin, install it, wire it up—and still hit the diamond dependency problem, now it just affects a subset of users.

But what if you could take a truly optional dependency? If OpenTelemetry is on the classpath, instrumentation kicks in. If it’s not, no harm done.
How hard is that to pull off? How reliable? How performant?

Let’s explore that—through the lens of the next generation of Azure SDKs for Java. Spoiler: it’s easy and fast, and as a side-bonus, we can fall back to logs-based tracing if OTel is not found.
Speakers
avatar for Liudmila Molkova

Liudmila Molkova

Principal Software Engineer, Microsoft
Liudmila Molkova is a Principal Software Engineer at Microsoft working on observability and Azure client libraries. She is a co-author of distributed tracing implementations across the .NET ecosystem including HTTP client instrumentation and Azure Functions. Liudmila is an active... Read More →
Thursday June 26, 2025 2:40pm - 3:05pm MDT
Bluebird Ballroom 1C (Level 1 - Terrace Level)

3:10pm MDT

Let's Generate Art and Traces! - Tiffany Jernigan, Grafana Labs
Thursday June 26, 2025 3:10pm - 3:35pm MDT
The year is 2025, and it's pretty safe to assume that most of us have used "Generative AI" (or GenAI) at least once to produce images from prompts. Rumors say that some of us even used that as an excuse to buy a fancy new GPU and run models like Stable Diffusion locally, "For Science!", of course.

In this talk, we'll set aside the feelings (positive or negative!) that we might harbor for generative AI and use them as an excuse, not to buy shiny new GPUs, but to learn how to build a practical Spring + Java application in a GenAI context, deploy it on Kubernetes, and instead of tracing GPU invoices, we’ll trace our application’s behavior using distributed tracing.

Using Spring, we’ll store prompts in a database and submit them to generative models. Image results will be stored and then displayed. The application will take advantage of Kubernetes batch job processing facilities and implement reasonable security policies by leveraging RBAC. We will also look into distributed tracing, using OpenTelemetry, Grafana, and Grafana Tempo, to visualize the lifecycle of our requests as they pass through our application.
Speakers
avatar for Tiffany Jernigan

Tiffany Jernigan

Senior Developer Advocate, Grafana Labs
Tiffany is senior developer advocate at Grafana Labs and a CNCF Ambassador. She also formerly worked as a software developer and developer advocate at VMware, Amazon, Docker, and Intel. Prior to that, she graduated from Georgia Tech with a degree in electrical engineering. In her... Read More →
Thursday June 26, 2025 3:10pm - 3:35pm MDT
Bluebird Ballroom 1C (Level 1 - Terrace Level)

4:00pm MDT

How To Think About Instrumentation Overhead - Jason Plumb, Splunk
Thursday June 26, 2025 4:00pm - 4:25pm MDT
Novice observability practitioners are often overly obsessed with performance. They might approach instrumentation with skepticism and have concerns about latency degradation or resource consumption. This talk is a primer on the topic of instrumentation overhead, and it will teach you how to think about overhead in an observability context. We will cover the causes of overhead and why overhead is so hard to measure and even harder to predict reliably. Lastly, we will present some practical techniques for understanding overhead in your environment and some strategies for coping with it.
Speakers
avatar for Jason Plumb

Jason Plumb

Engineer, Splunk
Jason Plumb (he/him) is a hacker, artist, experimenter, polyglot programmer, and dad from Portland, OR, USA. He is co-maintainer of OpenTelemetry Android and an approver in various OpenTelemetry java projects. When not at work, Jason volunteers with Futel to install and maintain a... Read More →
Thursday June 26, 2025 4:00pm - 4:25pm MDT
Bluebird Ballroom 1C (Level 1 - Terrace Level)

4:30pm MDT

Monitoring GenAI Applications - Prasad Mujumdar, Okahu AI
Thursday June 26, 2025 4:30pm - 4:55pm MDT
GenAI Observability is proactive monitoring of your AI apps and cloud infra they run on to understand how to make them work better. The ever evolving landscape of genAI technologies makes it very challenging to manage. Project. Monocle is built for App developers to trace their app code in any environment without lots of custom code decoration. It’s a community driven open source project under Linux Foundation AI&Data, built on top of OpenTelemetry. It provides out of the box support of several genAI tech components. With very little to no code changes, you can generate OpenTelemetry compatible traces and spans of your genAI application. Monocle provides consistent format to describe entities like LLMs and vector stores as well as events like prompts and responses. It can be integrated with apps in personal dev/lab environment as well as cloud deployments.

This session will illustrate challenges of monitoring/tracing genAI applications which is a big problem taking your dev/prototypes to production. It will provide a quick overview of how to use Monocle Python and Typescript libraries in your application with little efforts.
Speakers
avatar for Prasad Mujumdar

Prasad Mujumdar

CTO, Okahu AI
Prasad is founder and CTO of Okahu AI. He's leads the LF's Data&AI project Monocle. Prasad has extensive experience in data management, data governance and AI technology, in past worked in IBM, Microsoft and Cloudera at technical leadership positions.
Thursday June 26, 2025 4:30pm - 4:55pm MDT
Bluebird Ballroom 1C (Level 1 - Terrace Level)

5:00pm MDT

Innovating on Top of Open Source Observability - Pushing the Boundaries of Innovation - Vijay Samuel & Wei Tang, eBay
Thursday June 26, 2025 5:00pm - 5:25pm MDT
We all have innovative ideas around what has been done on top of Open Source. Sometimes it maybe because of our organization's "specific" need and other times it is because the technology can't simply scale. Sometimes it may be because you are trying new things that are premature to put out. Never the less, it becomes important to manage the constraints effectively. This problem has been the norm at eBay's Observability platform where scale, innovation and organizational demand drives creativity. In this talk we discuss how we have improvised on top technologies like Prometheus, Alertmanager and OpenTelemetry Collector. Some of the ideas we would discuss include:

* enriching alerts using Exemplars
* spanmetrics connector for massively large tracing systems
* scaling tracing backends using clickhouse

At the end of this talk end users would get a sense of new things that can be done on top of existing projects and scaling open source with a little bit of extra love.
Speakers
avatar for Wei Tang

Wei Tang

MTS 2 - Software Engineer, eBay
Joined eBay at 2015. Have been working in observability for 5 years. Worked with teammates to build metrics platform/alerting platform, and machine learning platform. Starting from 2022, focus on building distributed tracing solution.
avatar for Vijay Samuel

Vijay Samuel

Principal MTS, Architect, eBay
Vijay Samuel works with eBay's observability platform as its architect. During his time at eBay Vijay has transformed eBay's observability platform into a cloud native offering that is primarily built on top of open source technologies. He loves to code in Go and play video games... Read More →
Thursday June 26, 2025 5:00pm - 5:25pm MDT
Bluebird Ballroom 1C (Level 1 - Terrace Level)

5:30pm MDT

⚡ Lightning Talk: Taming Metric Cardinality: Practical Cost Reduction With the OpenTelemetry Collector - Jon Reeve & Eric Anderson, ControlTheory
Thursday June 26, 2025 5:30pm - 5:45pm MDT
As OpenTelemetry adoption expands, one common pain point across the community is metric cardinality. High-cardinality metrics—especially those with user- or host-level tags—can overload backends with unique time series, driving up costs and reducing signal quality. Many teams only notice once the vendor bill arrives.

In this talk, we’ll explore how to use the OpenTelemetry Collector and OTTL (OpenTelemetry Transformation Language) to proactively reduce metric cardinality. Learn how to spot costly metrics, reshape or drop them with the filter processor, test changes with the OTTL Playground, and implement a control plane to safely evolve filtering rules.

Whether you’re using Prometheus, a commercial vendor, or OpenTelemetry-native pipelines, this session offers practical, open-source strategies to reduce noise, improve signal, and keep observability costs sustainable.
Speakers
avatar for Jon Reeve

Jon Reeve

CPO, ControlTheory
Jonathan Reeve is a co-founder of ControlTheory, where he helps teams take control of their observability data with smarter, more efficient telemetry pipelines. A passionate advocate for OpenTelemetry and open standards, Jonathan focuses on making observability more scalable, cost-effective... Read More →
avatar for Eric Anderson

Eric Anderson

CTO, ControlTheory
Eric is the Chief Technology Officer at ControlTheory, an observability geek, and serial entrepreneur with a few notches in his belt. When not hacking, solving problems, or fighting some YAML file somewhere, you'll problem find him hovering in a helicopter somewhere.
Thursday June 26, 2025 5:30pm - 5:45pm MDT
Bluebird Ballroom 1C (Level 1 - Terrace Level)
 
Share Modal

Share this link via

Or copy link

Filter sessions
Apply filters to sessions.