So you think you should Serverless? Things to know before you do with Sebastian Vietz!

Episodes

MCPs (Model Context Protocol) are not that magic, but they enable magic things with Dana Harrison
Apr 14 · PurePerformance
MCP (Model Context Protocol) is an open source standard for connecting AI assistants to the systems where data lives. But you probably already knew that if you have followed the recent hype around this topic since Anthropic made their announcement at the end of 2024.
To learn why MCPs are not that magic, but enable "magic" new use cases that boost the efficiency of engineers, we invited Dana Harrison, Staff Site Reliability Engineer at Telus. Dana goes into the use cases he and his team have been testing out over the past months to increase developer efficiency.
In our conversation we also talk about the difference between local and remote MCPs, the importance of keeping resilience in mind as MCPs connect to many different API backends, and how we can and should observe the interactions with MCPs.

Links we discussed
Anthropic Blog: https://www.anthropic.com/news/model-context-protocol
Dana's LinkedIn: https://www.linkedin.com/in/danaharrisonsre/overlay/about-this-profile/
The History & Power of Distributed Tracing with Christoph Neumueller & Thomas Rothschaedl
Mar 31 · PurePerformance
So you think Distributed Tracing is the new thing? Well - it's not! But it's never been as exciting as today!
In this episode we combine 50 years of Distributed Tracing experience across our guests and hosts. We invited Christoph Neumueller and Thomas Rothschaedl who have seen the early days of agent-based instrumentation, how global standards like the W3C Trace Context allowed tracing to connect large enterprise systems and how OpenTelemetry is commoditizing data collection across all tech stacks.
Tune in and learn about the difference between spans and traces, why collecting the data is only part of the story, how to combat the challenge when dealing with too much data and how traces relate and connect to logs, metrics and events.

Links we discussed
YouTube with Christoph: LINK WILL FOLLOW ONCE VIDEO IS POSTED
Christoph's LinkedIn: https://www.linkedin.com/in/christophneumueller/
Thomas's LinkedIn: https://www.linkedin.com/in/rothschaedl/
An Inside Look into Platform Engineering for Architects with the authors Max, Hilliary & Andi
Mar 17 · PurePerformance
In the ever-changing IT world, creating content that stays relevant for long is hard. One of the objectives of "Platform Engineering for Architects: Crafting Modern Platforms as a Product" was to stay timeless by providing practical examples of use cases not necessarily tied to current technology trends.
The book focuses on the importance of building a platform with a purpose, making the impact measurable, and ensuring the platform continuously evolves by continuously including the end users (the engineering teams) in the evolution of the platform.
Tune in to this episode and hear from Max Körbächer (Founder of Liquid Reply), Hilliary Lipsig (Senior Principal SRE at Red Hat), and Andi Grabner (Co-Host of PurePerformance) on what made them write a book on Platform Engineering and get some personal insights into what gets the authors excited about their respective topics.
If you have a chance, meet Max, Hilliary, and Andi at KubeCon in London. They will present at Platform Engineering Day and do a book signing at KubeCrawl!

Links we discussed:
Book on Amazon: https://www.amazon.com/Platform-Engineering-Architects-Crafting-platforms-ebook/dp/B0DH5DJFTH
Platform Engineering Day Session: https://colocatedeventseu2025.sched.com/event/1u5mX/platform-engineering-for-architects-crafting-platforms-as-a-product-max-korbacher-liquid-reply-hilliary-lipsig-red-hat
Hilliary Lipsig: https://www.linkedin.com/in/hilliary-lipsig-a5935245/
Max Körbächer: https://www.linkedin.com/in/maxkoerbaecher/
Andi Grabner: https://www.linkedin.com/in/grabnerandi/
How CERN analyzed 1 PetaByte per second using K8s with Ricardo Rocha
Mar 3 · PurePerformance
One PetaByte is the equivalent of 11,000 4K movies. And CERN's Large Hadron Collider (LHC) generates this every single second. Only a fraction of this data (~1 GB/s) is stored and analyzed using a multicluster batch job dispatcher with Kueue running on Kubernetes.
In this episode we have Ricardo Rocha, Platform Engineering Lead at CERN and CNCF Advocate, explaining why after 20 years at CERN he is still excited about the work he and his colleagues are doing. To kick things off we learn about the impact that the CNCF has on the scientific community and how to best balance an implementation of that scale between "ease of use" and "optimized for throughput". Tune in and learn about custom hardware being built 20 years ago and how the advent of the latest chip generation has impacted the evolution of data scientists around the globe.

Links we discussed
Ricardo's LinkedIn: https://www.linkedin.com/in/ricardo-rocha-739aa718/
KubeCon SLC Keynote: https://www.youtube.com/watch?v=xMmskWIlktA&list=PLj6h78yzYM2Pw4mRw4S-1p_xLARMqPkA7&index=5
Kueue CNCF Project: https://kubernetes.io/blog/2022/10/04/introducing-kueue/
Why Compliance is Important and not Boring with Michiel de Lepper
Feb 17 · PurePerformance
The word "Compliance" reminds many of mandatory training or audits. Two things not everyone gets excited about!
Tune in and meet Michiel de Lepper, who has spent most of his career in Security and Compliance. He gives us a different perspective on the importance of compliance, why it exists, how it intertwines with security and threat detection, what it has to do with security posture management and why he thinks it's one of the most exciting things in IT!

Links we discussed:
Michiel's LinkedIn: https://www.linkedin.com/in/madelepper/
Blog posts on security and compliance:
https://www.dynatrace.com/news/blog/dynatrace-for-executives-security-compliance/
https://www.dynatrace.com/news/blog/manage-compliance-and-resilience-at-scale-with-dynatrace/
https://www.dynatrace.com/news/blog/dynatrace-kspm-transforming-kubernetes-security-and-compliance/
What's next for Feature Flagging and OpenFeature with Ben Rometsch
Feb 3 · PurePerformance
Feature Flagging - some may call them "glorified if-statements" - has been a development practice for decades. But have we reached a stage where organizations are doing "Feature Flag-Driven Development"? After all, it took years to establish a test-driven development culture despite having great tools and frameworks available!
To learn more we invited Ben Rometsch, Co-Founder of Flagsmith, to chat about the history, state and future of Feature Flagging. He is giving us an update on where the market is heading, how the CNCF project OpenFeature and its community is driving best practices, what the role of AI might be and what he thinks might be next!

Couple of links we discussed during the episode:
Ben on LinkedIn: https://www.linkedin.com/in/benrometsch/
YouTube Video on Observability & Feature Flagging: https://www.youtube.com/watch?v=VZakh1_oEL8
OpenFeature: https://openfeature.dev/
Observability Predictions 2025 Under the Covers with Bernd Greifeneder
Jan 20 · PurePerformance
To predict the future, it's important to know the past. And that is true for Bernd Greifeneder, Founder and CTO of Dynatrace, who has been driving innovation in observability and security since he founded Dynatrace 20 years ago!

Bernd agreed to sit down, look behind the covers and answer the open questions that people posted on his LinkedIn in response to his recent observability prediction blog.
Tune in and learn about Bernd's thoughts on the evolution from reactive to preventive operations, who is behind the convergence of observability & security, why observability can help those that have serious intentions for sustainability and how observability becomes mandatory and indispensable for AI-driven services.

We mentioned a lot of links in today's session. Here they are:
Our podcast from 9 years ago: https://www.spreaker.com/episode/015-leading-the-apm-market-from-enterprise-into-cloud-native--9607734
Bernd's LinkedIn Post: https://www.linkedin.com/feed/update/urn:li:activity:7275101213237354497/
Predictions Blog: https://www.dynatrace.com/news/blog/observability-predictions-for-2025/
K8s Predictive Scaling Lab: https://github.com/Dynatrace/obslab-predictive-kubernetes-scaling
Security Video: https://www.youtube.com/watch?v=ICUwRy4JFTk
Carbon Impact App: https://www.youtube.com/watch?v=8Px0BB1U1yk
AI & LLM Observability Video: https://www.youtube.com/watch?v=eW2KuWFeZyY
From Infra to Services to Happy End Users: The role of SLOs at Uber with Vishnu Acharya
Jan 6 · PurePerformance
eBay, Yahoo, Netflix and then 10+ years at Uber. In this episode we sit down with Vishnu Acharya, Head of Network Infrastructure EMEA and Platform Engineering at Uber. Vishnu shares how Uber has scaled over the years to about 4000 engineers and how his team makes sure that infrastructure and platform engineering scales with the growing company and the growing demand on their digital services.
Tune in and learn about how Vishnu thinks about SLOs across all layers of the stack, how they manage to get better insights with their cloud providers and why it's important to have an end-to-end understanding of the most critical end user journeys.

Links we discussed:
Conference talk at Observability & SRE Summit: https://www.iqpc.com/events-observability-sre-summit/speakers/vishnu-acharya
Vishnu's LinkedIn Page: https://www.linkedin.com/in/vishnuacharya/
Uber Engineering Blog: https://www.uber.com/blog/engineering/
The Road to OpenTelemetry Adoption at Booking with Anton Timofieiev
Dec 23, 2024 · PurePerformance
For the past 10 years Anton has been working at Booking.com - one of the leading digital travel companies based out of Amsterdam. The journey that started as System Administrator has led Anton to be an Engineering Manager for Site Reliability where over the past 3 years he led the rollout and adoption of OpenTelemetry as the standard for getting observability into new cloud native deployments.
Tune in and learn how Anton saw R&D grow from 300 to 2000, why they replaced their home-grown Perl-based Observability Framework with OpenTelemetry, how they tackle adoption challenges and how they extend and contribute back to the open source community.

Links we discussed:
Anton's LinkedIn Profile: https://www.linkedin.com/in/antontimofieiev/
Observability & SRE Summit: https://www.iqpc.com/events-observability-sre-summit/speakers/anton-timofieiev
OpenTelemetry: https://opentelemetry.io/
Why Security and Compliance must not be a showstopper for SaaS with Milan Steskal
Dec 9, 2024 · PurePerformance
Most services are moving to SaaS - whether it’s email, collaboration, customer relations, or finance. But not everyone can go to SaaS - or at least that’s the initial reaction when navigating certain industries’ rules and regulations.
Milan Steskal - who worked in healthcare for many years - is now helping organizations ask the right questions and find the best solutions as they evaluate their options to move their observability data to SaaS. Tune in and learn about the questions to ask vendors and your internal security, privacy, and compliance teams. Milan also walks us through the capabilities SaaS vendors such as Dynatrace have put in place to protect data sent to the cloud so that it stays safe and only accessible to those needing access.

Links discussed today:
Milan's LinkedIn Page: https://www.linkedin.com/in/milansteskal/
Dynatrace Trust Center: https://www.dynatrace.com/company/trust-center/
Blogs on Trust: https://www.dynatrace.com/news/tag/trust-center/
Every Byte Counts: Web Performance Flashback with Andreas Taranetz
Nov 25, 2024 · PurePerformance
Andreas Taranetz is a software engineer and lecturer at the University of Vienna. He creates a lot of educational content around Web Performance Optimization. For the past seven years, he has also operated Wahlkabine, Austria's top website, for matching one's political views with the parties that are up for election.
This episode was an amazing flashback - reminding us about the time when Steve Souders - the "godfather" of Web Performance Optimization - educated web developers about optimizing CSS, JavaScript, and server-side roundtrips.
Tune in and learn why Web Performance is still such an important topic, how it relates to sustainability, why you should cache on every layer, and what the Static Site Paradox really is!

Links we discussed in the episode:
Andreas on LinkedIn: https://www.linkedin.com/in/andreas-taranetz/
Personal Website: https://andreas.taranetz.com/
We Are Developers Talk: https://www.youtube.com/live/KRemC82gsBk
Wahlkabine: https://wahlkabine.at/
Steve Souders: https://stevesouders.com/
The Security and Resiliency Challenges of Cloud Native Authorization with Alex Olivier
Nov 11, 2024 · PurePerformance
Authentication (validating who you claim to be) and Authorization (enforcing what you are allowed to do) are critical in modern software development. While authentication seems to be a solved problem, modern software development faces many challenges with secure, fast, and resilient authorization mechanisms.
To learn more about those challenges, we invited Alex Olivier, Co-Founder and CPO at Cerbos, an Open Source Scalable Authorization Solution. Alex shared insights on attribute-based vs. role-based access control, the difference between stateful and stateless authorization implementations, why Broken Access Control is in the OWASP Top 10 Security Vulnerabilities, and how to observe the authorization solution for performance, security, and auditing purposes.

Links we discussed during the episode:
Alex's LinkedIn: https://www.linkedin.com/in/alexolivier/
Cerbos on GitHub: https://github.com/cerbos/cerbos
OWASP Broken Access Control: https://owasp.org/www-community/Broken_Access_Control
Open Source: Why it's the Best Thing that happened to IT with Marcio Lena
Oct 28, 2024 · PurePerformance
"Open Source is the Best Thing that happened to IT!" Powerful words from Marcio Lena, who has been using and contributing back to open source for the past 20+ years. Besides being an avid advocate for open source, Marcio also knows the concerns of large enterprises when picking open source projects.
Tune in and follow our discussion about how to identify a healthy open-source project, how to balance between vendor and community lock-in, the power of open standards such as OpenTelemetry, open source business models, and why contributing to open source is not limited to code but also includes documentation, education and advocacy!

Links we discussed:
Marcio's LinkedIn Page: https://www.linkedin.com/in/marcio-lena/
CNCF DevStats: https://devstats.cncf.io/
Linux Foundation Events: https://events.linuxfoundation.org/
CNCF Ambassadors: https://www.cncf.io/people/ambassadors/
Understanding DORA - Europe's Digital Operational Resiliency Act with Kay Young
Oct 14, 2024 · PurePerformance
DORA - the EU's Digital Operational Resilience Act - will take effect in January of 2025 and is currently top of mind for IT Leaders across all financial service institutions that operate in the European Union. But what is DORA really? Why is this important? How can institutions meet the DORA requirements? What is the role of observability, automation and AI in all of this?
To answer all those and more questions we invited Kay Young, Sr Principal Product Manager at Dynatrace, who has been working with organizations around the globe that have been tasked to implement regulations such as DORA, GDPR, FedRAMP or others.
In our conversation we also touch on third-party risk management as well as resiliency testing and incident reporting.

Resources we discussed:
Kay's LinkedIn Profile: https://www.linkedin.com/in/karlien-young-4a156730/
What is DORA blog: https://www.dynatrace.com/news/blog/what-is-dora/
Taming DORA compliance: https://www.dynatrace.com/news/blog/taming-dora-compliance-with-ai-observability-and-security/
Blog on Dynatrace's DORA compliance journey: https://www.dynatrace.com/news/blog/the-dynatrace-journey-toward-dora-compliance/
Beyond DORA compliance: https://www.dynatrace.com/news/blog/dora-how-dynatrace-helps-the-financial-sector-stay-resilient/
Lessons learned when building the NAIS Platform with Hans Kristian Flaatten
Sep 30, 2024 · PurePerformance
NAIS (pronounced like NICE) is a team central application platform that provides DevOps teams with the tools they need to build, test, deploy, run and observe applications.
In this episode Hans Kristian Flaatten, Platform Engineer at NAV, walks us through the WHYs, HOWs and challenges of building modern platforms on Kubernetes. Tune in and hear WHY they defined their own abstraction layer for applications, HOW developers benefit from that platform and WHY they developed their developer portal instead of going with other popular available choices.

Links we discussed:
Hans Kristian's LinkedIn: https://www.linkedin.com/in/hansflaatten/
NAIS Documentation: https://docs.nais.io/
Why Developer Observability is not a tooling problem with Viktor Farcic
Sep 16, 2024 · PurePerformance
"We will overwhelm developers if we give them the same specialized observability, security or deployment tools that are used by their platform engineering, operations, SREs or security teams!" - says Viktor Farcic, Developer Advocate at UpBound and host of The DevOps Toolkit YouTube channel.
Tune in and hear us discuss making observability more easily accessible for developers, what Viktor doesn't like about Kubernetes and how Crossplane - the cloud native control plane framework - can be the gateway to real product-oriented platform engineering!

Here are the links we discussed during this episode:
Viktor on LinkedIn: https://www.linkedin.com/in/viktorfarcic/
DevOps Toolkit: https://www.youtube.com/@DevOpsToolkit
Crossplane: https://www.crossplane.io/
Pitfalls to avoid when going all-in on OpenTelemetry with Hans Kristian Flaatten
Sep 2, 2024 · PurePerformance
Hans Kristian is a Platform Engineer for NAV's Kubernetes Platform Nais hosting Norway's welfare services. With 10 years on Kubernetes, 2000 apps and 1000 developers across more than 100 teams there was a need to make OpenTelemetry adoption as easy as possible.
Tune in as we hear from Hans Kristian, who is also a CNCF Ambassador and hosts Cloud Native Day Bergen, why OpenTelemetry is chosen by the public sector, why it took much longer to adopt, which challenges they had to scale the observability backend and how they are tackling the "noisy data problem".

Links we discussed in the episode:
Follow Hans Kristian on LinkedIn: https://www.linkedin.com/in/hansflaatten/
From 0 to 100 OTel Blog: https://nais.io/blog/posts/otel-from-0-to-100/?foo=bar
Cloud Native Day Bergen: https://2024.cloudnativebergen.dev/
Public Money, Public Code. How we open source everything we do!: https://m.youtube.com/watch?v=4v05Huy2mlw&pp=ygUkT3BlbiBzb3VyY2Ugb3BlbiBnb3Zlcm5tZW50IGZsYWF0dGVu
State of Platform Engineering in Norway: https://m.youtube.com/watch?v=3WFZhETlS9s&pp=ygUYc3RhdGUgb2YgcGxhdGZvcm0gbm9yd2F5
So you think you should Serverless? Things to know before you do with Sebastian Vietz!
Aug 26, 2024 · PurePerformance
Has one of the decision makers in your organization decided that you have to go "all in on technology X" because they saw a great presentation at a conference or got a great sales pitch from a vendor? If that is the case then this episode is for you and you should forward it to those decision makers.
Sebastian Vietz, Director of Reliability Engineering and Host of the Reliability Enablers Podcast, shares his thoughts on considerations when picking a technology like Serverless. We discuss the importance of knowing limits, best fit architectural patterns and things that should influence your technology decisions!
Being aware of cold starts, a 20,000 concurrent request limit, or 512 MB being an ideal size for Lambda are just some of the things we can all learn from Sebastian.

Additional links we discussed:
Sebastian's LinkedIn: https://www.linkedin.com/in/sebastianvietz/
Reliability Podcast: https://podnews.net/podcast/ibe8k
More things on serverless: https://serverlessland.com/
Observability that is Battle tested by Millions with Marco Sussitz and Wolfgang Ziegler
Aug 12, 2024 · PurePerformance
When your code runs on more than 6 million systems - many of them business critical - then this is really exciting news for Marco and Wolfgang, Dynatrace OneAgent Java Team members. Their code powers auto-instrumentation and collection of all observability signals of Java based applications running on every possible stack: containers in k8s, serverless, VMs, on your workstation or even the mainframe.
Tune in as we sit down with Marco and Wolfgang to learn what it means to continuously innovate on agent-based instrumentation with 160+ other engineers across the globe that also focus on OneAgent. They share insights on how they develop their observability code, how they continuously test across all supported environments, what the processes at Dynatrace look like to avoid situations like the recent CrowdStrike outage and how they integrate and collaborate with other communities and tools such as OpenTelemetry!

Things we discussed during the episode
Dynatrace OneAgent: https://www.dynatrace.com/platform/oneagent/
Dynatrace for Java: https://www.dynatrace.com/technologies/java-monitoring/
OpenTelemetry and Dynatrace: https://docs.dynatrace.com/docs/extend-dynatrace/opentelemetry
Jobs at Dynatrace: https://careers.dynatrace.com/
Using Observability to Prioritize CrowdStrike Remediation with Josh Wood
Aug 5, 2024 · PurePerformance
When thousands of systems show a blue screen - which ones do you fix first to quickly bring up your most critical systems? For that you need to know which systems are impacted, which mission critical applications run on them, and which dependent systems are also impacted by something like the recent CrowdStrike incident!
We have invited Josh Wood, Principal Solutions Engineer at Dynatrace, who was one of the first responders helping organizations leverage observability data to identify which systems to fix first to bring critical apps such as ATMs, Self-Service Terminals and POS (Point of Sale) systems back up again quickly.
In this special episode Josh walks us through the technical details of the CrowdStrike BSOD (Blue Screen of Death), what caused it, how to leverage observability to get a prioritized list of systems to fix first and what organizations can do to prevent software impacting issues in the future.

Here are the links we discussed in the episode:
Josh on LinkedIn: https://www.linkedin.com/in/joshuadwood/
Josh's blog on CrowdStrike BSOD: https://www.dynatrace.com/news/blog/crowdstrike-bsod-quickly-find-machines-impacted-by-the-crowdstrike-issue/
CrowdStrike Incident Takeaway Blog: https://www.dynatrace.com/news/blog/crowdstrike-incident-revisiting-vendor-quality-control/