Trending Articles

article thumbnail

How OpenAI’s Downtime Incident Teaches Us to Build More Resilient Systems

DZone

On December 11, 2024, OpenAI services experienced significant downtime due to an issue stemming from a new telemetry service deployment. This incident impacted API, ChatGPT, and Sora services, resulting in service disruptions that lasted for several hours. As a company that aims to provide accurate and efficient AI solutions, OpenAI has shared a detailed post-mortem report to transparently discuss what went wrong and how they plan to prevent similar occurrences in the future.

Systems 204
article thumbnail

Dynatrace KSPM: Transforming Kubernetes security and compliance

Dynatrace

We are excited to announce that Dynatrace has introduced new capabilities for Kubernetes security posture management. This solution provides continuous assessment of adherence to technical standards, visibility, prioritization, and remediation for misconfigurations and compliance findings across Kubernetes environments. Why manual audits and custom scripts fall short for Kubernetes security posture management In the dynamic and complex world of Kubernetes, relying on manual audits, custom script

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Title Launch Observability at Netflix Scale

The Netflix TechBlog

Part 1: Understanding The Challenges By: VarunKhaitan With special thanks to my stunning colleagues: Mallika Rao , Esmir Mesic , HugoMarques Introduction At Netflix, we manage over a thousand global content launches each month, backed by billions of dollars in annual investment. Ensuring the success and discoverability of each title across our platform is a top priority, as we aim to connect every story with the right audience to delight our members.

Traffic 172
article thumbnail

Tech predictions for 2025 and beyond

All Things Distributed

We've entered an era of unprecedented societal challenges and rapid technological advancements. Harnessing technology for good has become both an ethical imperative and a profitable endeavor. These are the areas where I see technology shaping society in 2025 and beyond—and it all starts with mission-driven work.

article thumbnail

Generative Logic

O'Reilly

Alibabas latest model, QwQ-32B-Preview , has gained some impressive reviews for its reasoning abilities. Like OpenAIs GPT-4 o1, 1 its training has emphasized reasoning rather than just reproducing language. That seemed like something worth testing outor at least playing around withso when I heard that it very quickly became available in Ollama and wasnt too large to run on a moderately well-equipped laptop, I downloaded QwQ and tried it out.

C++ 76
article thumbnail

New continuous compliance requirements drive the need to converge observability and security

Dynatrace

At the time when I was building the most innovative observability company, security seemed too distant. However, customers began approaching me, praising Dynatraces deep end-to-end insights into even the most complex digital service deployments and asking how to use it for security compliance, exposure, and response use cases. I realized that our platforms unique ability to contextualize security events, metrics, logs, traces, and user behavior could revolutionize the security domain by convergi

Analytics 214
article thumbnail

Charge Vertical Scaling With the Latest Java GCs

DZone

In the dynamic landscape of Java ecosystem enhancements, one could miss an important progress that has been made in Java Garbage Collection (GC) in recent years. Meanwhile, the latest generations of GC bring far-reaching consequences for running Java applications. This article intends to highlight the significant effects brought to us by ZGC and Shenandoah.

Java 204

More Trending

article thumbnail

Performing ELT Operations Using DOMAINs

Percona

First things first, a couple of definitions to clear up: ETL: Extract, Transform, Load. Transforms data before loading it into the target system extract raw data from various sources use a secondary processing server to transform that data load that data into a target database ELT: Extra, Load, Transform.

article thumbnail

Our 10 most popular web performance articles of 2024

Speed Curve

We love writing articles and blog posts that help folks solve real web performance and UX problems. Here are the ones you loved most in 2024. (The number one item may surprise you!) Some of these articles come from our recently published Web Performance Guide – a collection of evergreen how-to resources (written by actual humans!) that will help you master website monitoring, analytics, and diagnostics.

article thumbnail

HTTP monitors on the latest Dynatrace platform extend insights into the health of your API endpoints and simplify test management

Dynatrace

The improved UI of the new Synthetic app makes managing your synthetic tests and analyzing their results easier and more effective. Exploratory analytics now cover more bespoke scenarios, allowing you to access any element of test results stored in the Dynatrace Grail data lakehouse. This allows you to build customized visualizations with Dashboards or perform in-depth analysis with Notebooks.

article thumbnail

Which Flow Is Best for Your Data Needs: Time Series vs. Streaming Databases

DZone

Data is being generated from various sources, including electronic devices, machines, and social media, across all industries. However, unless it is processed and stored effectively, it holds little value. A significant evolution is taking place in the way data is organized for further analysis. Some databases prioritize organizing data based on its time of generation, while others focus on different functionalities.

article thumbnail

These 7 Edge Data Challenges Will Test Companies the Most in 2025

VoltDB

Edge computing has transformed how businesses and industries process and manage data. By bringing computation closer to the data source, edge-based deployments reduce latency, enhance real-time capabilities, and optimize network bandwidth. However, as organizations accelerate their adoption of edge technologies, things are getting more difficult in the form of security, bottlenecks, and more. 2025 portends to be the year these difficulties come to a head.

IoT 52
article thumbnail

MySQL with Diagrams Part One: Replication Architecture

Percona

In this series, MySQL with Diagrams, Ill use diagrams to explain internals, architectures, and structures as detailed as possible. In basic terms, here’s how replication works: the transactions are written into a binary log on the source side, carried into the replica, and applied.

article thumbnail

Power Laws and Heavy-Tail Distributions in Hyperscale Microservices Architectures

Abhishek Tiwari

Recent analyses of Meta and Alibaba’s production microservices architectures reveal distinct patterns of heavy-tail and power law distributions. These patterns manifest in service scale, request patterns, and resource utilisation, offering insights into the inherent characteristics of large-scale distributed systems.

article thumbnail

When things go sideways: Troubleshooting the OpenTelemetry Operator

Dynatrace

This blog post was co-written with Reese Lee. If you already have an application running in Kubernetes and are exploring using OpenTelemetry to gain insights into the health and performance of your app and cluster, you might be interested in an implementation of the Kubernetes Operator called the OpenTelemetry Operator. As youll learn shortly, due to its range of capabilities, the Operator is your go-to for (almost) hassle-free OpenTelemetry management.

Java 183
article thumbnail

Monitor Spring Boot Web Application Performance Using Micrometer and InfluxDB

DZone

As a Java developer, there's nothing more frustrating than dealing with sluggish application performance in production. Diagnosing issues within complex microservice architectures can quickly become a time-consuming and daunting task. Fortunately, the Spring Boot framework offers a powerful observability stack that streamlines real-time monitoring and performance analysis.

article thumbnail

My code::dive talk video is available: New Q&A

Sutter's Mill

Two weeks ago, Bjarne and I and lots of ISO committee members had a blast at the code::dive C++ conference held on November 25, just two days after the end of the Wrocaw ISO C++ meeting. Thanks again to Nokia for hosting the ISO meeting, and for inviting us all to speak at their conference! My talk was an updated-and-shortened version of my CppCon keynote (which I also gave at Meeting C++; I’ll post a link to that video too once it’s posted): If you already saw the CppCon talk, you c

Code 64
article thumbnail

How to Get Useful Information Out of Valkey/Redis Keys

Percona

In this blog post we will talk about how to get useful information out of the Valkey OR Redis keys. This would be helpful in many scenarios, like troubleshooting a specific key thing and verifying the resources, especially the Memory the data set is holding.

article thumbnail

Summarizing Books as Podcasts

O'Reilly

Like just about everyone, we were impressed by the ability of NotebookLM to generate podcasts: Two virtual people holding a discussion. You can give it some links, and it will generate a podcast based on the links. The podcasts were interesting and engaging. But they also had some limitations. The problem with NotebookLM is that, while you can give it a prompt, it largely does what its going to do.

article thumbnail

Predictable costs for Log Management & Analytics with new simplified licensing plan

Dynatrace

As cloud complexity increases and security concerns mount, organizations need log analytics to discover and investigate issues and gain critical business intelligence. But exploring the breadth of log analytics scenarios with most log vendors often results in unexpectedly high monthly log bills and aggressive year-over-year costs. To give organizations the freedom to explore log analytics without barriers due to cost concerns, Dynatrace is proud to announce a new Dynatrace Platform Subscription

Analytics 155
article thumbnail

Spring Boot WebClient: Performance Optimization and Resilience

DZone

Spring WebClient is a reactive, non-blocking HTTP (HyperText Transfer Protocol) client designed for making requests to external services. It belongs to the Spring WebFlux framework and provides advanced, scalable handling of HTTP requests more efficiently than the RestTemplate. WebClient also supports parallel and reactive programming, making it suitable to perform a large volume of operations without blocking requests.

article thumbnail

Unveiling Graph Structures in Microservices: Service Dependency Graph, Call Graph, and Causal Graph

Abhishek Tiwari

The rise of service-oriented architecture (SOA) and microservices architecture has led to a major shift in software development, enabling the creation of complex, distributed systems composed of independent, loosely coupled services. These architectures offer numerous benefits, including scalability, flexibility, and resilience. However, the distributed nature of these systems introduces new challenges related to understanding, managing, and analysing their behaviour.

article thumbnail

MySQL Transaction ERROR 1412 and Isolation Levels

Percona

This blog post explains the cause of “ERROR 1412 (HY000): Table definition has changed, please retry transaction” with the specific Isolation level settings. Background As per the MySQL documentation, this error should occur for “operations that make a temporary copy of the original table and delete the original table when the temporary copy is built.

article thumbnail

How to understand TPC-C tpmC and TPROC-C NOPM and what is ‘good’ performance?

HammerDB

More than ever, we see confusion in interpreting and comparing the performance of databases with workloads derived from the TPC-Council’s TPC-C specification, including HammerDB’s TPROC-C NOPM and TPM. In this post, we revisit how to interpret transactional database performance metrics and give guidance on what levels of performance should be expected on up-to-date hardware and software in 2024. tpmC tpmC is the transactions per minute metric that is the measurement of the official T

C++ 58
article thumbnail

Dynatrace wins InfoWorld’s 2024 Technology of the Year Award for DevOps: Observability

Dynatrace

We’re excited to share that Dynatrace has been recognized in the DevOps: Observability category of InfoWorlds 2024 Technology of the Year awards! This honor reflects our dedication to helping organizations tackle complexity and achieve greater clarity in managing their digital environments. Why this award matters The InfoWorld Technology of the Year awards celebrate standout solutions in AI, APIs, applications, business intelligence, cloud, data management, DevOps, and software development

DevOps 155
article thumbnail

Stress Testing for Resilience in Modern Infrastructure

DZone

Today, users' expectations of seamless performance mean the system cannot afford downtime or disruption that might turn into losses in revenue and reputation. Therefore, no one can underestimate the role of stress testing in ensuring that the systems are resilient against unfortunate events and failures. Indeed, chaos engineering is an innovation concerning testing infrastructure resilience these days.

article thumbnail

Cache Me If You Can: Taming the Caching Complexity of Microservice Call Graphs

Abhishek Tiwari

As microservices architectures have become increasingly prevalent in modern software systems, they’ve brought both tremendous benefits and significant challenges. One of the most pressing challenges has been maintaining performance at scale while dealing with complex service dependencies and network communication overhead. Today, I want to explore MuCache, an innovative framework recently presented by Zhang et. al at USENIX NSDI 2024 that tackles this challenge head-on by providing automat

Cache 59
article thumbnail

Software Licensing and Open Source: Why Definitions Matter

Percona

Over the last few years, there have been many calls to evolve the Open Source Definition (OSD) to fit the modern world. Some would like to seenoncompetitive licenses such as the Server Side Public License (SSPL) or Elastic License considered open source.

article thumbnail

ICYMI: Some of our most exciting product updates of 2024!

Speed Curve

Every year feels like a big year here at SpeedCurve, and 2024 was no exception. Here's a recap of product highlights designed to make your performance monitoring even better and easier! Our biggest achievements this year have centred on making it easier for you to: Gather more meaningful real user monitoring (RUM) data Get actionable insights from Core Web Vitals Simplify your synthetic testing Get expert performance coaching when and how you need it Keep reading to learn more.

Cache 52
article thumbnail

Five observability predictions for 2025

Dynatrace

As the digital world grows more complex, 2025 will bring a tipping point for organizations navigating increasingly dynamic and interconnected IT environments. Observability, long a cornerstone of IT operations, will take on transformative new roles. Driven by rapid advances in AI, evolving regulatory frameworks, and mounting sustainability pressures, observability will no longer be a passive diagnostic tool.

Energy 144
article thumbnail

Using the Log Node in IBM App Connect Enterprise

DZone

In the world of IBM App Connect Enterprise (ACE), effective logging is crucial for monitoring and troubleshooting. With the introduction of the Log node, it's now easier than ever to log ExceptionList inserts directly into the activity log, which can be viewed from the WebUI. The Log node can be especially valuable, often replacing the Trace node in various scenarios.

article thumbnail

Policy Zones: Purpose Limitation at Scale using Information Flow Control

Abhishek Tiwari

At the heart of privacy lies the principle of purpose limitation, dictating that data should only be processed for explicitly stated purposes. This principle presents a considerable challenge, especially for organisations operating at the scale of Meta, which handles vast amounts of data from billions of users. This article examines Policy Zones, a core component of Meta’s Privacy Aware Infrastructure (PAI), designed to address the complexities of purpose limitation at scale.

article thumbnail

Percona Wins 2024 Digital Innovator Award from Intellyx

Percona

As organizations turn to the cloud for more and more products and services, many are discovering that the majority of these offerings come with hidden, yet significant, costs.

article thumbnail

Performance Hero: Pat Meenan

Speed Curve

This month, we celebrate everything that OG performance hero Pat Meenan has done – and continues to do – for the web performance community. When we started the Performance Hero series earlier this year, we had an idea of the types of folks in our community we wanted to acknowledge: People who are making a difference in web performance People who are humble People who give without expectation People who don't necessarily crave the spotlight When looking at these attributes – for

article thumbnail

Scaling EPAS 17 by EDB

HammerDB

Blog post from EDB on how to measure PostgreSQL performance with HammerDB can be found here Scaling Breakthrough: EPAS 17 Scales Twice as Well, Thanks PostgreSQL! Methodology shows the correct use of generating performance profiles with increasing numbers of virtual users to highlight the difference between performance and scalability.

article thumbnail

How and Why the Developer-First Approach Is Changing the Observability Landscape

DZone

Developers play a crucial role in modern companies. If we want our product to be successful, we need to have a developer-first approach and include observability from day one. Read on to understand why. The World Has Changed Many things have changed in the last decade. In our quest for greater scalability, resilience, and flexibility within the digital infrastructure of our organization, there has been a strategic pivot away from traditional monolithic application architectures towards embracing

article thumbnail

Microservice Architecture of Alibaba

Abhishek Tiwari

Alibaba has constructed a sophisticated microservices architecture to address the challenges of serving its vast user base and handling complex business operations. This article delves into the nuances of Alibaba’s microservices architecture, presenting critical insights into its design, scalability, performance optimisation, and resource management strategies.