August, 2023

Shift-Left Monitoring Approach for Cloud Apps in Containers

DZone

Shift-left is an approach to software development and operations that emphasizes testing, monitoring, and automation earlier in the software development lifecycle. The goal of the shift-left approach is to prevent problems before they arise by catching them early and addressing them quickly. When you identify a scalability issue or a bug early, it is quicker and more cost-effective to resolve it.

Cloud 289

Dynatrace and Google unleash cloud-native observability for GKE Autopilot

Dynatrace

Cloud-native observability for Google’s fully managed GKE Autopilot clusters demands new methods of gathering metrics, traces, and logs for workloads, pods, and containers to enable better accessibility for operations teams. Managed Kubernetes clusters on GKE Autopilot have gained unprecedented momentum among enterprises. GKE Autopilot empowers organizations to invest in creating elegant digital experiences for their customers in lieu of expensive infrastructure management.

Google 264

Zero Configuration Service Mesh with On-Demand Cluster Discovery

The Netflix TechBlog

By David Vroom, James Mulcahy, Ling Yuan, and Rob Gulewich. In this post we discuss Netflix’s adoption of service mesh: some history, motivations, and how we worked with Kinvolk and the Envoy community on a feature that streamlines service mesh adoption in complex microservice environments: on-demand cluster discovery. A brief history of IPC at Netflix: Netflix was early to the cloud, particularly for large-scale companies: we began the migration in 2008, and by 2010, Netflix streaming was fully run o

Traffic 226

PostgreSQL Parameters: Scope and Priority Users Should Know

Percona

PostgreSQL allows its users to set parameters at different scopes, and the same parameter can be specified at different places and using different methods. And there could be conflicts. Someone might be wondering why certain changes are not coming into effect, so it is important to understand/recollect the scope and priority of settings. In this blog, I am trying to list the options available for users and list them in increasing order of priority.

Database 120

The Swedbank Outage shows that Change Controls don't work

High Scalability

This week I’ve been reading through the recent judgment from the Swedish FSA on the Swedbank outage. If you’re unfamiliar with this story, Swedbank had a major outage in April 2022 that was caused by an unapproved change to their IT systems. It temporarily left nearly a million customers with incorrect balances, many of whom were unable to meet payments.

How Is PHP Web Development Beneficial For Online Business?

Official Blog - World Web Technology

When you decide to develop a business website, there are many things you need to consider to reap all the benefits it offers. For example, website design is very crucial to impress your website visitors. Not just that, you also need to ensure that the website has original content and loads quickly. Similarly, using the best programming language for your website is also crucial.

Time Series Analysis: VARMAX-As-A-Service

DZone

VARMAX-As-A-Service is an MLOps approach for the unification and reuse of statistical models and machine learning models deployment pipelines. It is the first of a series of articles that will be built on top of that project, representing experiments with various statistical and machine learning models, data pipelines implemented using existing DAG tools, and storage services, both cloud-based and alternative on-premises solutions.

Storage 264

More Trending

Curbing Connection Churn in Zuul

The Netflix TechBlog

By Arthur Gonigberg, Argha C. Plaintext Past: When Zuul was designed and developed, there was an inherent assumption that connections were effectively free, given we weren’t using mutual TLS (mTLS). It’s built on top of Netty, using event loops for non-blocking execution of requests, one loop per core. To reduce contention among event loops, we created connection pools for each, keeping them completely independent.

Traffic 176

Data Redundancy With the PostgreSQL Citus Extension

Percona

Over the years, I’ve had the opportunity to architect all sorts of configurations using Postgres as a backend. I’ve always found it very cool and satisfying to implement sophisticated business rules, often in more ways than one has fingers and toes. So, it’s not an understatement when I say that Citus is one of the more interesting technologies that I’ve come across when scaling PostgreSQL.

C++ 98

Reducing Embodied Carbon is Important

ACM Sigarch

(Editor’s Note: This post was submitted as a rebuttal to Andrew Chien’s July 24 SIGARCH Blog Post.) The recent post “Why Embodied Carbon is a poor Architecture Design metric, and Operational Carbon remains an important Problem” by Prof. Andrew Chien rightfully raises awareness of the challenges of reducing operational carbon. Specifically, we agree that one of the grand challenges of sustainable computing is matching electrical load with the availability of low-carbon electricity.

Energy 98

The next generation of developer productivity

O'Reilly

To follow up on our previous survey about low-code and no-code tools, we decided to run another short survey about tools specifically for software developers, including, but not limited to, GitHub Copilot and ChatGPT. We’re interested in how “developer enablement” tools of all sorts are changing the workplace. Our survey showed that while these tools increased productivity, they aren’t without their costs.

Diagnose CPU Spikes in a Non-Intrusive Manner

DZone

In this post, we are going to discuss a non-intrusive approach (i.e., an approach that doesn’t add any noticeable overhead to the application) to diagnose CPU spikes. Thus, you can use this approach in your production environment to troubleshoot CPU spikes.

262

Customer expectations for retail: Beyond digital experience

Dynatrace

Digital experience is often considered the most important customer-facing aspect of digital commerce. This is typically the first thing that comes to mind for IT professionals working in the retail industry when evaluating holiday readiness. While digital experience has many facets, transaction speed usually ranks among the most important. Almost two decades ago, a Google experiment showed that fast-loading transactions are more important to customers than content quality—even small increases in

Retail 246

AVA Discovery View: Surfacing Authentic Moments

The Netflix TechBlog

By: Hamid Shahid, Laura Johnson, Tiffany Low. Synopsis: At Netflix, we have created millions of pieces of artwork to represent our titles. Each artwork tells a story about the title it represents. From our testing on promotional assets, we know which of these assets have performed well and which ones haven’t. Through this, our teams have developed an intuition of what visual and thematic artwork characteristics work well for what genres of titles.

Media 175

Failover and Recovery Scenarios in InnoDB Cluster and ClusterSet

Percona

This blog post will focus on failover and recovery scenarios inside the InnoDB Cluster and ClusterSet environment. To learn more about deploying these topologies, you can refer to the manuals for InnoDB Cluster and InnoDB ClusterSet setup. In the snippet below, we have two clusters (cluster1 and cluster2) connected via an async channel; combined, they are known as a ClusterSet topology.

Traffic 98

Cadence 1.0: Uber Releases Its Scalable Workflow Orchestration Platform

InfoQ

Uber released a major version of its workflow orchestration platform named Cadence after six years in development. Uber and other companies use Cadence to build stateful services at scale using native programming languages.

The ChatGPT Surge

O'Reilly

I’m sure that nobody will be surprised that the number of searches for ChatGPT on the O’Reilly learning platform skyrocketed after its release in November, 2022. It might be a surprise how quickly it got to the top of our charts: it peaked in May as the 6th most common search query. Then it dropped almost as quickly: it dropped back to #8 in June, and fell further to #19 in July.

How to Configure Istio, Prometheus and Grafana for Monitoring

DZone

Intro to Istio Observability Using Prometheus: Istio service mesh abstracts the network from the application layers using sidecar proxies. You can apply security and advanced networking policies to all communication across your infrastructure using Istio. But another important feature of Istio is observability. You can use Istio to observe the performance and behavior of all the microservices in your infrastructure.

Complete Kubernetes observability with logs in topology context

Dynatrace

Kubernetes workload management is easier with a centralized observability platform. When deploying applications with Kubernetes, the configuration is flexible and declarative, allowing for scalability. However, due to the distributed nature of Kubernetes, it can be difficult to understand overall deployment health and the status of Kubernetes clusters.

Running A Page Speed Test: Monitoring vs. Measuring

Smashing Magazine

By Geoff Graham, 2023-08-10. This article is sponsored by DebugBear. There is no shortage of ways to measure the speed of a webpage. The tooling to get a report with details from the time it takes to establish a server connection to the time it takes for the full page to render is out there.

Speed 85

The Deprecation of qpress/QuickLZ Compression Algorithm

Percona

To reduce the backup size, save storage space, and speed up the backup and restore process, you can compress a backup with Percona XtraBackup. To compress and decompress local or streaming backups, Percona XtraBackup uses xbstream, a binary that supports simultaneously compressing and streaming dynamically generated files. The XtraBackup --compress option makes XtraBackup compress all output data, including the transaction log file and metadata files, with one of the supported compr

My C++ Now 2023 talk is online: “A TypeScript for C++”

Sutter's Mill

Thanks again to C++ Now for inviting me to speak this year in glorious Aspen, Colorado, USA! It was nice to see many old friends again there and make a few new ones too. The talk I gave there was just posted on YouTube. At CppCon 2022, I argued for why we should try to make C++ 10x simpler and safer, and I presented my own incomplete experimental compiler, cppfront.

C++ 83

Golem Unveils a Resilient Computing Platform for Serverless Workers with WebAssembly Component Model

InfoQ

Recently Golem released its flagship product Golem Cloud, a durable computing platform allowing developers to build and deploy long-running, stateful serverless workers that are resistant to failures, upgrades, and updates. The product is currently in developer preview.

The Agile Architect: Mastering Architectural Observability To Slay Technical Debt

DZone

It's a story as old as (UNIX) time: in scene one, we meet an international online retailer whose software infrastructure is based on a sprawling monolithic application. As their story unfolds, and with the relentless additions of new features and modifications to existing ones, this once sleek application has grown into a complex, intertwined system.

IT modernization improves public health services at state human services agencies

Dynatrace

For many organizations, the pandemic was a crash course in IT modernization as agencies scrambled to meet the community’s needs as details unfolded. The early days of the pandemic highlighted the importance of communicating with the public and getting the message right.

How We Optimized Performance To Serve A Global Audience

Smashing Magazine

By Liran Cohen, 2023-08-03. I work for Bookaway, a digital travel brand. As an online booking platform, we connect travelers with transport providers worldwide, offering bus, ferry, train, and car transfers in over 30 countries.

High Availability vs. Fault Tolerance: Is FT’s 00.001% Edge in Uptime Worth the Headache?

Percona

Estimates vary, but most reports put the average cost of unplanned database downtime at approximately $300,000 to $500,000 per hour, or $5,000 to $8,000 per minute. With so much at stake, database high availability and fault tolerance have become must-have items, but many companies just aren’t certain which one they must have. This blog article will examine shared attributes of high availability (HA) and fault tolerance (FT).
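
The per-minute figures above come from dividing the hourly estimates by 60 (the upper bound works out to roughly $8,333, which the text rounds to $8,000). A minimal Python sketch of that arithmetic follows. The availability levels (99.99% and 99.999%) and the function names are illustrative assumptions, not figures or definitions taken from the article.

```python
MINUTES_PER_HOUR = 60
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600 minutes, ignoring leap years


def cost_per_minute(cost_per_hour: float) -> float:
    """Convert an hourly downtime cost into a per-minute cost."""
    return cost_per_hour / MINUTES_PER_HOUR


def downtime_minutes_per_year(availability_percent: float) -> float:
    """Expected yearly downtime for a given availability percentage."""
    return MINUTES_PER_YEAR * (1 - availability_percent / 100)


def annual_downtime_cost(availability_percent: float, cost_per_hour: float) -> float:
    """Yearly downtime cost at a given availability and hourly cost."""
    minutes = downtime_minutes_per_year(availability_percent)
    return minutes * cost_per_minute(cost_per_hour)


if __name__ == "__main__":
    # Hourly estimates quoted in the text.
    for hourly in (300_000, 500_000):
        print(f"${hourly:,}/hour is about ${cost_per_minute(hourly):,.0f}/minute")

    # Assumed availability levels, for illustration only.
    for availability in (99.99, 99.999):
        minutes = downtime_minutes_per_year(availability)
        cost = annual_downtime_cost(availability, 300_000)
        print(f"{availability}% uptime: {minutes:.2f} min/year, about ${cost:,.0f}/year")
```

Under these assumptions, the difference between the two availability levels is a matter of tens of minutes per year, which is the kind of gap the HA-versus-FT comparison has to weigh against added complexity.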

Basis Path Testing in Software Testing

Testlodge

Basis path testing in software testing is a white box method where the tester examines the codebase to identify all possible paths that could be taken by the user to achieve their aims. These paths are then written as test cases to ensure all the different identified scenarios in the main branches are covered. Higher test coverage is then achieved when test cases are executed, typically through a form of unit testing.
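
As a rough illustration of turning paths into test cases, the sketch below uses a hypothetical `shipping_fee` function (not from the article) with two decision points. Its cyclomatic complexity is the number of decisions plus one, so a basis set needs three linearly independent paths, even though four end-to-end paths exist in total.

```python
def shipping_fee(order_total: float, is_member: bool) -> float:
    """Hypothetical function under test with two decision points."""
    fee = 5.0
    if order_total >= 50:  # decision 1: free shipping threshold
        fee = 0.0
    if is_member:          # decision 2: members pay half
        fee = fee / 2
    return fee


# Cyclomatic complexity = decisions + 1 = 3, so the basis set has 3 paths.
# Each case after the baseline flips exactly one decision.
BASIS_PATH_CASES = [
    # (order_total, is_member, expected_fee)
    (20.0, False, 5.0),  # baseline: both decisions false
    (80.0, False, 0.0),  # decision 1 true, decision 2 false
    (20.0, True, 2.5),   # decision 1 false, decision 2 true
]


def run_basis_path_tests() -> int:
    """Run every basis-path case and return how many passed."""
    passed = 0
    for order_total, is_member, expected in BASIS_PATH_CASES:
        actual = shipping_fee(order_total, is_member)
        assert actual == expected, (order_total, is_member, actual)
        passed += 1
    return passed


if __name__ == "__main__":
    print(f"{run_basis_path_tests()} basis path tests passed")
```

The fourth combination (both decisions true) is not required for the basis set, but it can be added when full path coverage is the goal.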

Testing 75

AWS Launches General Availability of Amazon EC2 P5 Instances for AI/ML and HPC Workloads

InfoQ

AWS recently announced the general availability (GA) of Amazon EC2 P5 instances powered by the latest NVIDIA H100 Tensor Core GPUs suitable for users that require high performance and scalability in AI/ML and HPC workloads. The GA is a follow-up to the earlier announcement of the development of the infrastructure.

AWS 75

How to Configure an HTTPS Endpoint for Mule Applications With One-Way SSL

DZone

In this post, we will learn about the steps involved in configuring an HTTPS endpoint with one-way SSL for a Mule application. Securing communication between clients and servers is essential in today's digital world, and using HTTPS over HTTP ensures secure data transfer. Before we start, let's understand how the CloudHub load balancer works.

Servers 246

OpenTelemetry observability and Dynatrace deliver actionable answers at scale

Dynatrace

If a microservice falls in the forest and all your monitoring solutions report it differently, can operators accurately trace what happened and automate a response? OpenTelemetry observability provides the answer to this philosophical conundrum. Different monitoring point solutions, such as Jaeger, Zipkin, Logstash, Fluentd, and StatsD, each have their own way of observing and recording such an event.

Modern Methods For Improving Drupal’s Largest Contentful Paint Core Web Vital

Smashing Magazine

By Mike Herchel, 2023-08-15. Let’s start with a fairly common example of a hero component on the homepage of Drupal’s demo installation of the Umami theme. The image in this hero component is loaded by CSS via the background-image property.

Media 77

Updated Percona Distributions for MongoDB, Metadata Table for QAN in PMM: Release Roundup August 21, 2023

Percona

Percona is a leading provider of unbiased, performance-first, open source database solutions that allow organizations to easily, securely, and affordably maintain business agility, minimize risks, and stay competitive, free from vendor lock-in. Percona software is designed for peak performance, uncompromised security, limitless scalability, and disaster-proofed availability.

Write about what you learn. It pushes you to understand topics better.

Addy Osmani

Write about what you learn. It pushes you to understand topics better. Sometimes the gaps in our knowledge only become clear when explaining things to others.

73

Dolphie, your real-time MySQL monitoring assistant

Percona Community

For as long as I can remember, Innotop has been the go-to terminal tool for real-time MySQL monitoring. It is an invaluable addition to any DBA’s toolkit, but unfortunately, it’s not really actively maintained these days, except for addressing critical issues, and it hasn’t kept pace with the evolving capabilities of modern terminals.

API Management

DZone

In the ever-evolving landscape of modern software development, Application Programming Interfaces (APIs) have emerged as crucial building blocks that enable seamless communication between different software systems, applications, and services. APIs act as intermediaries, allowing developers to access the functionality and data of existing software components, making it easier to build new applications and integrate various services into a unified ecosystem.

Avoid billing surprises with smart Dynatrace cost monitors

Dynatrace

Managing a Dynatrace Platform Subscription (DPS) budget requires balancing your organization’s usage of Dynatrace capabilities against a pre-defined annual budget commitment. Without adequate flexibility in the subscription model, your organization might fail to benefit from capabilities that could transform your observability and security processes.