May, 2023

article thumbnail

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

The Netflix TechBlog

Migrating Critical Traffic At Scale with No Downtime — Part 1 Shyam Gala , Javier Fernandez-Ivern , Anup Rokkam Pratap , Devang Shah Hundreds of millions of customers tune into Netflix every day, expecting an uninterrupted and immersive streaming experience. Behind the scenes, a myriad of systems and services are involved in orchestrating the product experience.

Traffic 344
article thumbnail

Implementing a Self-Healing Infrastructure With Kubernetes and Prometheus

DZone

In today's world, the need for highly available and fault-tolerant systems is more important than ever. Furthermore, with the increased adoption of microservices and containerization , the need for a reliable infrastructure that can automatically detect and recover from failures has become critical. Kubernetes , an open-source container orchestration platform, and Prometheus, a popular monitoring and alerting toolkit, are two tools that can be used to implement such a self-healing infrastructure

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

The path to achieving unprecedented productivity and software innovation through ChatGPT and other generative AI

Dynatrace

With the launch of ChatGPT, an AI chatbot developed by OpenAI, in November 2022, large language models (LLMs) and generative AI have become a global sensation, making their way to the top of boardroom agendas and household discussions worldwide. GPT (generative pre-trained transformer) technology and the LLM-based AI systems that drive it have huge implications and potential advantages for many tasks, from improving customer service to increasing employee productivity.

article thumbnail

So many bad takes?—?What is there to learn from the Prime Video microservices to monolith story

Adrian Cockcroft

So many bad takes — What is there to learn from the Prime Video microservices to monolith story Excerpt from Serverless First deck first published in 2019 The Prime Video team published this story: Scaling up the audio/video monitoring service and reducing costs by 90% , and the internet piled in with opinions and bad takes, mostly missing the point.

article thumbnail

Understanding Linux IOWait

Percona

I have seen many Linux Performance engineers looking at the “IOWait” portion of CPU usage as something to indicate whenever the system is I/O-bound. In this blog post, I will explain why this approach is unreliable and what better indicators you can use. Let’s start by running a little experiment – generating heavy I/O usage on the system: sysbench --threads=8 --time=0 --max-requests=0 fileio --file-num=1 --file-total-size=10G --file-io-mode=sync --file-extra-flags=direct

Cache 144
article thumbnail

Monoliths are not dinosaurs

All Things Distributed

Building evolvable software systems is a strategy, not a religion. And revisiting your architectures with an open mind is a must.

Strategy 151
article thumbnail

ABAC on SpiceDB: Enabling Netflix’s Complex Identity Types

The Netflix TechBlog

By Chris Wolfe , Joey Schorr , and Victor Roldán Betancort Introduction The authorization team at Netflix recently sponsored work to add Attribute Based Access Control (ABAC) support to AuthZed’s open source Google Zanzibar inspired authorization system, SpiceDB. Netflix required attribute support in SpiceDB to support core Netflix application identity constructs.

Cache 252

More Trending

article thumbnail

Log forensics: Finding malicious activity in multicloud environments with Dynatrace Grail

Dynatrace

Log forensics—investigating security incidents based on log data—has become more challenging as organizations adopt cloud-native technologies. Organizations are increasingly turning to these cloud environments to stay competitive, remain agile, and grow. But as organizations rely more on cloud environments, data and complexity have proliferated. Teams struggle to maintain control of and gain visibility into all the applications, microservices and data dependencies these environments generate.

Analytics 239
article thumbnail

Article: Magic Pocket: Dropbox’s Exabyte-Scale Blob Storage System

InfoQ

A horizontally scalable exabyte-scale blob storage system which operates out of multiple regions, Magic Pocket is used to store all of Dropbox’s data. Adopting SMR technology and erasure codes, the system has extremely high durability guarantees but is cheaper than operating in the cloud.

Storage 114
article thumbnail

Proof of Concept: Horizontal Write Scaling for MySQL With Kubernetes Operator

Percona

Historically MySQL is great in horizontal READ scale. The scaling, in that case, is offered by the different number of Replica nodes, no matter if using standard asynchronous replication or synchronous replication. However, those solutions do not offer the same level of scaling for writes operation. Why? Because the solutions still rely on writing in one single node that works as Primary.

Traffic 127
article thumbnail

Can autonomous trucks transform the global supply chain?

All Things Distributed

For the season finale of Now Go Build, I traveled to Arizona, to see first-hand how autonomous trucking has the potential to revolutionize the global supply chain.

107
107
article thumbnail

Using anti-requirements to find system boundaries

Particular Software

We all love building greenfield projects. 1 But inevitably, starting a new project involves lots of meetings with business stakeholders to hash out initial requirements and canonical data models. Those are…not so fun. When one of those meetings occurs after a carb-heavy lunch, it’s easy for your mind to drift away…back to those university lectures about entity design.

Systems 98
article thumbnail

How to Handle Secrets in Kubernetes

DZone

Kubernetes has become the de facto standard for container orchestration, enabling organizations to build, deploy, and scale modern applications with efficiency and agility. As more organizations adopt Kubernetes, the need for proper security and management of sensitive data within these environments becomes paramount. One crucial aspect of ensuring a secure Kubernetes infrastructure is the effective management of secrets, such as API keys, passwords, and tokens.

article thumbnail

Dynatrace adds monitoring support for Microsoft Azure Kubernetes Service deployments using Azure Linux container host

Dynatrace

Dynatrace is proud to provide deep monitoring support for Azure Linux as a container host operating system (OS) platform for Azure Kubernetes Services (AKS) to enable customers to operate efficiently and innovate faster. This integration is a result of our collaboration with Microsoft to better serve our Azure customers across the Microsoft portfolio.

Azure 236
article thumbnail

A Complete Guide to Performance Budgets

Speed Curve

It's easier to make a fast website than it is to keep a website fast. If you've invested countless hours in speeding up your pages, but you're not using performance budgets to prevent regressions, you could be at risk of wasting all your efforts. In this post we'll cover how to: Use performance budgets to fight regressions Understand the difference between performance budgets and performance goals Identify which metrics to track Validate your metrics to make sure they're measuring what you think

article thumbnail

ChatGPT vs. MySQL DBA Challenge

Percona

Given the increasing popularity of ChatGPT in solving questions of humankind, I decided to take a better look at the current state of AI. At the same time that I see database engineers relying on the tool, sites such as StackOverflow are banning ChatGPT. As the question comes from time to time, is my profession at risk of being extinct due to AI? As my mentor said when I started working in the tech area, the answer is: it depends.

article thumbnail

Easy Way to Start Contributing to Open Source With PMM Documentation

Percona Community

If you are a user of Percona Monitoring and Management and noticed any typo or inaccurate information in its documentation, you can easily correct it yourself in the repository following detailed instructions in README.md. But if you are not experienced in open source contributions, you may still feel uneasy about following those steps. This post is for you!

article thumbnail

cppfront: Spring update

Sutter's Mill

Since the year-end mini-update , progress has continued on cppfront. (If you don’t know what this personal project is, please see the CppCon 2022 talk on YouTube.) This update covers Acknowledgments, and highlights of what’s new in the compiler and language since last time, including: simple, mathematically safe, and efficient chained comparisons named break and continue “simple and safe” starts with. main user-defined type , including unifying all special member functions as o

C++ 95
article thumbnail

Microservices Decoded: Unraveling the Benefits, Challenges, and Best Practices for APIs

DZone

In today's fast-paced software development landscape, microservices have emerged as a popular architectural pattern. But what exactly are microservices? In a nutshell, microservices are a way of structuring an application as a collection of loosely coupled, fine-grained services that communicate through lightweight protocols. This architectural style enables teams to develop and deploy services independently, offering flexibility and scalability to the software development process.

article thumbnail

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

Keeping pace with modern digital transformation requires ensuring that applications are responsive, resilient, and always available amid increased complexity. As a result, site reliability has emerged as a critical success metric for many organizations. Site reliability engineering (SRE) has recently become a critical discipline in recent years as the world has shifted in favor of web-based interactions.

article thumbnail

Farewell FID. and hello Interaction to Next Paint!

Speed Curve

Today at Google I/O 2023, it was announced that Interaction to Next Paint (INP) is no longer an experimental metric. INP will replace First Input Delay (FID) as a Core Web Vital in March of 2024. It's been three years since the Core Web Vitals initiative was kicked off in May 2020. In that time, we've seen people's interest in performance dramatically increase, especially in the world of SEO.

Metrics 95
article thumbnail

Disaster Recovery for PostgreSQL on Kubernetes

Percona

Disaster recovery is not optional for businesses operating in the digital age. With the ever-increasing reliance on data, system outages or data loss can be catastrophic, causing significant business disruptions and financial losses. With multi-cloud or multi-regional PostgreSQL deployments, the complexity of managing disaster recovery only amplifies.

Storage 125
article thumbnail

Pause AI?

O'Reilly

It’s hard to ignore the discussion around the Open Letter arguing for a pause in the development of advanced AI systems. Are they dangerous? Will they destroy humanity? Will they condemn all but a few of us to boring, impoverished lives? If these are indeed the dangers we face, pausing AI development for six months is certainly a weak and ineffective preventive.

article thumbnail

React Server Components, Next.js App Router and examples

Addy Osmani

A summary of the state of React Server Components, Next.js App Router, rendering, examples and other related links and thoughts.

Servers 92
article thumbnail

VPN Architecture for Internal Networks

DZone

In today's hyperconnected digital landscape, secure internal networks have become a cornerstone of corporate infrastructure. This crucial aspect of operations is often bolstered by robust VPN technologies, such as OpenVPN. As a leading solution in the realm of VPN , OpenVPN's architecture plays a critical role in enhancing corporate network security and connectivity.

Network 306
article thumbnail

DevOps monitoring tools: How to drive DevOps efficiency

Dynatrace

With the world’s increased reliance on digital services and the organizational pressure on IT teams to innovate faster, the need for DevOps monitoring tools has grown exponentially. In fact, the Dynatrace 2023 CIO Report found that 78% of respondents deploy software updates every 12 hours or less. 54% reported deploying updates every two hours or less.

DevOps 231
article thumbnail

?What experts said at Kubecon about Data on Kubernetes

Percona Community

Melissa Logan, managing director of Data on Kubernetes (DoK), led one of the best panels I’ve been to at a conference at Kubecon EU in Amsterdam about challenges with and the state of the art of running databases on Kubernetes. This panel united the Data on Kubernetes Community Operator SIG and Kubernetes Storage SIG to discuss key features of Kubernetes database operators.

Storage 63
article thumbnail

Save Money in AWS RDS: Don’t Trust the Defaults

Percona

Default settings can help you get started quickly – but they can also cost you performance and a higher cloud bill at the end of the month. Want to save money on your AWS RDS bill? I’ll show you some MySQL settings to tune to get better performance, and cost savings, with AWS RDS. Recently I was engaged in a MySQL Performance Audit for a customer to help troubleshoot performance issues that led to downtime during periods of high traffic on their AWS RDS MySQL instances.

AWS 125
article thumbnail

Bootstrapping Uber’s Infrastructure on arm64 with Zig

Uber Engineering

In this blog post we explain how we bootstrapped arm64 infrastructure using a relatively new toolchain in town: zig cc.

article thumbnail

What is Portability Testing in Software Testing?

Kovair

Get information on - What is Portability Testing in Software Testing?, follow Kovair Blog for the latest information and updates on software development and information technology.

article thumbnail

Navigating the Divide: Distinctions Between Time Series Data and Relational Data

DZone

I have coded many applications, both client and web, over my career, and I understand the importance of building a well-developed application from the ground up. Therefore, I found it important to write a piece based on my understanding of time series data versus relational data as someone with a unique understanding of both. Relational Data The main objective of relational data is to maintain an accurate representation of the current state of the world with respect to its objects and the relati

Database 304
article thumbnail

Network performance monitoring top of mind for CloudOps teams

Dynatrace

For cloud operations teams, network performance monitoring is central in ensuring application and infrastructure performance. If the network is sluggish, an application may also be slow, frustrating users. Worse, a malicious attacker may gain access to the network, compromising sensitive application data. Still, despite having adopted myriad network monitoring tools, organizations still struggle to identify whether the network is at fault for application and infrastructure problems.

Network 228
article thumbnail

Data on Kubernetes Meetup May 23

Percona Community

Percona has started to participate in Data on Kubernetes (DoK) meetings about Kubernetes Operators. These meetings are an initiative of DoK meetups that spotlight DoK case studies. In this blog post series, I will summarize the topics covered in each meeting. On May 23, very interesting topics were discussed on the agenda. Let’s begin to summarize it.

Systems 58
article thumbnail

MySQL Connection Security With Connection Control Plugins

Percona

As a database administrator, have you ever been in a situation when your database confronted a brute force attack? A brute force attack can be launched against a user account in MySQL. MySQL replies with success or error based on supplied credentials, and the time required for the verification is almost the same in either case. Hence, an attacker can launch a brute force attack against a MySQL user account at a rapid rate and can try many different passwords.

Servers 123
article thumbnail

Give us autonomy - but first you've gotta tell us what to do

The Agile Manager

The ultimate state for any team is self-determination: they lead their own discovery of work, self-prioritize that work, self-organize their roles and self-direct the delivery. Self-determination requires meta awareness. The team knows the problem space - the motivations of different actors (buyers, users, influencers), the guiding policies (regulatory and commercial preference), the tech in place, and so forth.

Code 52
article thumbnail

Current status, needs, and challenges in Heterogeneous and Composable Memory from the HCM workshop (HPCA’23)

ACM Sigarch

Introduction Memory systems are evolving into heterogeneous and composable architectures. Heterogeneous and Composable Memory (HCM) offers a feasible solution for terabyte- or petabyte-scale systems, addressing the performance and efficiency demands of emerging big-data applications. However, building and utilizing HCM presents challenges, including interconnecting various memory technologies (e.g., using Compute Express Link or CXL), organizing memory components for optimal performance, adaptin

Latency 52
article thumbnail

What Is Istio Service Mesh?

DZone

Most organizations prefer to deploy containerized applications into K8s because of its scalability and flexibility. But as the number of microservices increased and application pods are distributed across multiple clusters and cloud providers, managing and scaling them has become complex. While scaling, it is harder to configure complex communication logic between microservices.