2024

Introducing the New Dapr Jobs API and Scheduler Service

DZone

Last month's Dapr 1.14 release was packed with new capabilities, including a new Jobs API and a Scheduler control plane service for managing jobs. A Jobs API has been a frequent request to the Dapr project over the years. The Scheduler service enables it and is also designed to improve the performance and scalability of Actor reminders and the Workflow API.

Dynatrace joins the Microsoft Intelligent Security Association

Dynatrace

As organizations adopt more cloud-native technologies, the risk—and consequences—of cyberattacks are also increasing. This rising risk amplifies the need for reliable security solutions that integrate with existing systems. That’s why we’re proud to announce that Dynatrace has joined the Microsoft Intelligent Security Association (MISA).

Managing PostgreSQL® High Availability – Part I: PostgreSQL Automatic Failover

Scalegrid

Managing High Availability (HA) in your PostgreSQL hosting is essential to ensuring your database deployment clusters maintain exceptional uptime and strong operational performance so your data is always available to your application. In an earlier blog post, we showed you how to configure high availability for PostgreSQL using streaming replication, and now we’re going to show you how to best manage client-side HA.

Introducing Netflix’s Key-Value Data Abstraction Layer

The Netflix TechBlog

By Vidhya Arvind, Rajasekhar Ummadisetty, Joey Lynch, and Vinay Chella. At Netflix, our ability to deliver seamless, high-quality streaming experiences to millions of users hinges on robust, global backend infrastructure. Central to this infrastructure is our use of multiple online distributed databases such as Apache Cassandra, a NoSQL database known for its high availability and scalability.

Latency 248

AWS Lambda turns 10: A rare look at the doc that started it

All Things Distributed

On AWS Lambda's 10th anniversary, I'm publishing the internal PR/FAQ that helped launch this groundbreaking service. This document provides insight into the customer problems we observed in the early 2010s and our vision for serverless computing. Readers will find annotations throughout, offering a glimpse into how our long-term thinking has influenced Lambda's evolution over the past decade.

Lambda 212

Do Not Upgrade to Any Version of MySQL After 8.0.37

Percona

Warning! Recently, Jean-François Gagné opened a bug on bug.mysql.com #115517; unfortunately, the bug is now private. However, the bug looks quite serious. We at Percona have performed several tests and opened the issue PS-9306 to investigate the problem.

Testing 145

AI Flame Graphs

Brendan Gregg

Imagine halving the resource costs of AI and what that could mean for the planet and the industry -- based on extreme estimates such savings could reduce the total US power usage by over 10% by 2030 [1]. At Intel we've been creating a new analyzer tool to help reduce AI costs called AI Flame Graphs: a visualization that shows an AI accelerator or GPU hardware profile along with the full software stack, based on my CPU flame graphs.

More Trending

C++ safety, in context

Sutter's Mill

Scope. To talk about C++’s current safety problems and solutions well, I need to include the context of the broad landscape of security and safety threats facing all software. I chair the ISO C++ standards committee and I work for Microsoft, but these are my personal opinions and I hope they will invite more dialog across programming language and security communities.

C++ 141

QCon London: Meta Used Monolithic Architecture to Ship Threads in Only Five Months

InfoQ

Zahan Malkani talked during QCon London 2024 about Meta’s journey from identifying the opportunity in the market to shipping the Threads application only five months later. The company leveraged Instagram's existing monolithic architecture and quickly iterated to create a new text-first microblogging service in record time.

The psychology of site speed and human happiness

Speed Curve

In the fourteen years that I've been working in the web performance industry, I've done a LOT of research, writing, and speaking about the psychology of page speed – in other words, why we crave fast, seamless online experiences. In fact, the entire first chapter of my book, Time Is Money (reprinted here courtesy of the good folks at O'Reilly), is dedicated to the subject.

Speed 133

Efficient Data Management With Offset and Cursor-Based Pagination in Modern Applications

DZone

Managing large datasets efficiently is essential in software development. Retrieval strategies play a crucial role in improving performance and scalability, especially when response times are critical. Pagination is a core technique for managing data effectively and for optimizing performance and resource use. In this article, we will explore two pagination strategies, offset and cursor-based pagination, which suit different scenarios and requirements.
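
As a rough illustration of the two strategies named in this teaser, here is a minimal sketch over an in-memory list. It is not taken from the linked article: `ROWS`, `offset_page`, and `cursor_page` are hypothetical names, and the sketch assumes rows are sorted by a unique, increasing `id`.

```python
from typing import Optional

# Hypothetical in-memory "table", sorted by a unique, increasing id.
ROWS = [{"id": i, "name": f"item-{i}"} for i in range(1, 11)]


def offset_page(rows, offset: int, limit: int):
    """Offset pagination: skip `offset` rows, then return up to `limit` rows."""
    return rows[offset:offset + limit]


def cursor_page(rows, after_id: Optional[int], limit: int):
    """Cursor pagination: return up to `limit` rows with id greater than the cursor."""
    # Keep only the rows that come after the cursor (all rows when no cursor is given).
    remaining = [r for r in rows if after_id is None or r["id"] > after_id]
    page = remaining[:limit]
    # The next cursor is the last id returned, or None when nothing is left after this page.
    next_cursor = page[-1]["id"] if len(remaining) > limit else None
    return page, next_cursor
```

One design difference the sketch makes visible: if a row is deleted between requests, an offset shifts and a row can be skipped, while a cursor resumes from the last id the client actually saw.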

CrowdStrike BSOD: Quickly find machines impacted by the CrowdStrike issue

Dynatrace

Security vendor CrowdStrike released an update to their popular platform early on July 19, 2024, ultimately producing an issue that caused many Windows-based machines to fail, resulting in a BSOD (Blue Screen of Death). The global impact has affected almost every major industry, resulting in closed bank branches, ground stops on flights, failures for retail point-of-sale devices, and, unfortunately, much more.

Airlines 307

Will AWS Have Anything New To Say About Sustainability at re:Invent 2024?

Adrian Cockcroft

Photo by Adrian of my father’s “round tuit” which I’m hoping will inspire AWS to do something… There’s an old saying that any headline that ends in a question mark can be answered with a “no”. My hopes aren’t high, after spending the last three years waiting for anything new to appear. There’s been broken promises, management changes, I’ve been told that there are teams working on things like Scope 3 carbon accounting, but nothing has been released since the Well Architected Guide to Sustainabil

AWS 98

Netflix’s Distributed Counter Abstraction

The Netflix TechBlog

By Rajiv Shringi, Oleksii Tkachuk, and Kartik Sathyanarayanan. In our previous blog post, we introduced Netflix’s TimeSeries Abstraction, a distributed service designed to store and query large volumes of temporal event data with low-millisecond latencies. Today, we’re excited to present the Distributed Counter Abstraction. This counting service, built on top of the TimeSeries Abstraction, enables distributed counting at scale while maintaining similar low-latency performance.

Latency 224

Continuous reinvention: A brief history of block storage at AWS

All Things Distributed

Marc Olson, a long-time Amazonian, discusses the evolution of EBS, highlighting hard-won lessons in queueing theory, the importance of comprehensive instrumentation, and the value of incrementalism versus radical changes. It's an insightful look at how one of AWS’s foundational services has evolved to meet the needs of our customers.

Storage 198

Is Oracle Finally Killing MySQL?

Percona

Some 15 years ago, when Oracle acquired Sun, and hence MySQL, there was a lot of talk on the internet over when Oracle was going to “Kill MySQL.” Multiple theories circulated, ranging from killing MySQL altogether, so there would be less competition for proprietary Oracle, to just killing it as an open source project, […]

Linux Crisis Tools

Brendan Gregg

When you have an outage caused by a performance issue, you don't want to lose precious time just to install the tools needed to diagnose it. Here is a list of "crisis tools" I recommend installing on your Linux servers by default (if they aren't already), along with the (Ubuntu) package names that they come from:

Package     Provides                               Notes
procps      ps(1), vmstat(8), uptime(1), top(1)    basic stats
util-linux  dmesg(1), lsblk(1), lscpu(1)           system log, device info
sysstat     iostat(1), mpstat

Servers 140

Rift Between Junior and Senior Developers

O'Reilly

I’m worried about AI. I’m not worried about it taking my job. I believe AI is a genuine productivity tool. By which I mean it can make developers produce more. The question is whether those developers are producing something good or not. The difference between an experienced developer and a junior is that an experienced developer knows: There’s more than one good solution to every problem.

Home Screen Advantage

Alex Russell

After weeks of confusion and intentional chaos, Apple's plan to kneecap the web has crept into view, menacing a PWApocalypse as the March 6th compliance deadline approaches for the EU's Digital Markets Act (DMA). The view from Cupertino. The DMA requires Apple to open the iPhone to competing app stores, and its lopsided proposal for "enabling" them is getting most of the press.

Storage 114

Uber Builds Scalable Chat Using Microservices with GraphQL Subscriptions and Kafka

InfoQ

Uber replaced a legacy architecture built using the WAMP protocol with a new solution that takes advantage of GraphQL subscriptions. The main drivers for creating a new architecture were challenges around reliability, scalability, and observability/debuggability, as well as technical debt impeding the team’s ability to maintain the existing solution.

Five ways cookie consent managers hurt web performance (and how to fix them)

Speed Curve

I've been spending a lot of time looking at the performance of European sites lately, and have found that the consent management platforms (CMPs) are consistently creating a bit of a false reality for folks when trying to understand performance using synthetic monitoring. Admittedly, this is not a new topic, but I feel that it's important enough that it warrants another PSA.

The State of Observability 2024: Navigating Complexity With AI-Driven Insights

DZone

In today's fast-paced digital landscape, organizations are increasingly embracing multi-cloud environments and cloud-native architectures to drive innovation and deliver seamless customer experiences. However, the 2024 State of Observability report from Dynatrace reveals that the explosion of data generated by these complex ecosystems is pushing traditional monitoring and analytics approaches to their limits.

Analytics 331

New Distributed Tracing app provides effortless trace insights

Dynatrace

We’re excited to announce the first version of our new Distributed Tracing app, a part of the new Dynatrace user experience that leverages the full power of the Dynatrace platform. With the Distributed Tracing app, you can flexibly slice and dice raw trace data to understand what went wrong and why. Find what you’re looking for faster with: Enhanced charting and data visualization: Easily filter, group, search, and visualize trace data to gain deeper insights into your system’s behavior.

Tuning 271

How To Monitor And Optimize Google Core Web Vitals

Smashing Magazine

By Matt Zeunert, 2024-04-16. This article is sponsored by DebugBear. Google’s Core Web Vitals initiative has increased the attention website owners need to pay to user experience. You can now more easily see when users have poor experiences on your website, and poor UX also has a bigger impact on SEO.

Google 108

Maestro: Netflix’s Workflow Orchestrator

The Netflix TechBlog

By Jun He, Natallia Dzenisenka, Praneeth Yenugutala, Yingyi Zhang, and Anjali Norwood. TL;DR: We are thrilled to announce that the Maestro source code is now open to the public! Please visit the Maestro GitHub repository to get started. If you find it useful, please give us a star. What is Maestro? Maestro is a general-purpose, horizontally scalable workflow orchestrator designed to manage large-scale workflows such as data pipelines and machine learning model training pipelines.

Strategy 255

Introducing Distill CLI: An efficient, Rust-powered tool for media summarization

All Things Distributed

After a few code reviews from Rustaceans at Amazon and a bit of polishing, I'm ready to share the Distill CLI, an open-source tool written in Rust that uses AWS services, such as S3, Transcribe, and Bedrock, to summarize meetings and other media.

Media 178

An Elephant in the Cluster: Making PostgreSQL Feel at Home on Kubernetes

Percona

TL;DR: Kubernetes was built for stateless apps, but as more stateful applications (like databases) run on it, operators include quite heavy implementations to compensate for deficits in the Kubernetes workload management API (such as StatefulSets). While creating custom methods allows flexibility and faster time to market, it also leads to inconsistency and complexity.

Database 117

No More Blue Fridays

Brendan Gregg

In the future, computers will not crash due to bad software updates, even those updates that involve kernel code. In the future, these updates will push eBPF code. Friday July 19th provided an unprecedented example of the inherent dangers of kernel programming, and has been called the largest outage in the history of information technology. Windows computers around the world encountered blue-screens-of-death and boot loops, causing outages for hospitals, airlines, banks, grocery stores, media br

Airlines 134

Can Language Models Replace Compilers?

O'Reilly

Kevlin Henney and I recently discussed whether automated code generation, using some future version of GitHub Copilot or the like, could ever replace higher-level languages. Specifically, could ChatGPT N (for large N) quit the game of generating code in a high-level language like Python, and produce executable machine code directly, like compilers do today?

JMeter Correlation Recorder 2.5: An Approach to the New Plugin Version

Abstracta

Learn how the latest version of the JMeter Correlation Recorder plugin can make correlating dynamic values easier and minimize errors in performance testing.

Presentation: Why a Hedge Fund Built Its Own Database

InfoQ

James Munro discusses ArcticDB, the practicalities of building a performant time-series datastore, and why transactions, particularly the Isolation in ACID, are just not worth it.

Database 111

Building the future of performance with SpeedCurve

Speed Curve

I’m beyond excited to announce that I’m joining the SpeedCurve team this year! I’ll still be doing some consulting work, but I’ll be taking on fewer clients this year so I can focus on helping to make an already amazing performance tool even better, working alongside some of my favorite people in the performance community.

Cloud Cost Optimization: New Strategies for the AI Era

DZone

In today's volatile economic landscape, enterprises are scrutinizing their cloud bills more than ever. Platform teams are at the forefront of this challenge, tasked with finding innovative ways to optimize usage and drive down costs. To gain insights into this evolving field, we spoke with Kapil Thangavelu, co-founder and CTO of Stacklet and the creator and lead maintainer of Cloud Custodian.

Cloud 299

How Red Hat and Dynatrace intelligently automate your production environment

Dynatrace

Red Hat and Dynatrace are joining forces to revolutionize day One and day Two operations. A tight integration between Red Hat Ansible Automation Platform, Dynatrace Davis® AI, and the Dynatrace observability and security platform enables closed-loop remediation to automate the process from: Detecting a problem. Managing incidents in corresponding tools.

DevOps 295

Introducing AI Assistance in Chrome DevTools

Addy Osmani

Introducing AI Assistance in Chrome DevTools, a new feature that brings AI-powered coding assistance to the familiar Chrome DevTools environment.

Code 96

Noisy Neighbor Detection with eBPF

The Netflix TechBlog

By Jose Fernandez, Sebastien Dabdoub, Jason Koch, and Artem Tkachuk. The Compute and Performance Engineering teams at Netflix regularly investigate performance issues in our multi-tenant environment. The first step is determining whether the problem originates from the application or the underlying infrastructure. One issue that often complicates this process is the "noisy neighbor" problem.

Latency 245

Hacking our way to better team meetings

All Things Distributed

My team and I set out to build a simple note-taking aid, which transcribes and summarizes our meetings using Bedrock. Today, we’re making the proof of concept available to everyone.