Big Data and Servers - Technology Performance Pulse

What is Greenplum Database? Intro to the Big Data Database

Scalegrid

MAY 13, 2020

It scales to multi-petabyte data workloads and gives access to a cluster of powerful servers that work together behind a single SQL interface, where you can view all of the data.

Big Data

Big Data Database Artificial Intelligence Open Source

Sustainability: Thoughts from a software engineer

Dynatrace

MARCH 17, 2025

Until recently, improvements in data center power efficiency compensated almost entirely for the increasing demand for computing resources. However, this trend is now reversing. The rise of big data, cryptocurrencies, and AI means the IT sector contributes significantly to global greenhouse gas emissions.

Software Engineering

Software Engineering Engineering Software

Moving HPC to the Cloud: A Guide for 2020

High Scalability

SEPTEMBER 14, 2020

This is a guest post by Limor Maayan-Wainstein, a senior technical writer with 10 years of experience writing about cybersecurity, big data, cloud computing, web development, and more. High performance computing (HPC) enables you to solve complex problems that cannot be solved by regular computing.

Cloud

Cloud Big Data Virtualization Efficiency

Driving down the cost of Big-Data analytics - All Things Distributed

All Things Distributed

AUGUST 18, 2011

The Amazon Elastic MapReduce (EMR) team announced today the ability to seamlessly use Amazon EC2 Spot Instances with their service, significantly driving down the cost of data analytics in the cloud.

Big Data

Big Data Analytics AWS Cloud

What is cloud monitoring? How to improve your full-stack visibility

Dynatrace

JANUARY 11, 2023

As cloud and big data complexity scales beyond the ability of traditional monitoring tools to handle, next-generation cloud monitoring and observability are becoming necessities for IT teams.

Cloud

Cloud Monitoring Best Practices Infrastructure

Migrating Critical Traffic At Scale with No Downtime - Part 1

The Netflix TechBlog

MAY 4, 2023

These include options where replay traffic generation is orchestrated on the device, on the server, and via a dedicated service. Moreover, allowing the device to execute untested server-side code paths can inadvertently expose an attack surface area for potential misuse. We will examine these alternatives in the upcoming sections.

Traffic

Traffic Latency Tuning Systems

Python at Netflix

The Netflix TechBlog

APRIL 29, 2019

Content is placed on the network of servers in the Open Connect CDN as close to the end user as possible, improving the streaming experience for our customers and reducing costs for both Netflix and our Internet Service Provider (ISP) partners. takes place in Amazon Web Services (AWS), whereas everything that happens afterwards (i.e.,

Open Source

Open Source Network Infrastructure Big Data

Kubernetes in the wild report 2023

Dynatrace

JANUARY 16, 2023

On-premises data centers invest in higher capacity servers since they provide more flexibility in the long run, while the procurement price of hardware is only one of many cost factors. Big data: To store, search, and analyze large datasets, 32% of organizations use Elasticsearch.

Open Source

Open Source Java Operating System Programming

No need to compromise visibility in public clouds with the new Azure services supported by Dynatrace

Dynatrace

JULY 6, 2020

Our customers have frequently requested support for this first new batch of services, which cover databases, big data, networks, and computing. The Azure MySQL dashboard serves as a comprehensive overview of your MySQL servers and database services. See the health of your big data resources at a glance.

Azure

Azure Cloud Big Data Virtualization

What is ITOps? Why IT operations is more crucial than ever in a multicloud world

Dynatrace

DECEMBER 15, 2022

The roles and responsibilities of ITOps team members include the following: A system administrator configures servers, installs applications, monitors the health of the system, and fixes and upgrades hardware. The primary goal of ITOps is to provide a high-performing, consistent IT environment.

Artificial Intelligence

Artificial Intelligence DevOps Hardware Virtualization

Introduction to Grafana, Prometheus, and Zabbix

DZone

FEBRUARY 6, 2024

If the data sources are not available, customized plugins can be developed to integrate them. Grafana is widely used to monitor and visualize metrics for hundreds or thousands of servers, Kubernetes platforms, virtual machines, big data platforms, etc.

Big Data

Big Data Open Source Virtualization Metrics

No Server Required - Jekyll & Amazon S3 - All Things Distributed

All Things Distributed

AUGUST 17, 2011

The increasing sophistication of client-side JavaScript has redefined what dynamic means; where in the past dynamic content would be mainly server generated, today much content is served statically with JavaScript on the client side doing the dynamic modifications.

Servers

Servers Social Media AWS Website

Business Insights extends support for optimizing Core Web Vitals

Dynatrace

APRIL 21, 2021

To do this effectively, you need a big data processing approach. For example: Largest Contentful Paint can be improved by faster server response times, deferring render-blocking JavaScript and CSS, reducing resource load times, and optimizing any client-side rendering. How do you know where to focus first with failing pages?

Traffic

Traffic Mobile Metrics Analytics

What is container orchestration?

Dynatrace

MARCH 24, 2023

Using Marathon, its data center operating system (DC/OS) plugin, Mesos becomes a full container orchestration environment that, like Kubernetes and Docker Swarm, discovers services, balances loads, and manages application containers. Mesos also supports other orchestration engines, including Kubernetes and Docker Swarm.

Infrastructure

Infrastructure Open Source Operating System Cloud

Formulating ‘Out of Memory Kill’ Prediction on the Netflix App as a Machine Learning Problem

The Netflix TechBlog

JULY 21, 2022

We at Netflix, as a streaming service running on millions of devices, have a tremendous amount of data about device capabilities and characteristics, as well as runtime data, in our big data platform. With large data comes the opportunity to leverage it for predictive and classification-based analysis.

Big Data

Big Data Cache Engineering Data Engineering

Experiences with approximating queries in Microsoft’s production big-data clusters

The Morning Paper

SEPTEMBER 8, 2019

Experiences with approximating queries in Microsoft’s production big-data clusters, Kandula et al., VLDB’19. Microsoft’s big data clusters have tens of thousands of machines, and are used by thousands of users to run some pretty complex queries. For the larger more production-like query analysed in §4.2.1,

Big Data

Big Data Analytics Latency Azure

What is APM?

Dynatrace

JUNE 1, 2020

Application discovery, tracing and diagnostics (ADTD): Application discovery, tracing and diagnostics is a set of processes designed to understand the relationships between application servers, map transactions across these nodes, and enable the deep inspection of methods using bytecode instrumentation (BCI) and/or distributed tracing.

Artificial Intelligence

Artificial Intelligence Social Media Monitoring IoT

What is a Distributed Storage System

Scalegrid

FEBRUARY 8, 2024

A distributed storage system is foundational in today’s data-driven landscape, ensuring data spread over multiple servers is reliable, accessible, and manageable. Understanding distributed storage is imperative as data volumes and the need for robust storage solutions rise.

Storage

Storage Systems Big Data Azure

Expanding the AWS Cloud: Introducing the AWS Europe (London) Region

All Things Distributed

DECEMBER 13, 2016

With the launch of the AWS Europe (London) Region, AWS can enable many more UK enterprise, public sector and startup customers to reduce IT costs, address data locality needs, and embark on rapid transformations in critical new areas, such as big data analysis and Internet of Things. Fraud.net is a good example of this.

AWS

AWS Cloud Artificial Intelligence IoT

Helios: hyperscale indexing for the cloud & edge – part 1

The Morning Paper

OCTOBER 26, 2020

Helios also serves as a reference architecture for how Microsoft envisions its next generation of distributed big-data processing systems being built. What follows is a discussion of where big data systems might be heading, heavily inspired by the remarks in this paper, but with several of my own thoughts mixed in.

Cloud

Cloud Big Data Latency Architecture

Applying real-world AIOps use cases to your operations

Dynatrace

OCTOBER 17, 2022

Artificial intelligence for IT operations, or AIOps, combines big data and machine learning to provide actionable insight for IT teams to shape and automate their operational strategy. From there, you look at the web server the application is communicating with, further to the front-end tier and search service.

DevOps

DevOps Artificial Intelligence Healthcare Innovation

New AWS feature: Run your website from Amazon S3 - All Things.

All Things Distributed

FEBRUARY 17, 2011

Since a few days ago this weblog serves 100% of its content directly out of the Amazon Simple Storage Service (S3) without the need for a web server to be involved. I had held out implementing an alternative to my simple blog server that had.

AWS

AWS Website Storage Servers

Mastering Distributed SQL™ Databases in 2025

Scalegrid

JANUARY 10, 2025

They keep the features that developers like but can handle much more data, similar to NoSQL systems. Notably, they simplify handling big data flows, offer consistent transactions, and sustain high performance even when they’re used for real-time data analysis and complex queries.

Database

Database Scalability Best Practices Blockchain

Seer: leveraging big data to navigate the complexity of performance debugging in cloud microservices

The Morning Paper

MAY 14, 2019

Seer: leveraging big data to navigate the complexity of performance debugging in cloud microservices, Gan et al. By instrumenting both the client side and the server side of a request, it’s possible to figure out wait times. Seer is also tested on a 100-server GCE cluster with the Social Network microservices application.

Big Data

Big Data Cloud Performance Hardware

What is Application Performance Monitoring?

Dynatrace

JUNE 1, 2020

Application discovery, tracing and diagnostics (ADTD): Application discovery, tracing and diagnostics is a set of processes designed to understand the relationships between application servers, map transactions across these nodes, and enable the deep inspection of methods using bytecode instrumentation (BCI) and/or distributed tracing.

Monitoring

Monitoring Performance Social Media Artificial Intelligence

MySQL vs MongoDB: Best Choice for You

Scalegrid

FEBRUARY 11, 2025

By storing information in rows and columns within these tables, MySQL enables effective sorting and accessing of data. The adherence to a strict schema ensures consistency across the stored data while enforcing rules for validating this information. These elements will be delved into in subsequent subsections.

Scalability

Scalability Database Storage IoT

Expanding the Cloud with DNS - Introducing Amazon Route 53 - All.

All Things Distributed

DECEMBER 5, 2010

The naming system that we are all most familiar with on the internet is the Domain Name System (DNS), which manages the naming of the many different entities in our global network; its most common use is to map a name to an IP address, but it also provides facilities for aliases, finding mail servers, managing security keys, and much more.

Cloud

Cloud Internet AWS

Expanding the Cloud - Introducing Amazon ElastiCache - All Things.

All Things Distributed

AUGUST 22, 2011

Caching has become a standard component in many applications to achieve fast and predictable performance, but maintaining a collection of cache servers in a reliable and scalable manner is not a simple task.

Cloud

Cloud Cache AWS Storage

Post: Scrapinghub, Fauna, Sisu, Educative, PA File Sight, Etleap, Triplebyte, Stream

High Scalability

MARCH 24, 2020

Scrapinghub is hiring a Senior Software Engineer (Big Data/AI). You will be designing and implementing distributed systems: building a large-scale web crawling platform, integrating Deep Learning-based web data extraction components, working on queue algorithms and large datasets, creating a development platform for other company departments, etc.

Education

Education Software Engineering Engineering Big Data

The Need for Real-Time Device Tracking

ScaleOut Software

JULY 19, 2021

If a cyber network agent has observed an unusual pattern of failed login attempts, it needs to alert downstream network nodes (servers and routers) to block the kill chain in a potential attack.

IoT

IoT Big Data Analytics Architecture

Post: InterviewCamp.io, Scrapinghub, Fauna, Sisu, Educative, PA File Sight, Etleap, Triplebyte, Stream

High Scalability

APRIL 14, 2020

Scrapinghub is hiring a Senior Software Engineer (Big Data/AI). You will be designing and implementing distributed systems: building a large-scale web crawling platform, integrating Deep Learning-based web data extraction components, working on queue algorithms and large datasets, creating a development platform for other company departments, etc.

Education

Education Software Engineering Scalability Engineering

Redis vs Memcached in 2024

Scalegrid

MARCH 28, 2024

However, its limited feature set compared to Redis might be a disadvantage for applications that require more advanced data structures and persistence. Caching serves a dual purpose in web development: speeding up client requests and reducing server load.

Cache

Cache Storage Architecture Scalability

Post: InterviewCamp.io, Scrapinghub, Fauna, Sisu, Educative, PA File Sight, Etleap, Triplebyte, Stream

High Scalability

MARCH 30, 2020

Scrapinghub is hiring a Senior Software Engineer (Big Data/AI). You will be designing and implementing distributed systems: building a large-scale web crawling platform, integrating Deep Learning-based web data extraction components, working on queue algorithms and large datasets, creating a development platform for other company departments, etc.

Education

Education Software Engineering Engineering Big Data

Post: InterviewCamp.io, Scrapinghub, Fauna, Sisu, Educative, PA File Sight, Etleap, Triplebyte, Stream

High Scalability

APRIL 28, 2020

Scrapinghub is hiring a Senior Software Engineer (Big Data/AI). You will be designing and implementing distributed systems: building a large-scale web crawling platform, integrating Deep Learning-based web data extraction components, working on queue algorithms and large datasets, creating a development platform for other company departments, etc.

Education

Education Software Engineering Scalability Engineering

Why MySQL Could Be Slow With Large Tables

Percona

JANUARY 19, 2023

Sharding: Sharding is the concept of splitting data horizontally, i.e., distributing data across multiple servers (shards), so that different portions of the data for a given table may be stored on different servers. This helps split large data sets into smaller ones, each stored on its own server.

Open Source

Open Source Storage Database Big Data
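
The horizontal split described in the sharding excerpt above can be sketched in a few lines. This is a minimal illustration only, assuming range-based routing on an integer row id; the excerpt does not say which sharding strategy is used, and the shard names and id boundaries below are hypothetical.

```python
from bisect import bisect_right

# Hypothetical layout: ids below 1000 live on shard-0, ids from 1000 up to
# 1999 on shard-1, ids from 2000 up to 2999 on shard-2, the rest on shard-3.
SHARD_UPPER_BOUNDS = [1000, 2000, 3000]
SHARDS = ["shard-0", "shard-1", "shard-2", "shard-3"]


def shard_for(row_id: int) -> str:
    """Return the (hypothetical) server that stores the row with this id."""
    # bisect_right counts how many boundaries are <= row_id, which is
    # exactly the index of the shard whose range contains the id.
    return SHARDS[bisect_right(SHARD_UPPER_BOUNDS, row_id)]


def split_rows(row_ids):
    """Group row ids by the shard each one would be stored on."""
    placement = {name: [] for name in SHARDS}
    for row_id in row_ids:
        placement[shard_for(row_id)].append(row_id)
    return placement
```

Range-based routing keeps neighbouring ids on the same server, which suits range scans, but it can concentrate load on one shard if recent ids are the hottest.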

What is RabbitMQ Used For

Scalegrid

JUNE 28, 2024

Such multiprotocol support ensures that the RabbitMQ server can adapt to the messaging requirements of any application, whether it’s a lightweight IoT device transmitting sensor data or a complex enterprise system coordinating large-scale operations. Can RabbitMQ handle the high-throughput needs of big data applications?

IoT

IoT Healthcare Programming Open Source

Expanding the Cloud: Introducing the AWS Asia Pacific (Seoul) Region

All Things Distributed

JANUARY 6, 2016

Mirae Asset Global Investments improved its web service environment and reduced annual management costs by 50% by consolidating the management of all web services, including servers, network, database, and security. Many of these enterprises are assisted by our extensive partner ecosystem in Korea.

AWS

AWS Cloud Games Latency

Automating Physical Backups of MongoDB on Kubernetes

Percona

MARCH 15, 2023

The why: Percona Server for MongoDB can handle petabytes of data. The how (basics): When you enable backups in your cluster, the Operator adds a sidecar container to each replset pod (including the Config Server pods if sharding is enabled) to run pbm-agent. The feature is now in technical preview.

Database

Database Big Data Processing Servers

NoSQL Data Modeling Techniques

Highly Scalable

MARCH 1, 2012

Perhaps the greatest benefit of an unordered Key-Value data model is that entries can be partitioned across multiple servers by just hashing the key. Applicability: Key-Value Stores, Document Databases, BigTable-style Databases.

Database

Database Ecommerce Efficiency Engineering
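
Partitioning entries by hashing the key, as the excerpt above describes, might look like the following. It is a sketch under stated assumptions: the choice of SHA-256 and the simple modulo placement are illustrative choices, not taken from the article, and the function names are hypothetical.

```python
import hashlib


def partition_for(key: str, num_servers: int) -> int:
    """Map a key to a server index in the range [0, num_servers)."""
    # A stable hash is used instead of Python's built-in hash(), which is
    # randomized per process for strings and so would not route consistently.
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_servers


def partition_all(keys, num_servers: int):
    """Distribute keys into one bucket per server."""
    buckets = {index: [] for index in range(num_servers)}
    for key in keys:
        buckets[partition_for(key, num_servers)].append(key)
    return buckets
```

Modulo placement is the simplest scheme, but changing `num_servers` remaps most keys; schemes such as consistent hashing exist to reduce that movement.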

The AWS GovCloud (US) Region - All Things Distributed

All Things Distributed

AUGUST 16, 2011

One particular early use case for AWS GovCloud (US) will be massive data processing and analytics. Several agencies of very different parts of the government have needs for data analytics that really put the Big in Big-Data, sometimes several orders of magnitude larger than commonly found in industry.

AWS

AWS Government Big Data Cloud

Reduce RPO, Encrypt Backups, and More in 1.15.0 Release of Percona Operator for MongoDB

Percona

OCTOBER 18, 2023

release, we added support for physical backups and restores to significantly reduce Recovery Time Objective (RTO), especially for big data sets.

Best Practices

Best Practices Storage AWS Big Data

Introducing the AWS South America - All Things Distributed

All Things Distributed

DECEMBER 14, 2011

AWS

AWS Latency Storage Cloud
