Remove Analytics Remove Internet Remove Storage
article thumbnail

What is Greenplum Database? Intro to the Big Data Database

Scalegrid

Greenplum Database is an open-source , hardware-agnostic MPP database for analytics, based on PostgreSQL and developed by Pivotal who was later acquired by VMware. This feature-packed database provides powerful and rapid analytics on data that scales up to petabyte volumes. What Exactly is Greenplum? At a glance – TLDR.

Big Data 321
article thumbnail

Probabilistic Data Structures for Web Analytics and Data Mining

Highly Scalable

Statistical analysis and mining of huge multi-terabyte data sets is a common task nowadays, especially in the areas like web analytics and Internet advertising. This approach often leads to heavyweight high-latency analytical processes and poor applicability to realtime use cases. Case Study.

Analytics 191
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Observations on the Importance of Cloud-based Analytics

All Things Distributed

Many of these innovations will have a significant analytics component or may even be completely driven by it. For example many of the Internet of Things innovations that we have seen come to life in the past years on AWS all have a significant analytics components to it. Cloud analytics are everywhere.

Analytics 111
article thumbnail

Overseeing SaaS security with AWS AppFabric and Dynatrace

Dynatrace

Customers can then ingest logs from AWS into the Dynatrace® platform, for in-depth log management and security analytics powered by Grail TM. AWS AppFabric ingests and normalizes audit logs from SaaS applications and delivers them to an Amazon Simple storage service (Amazon S3) bucket in an organization’s AWS account.

AWS 255
article thumbnail

A Recap of the Data Engineering Open Forum at Netflix

The Netflix TechBlog

Unbundling the Data Warehouse: The Case for Independent Storage Recording Speaker : Jason Reid (Co-founder & Head of Product at Tabular) Summary : Unbundling a data warehouse means splitting it into constituent and modular components that interact via open standard interfaces.

article thumbnail

Dynatrace adds support for AWS Transit Gateway with VPC Flow Logs

Dynatrace

This new service enhances the user visibility of network details with direct delivery of Flow Logs for Transit Gateway to your desired endpoint via Amazon Simple Storage Service (S3) bucket or Amazon CloudWatch Logs. Check out our Power Demo: Log Analytics with Dynatrace. What is AWS Transit Gateway?

AWS 229
article thumbnail

Driving down the cost of Big-Data analytics - All Things Distributed

All Things Distributed

Driving down the cost of Big-Data analytics. The Amazon Elastic MapReduce (EMR) team announced today the ability to seamlessly use Amazon EC2 Spot Instances with their service, significantly driving down the cost of data analytics in the cloud. Hadoop is quickly becoming the preferred tool for this type of large scale data analytics.