Blogs

An Analysis of Data Egress Cost and How Sprinkle saves on it

Introduction

Cost reduction is one of the primary objectives of any business. Modern day data analytics decision makers are no different. They strive to get their hands on a tool or a platform that would enable them to not only perform better data analytics but also help them reduce the cost of the complete data infrastructure. For example, the total cost of ownership of a data infrastructure includes the cloud cost, ETL tool cost and Analytics tools cost. 

Its not a hidden secret that moving data from local data centres to cloud based data lakes is much more cost efficient. Generally, cloud providers charge pennies for storing multiple gigabytes of data. For example, Amazon Web Services (AWS) charges a little more than 2 cents per GB of data stored and the incremental cost gets even lower with the rising volume of data. 

Once a decision maker is convinced about a storage solution and it serves their use case, the next question that comes to his head is, “ How much will they charge for transferring data?”

Data storage providers generally don’t charge anything or charge very less for uploading data into the cloud. They don’t want to put any restrictions on the data volume uploaded on the cloud. But, when the data is moving out of the cloud, the costs pile up for an organisation. This is referred to as data egress cost. 

What is Data Egress ?

Data egress cost is incurred by an organisation when they plan to move out the data from the cloud storage or in other words while exporting the data. Now, there could be multiple reasons for doing this. Some of them include, transfer of data to some other storage, disaster recovery and data movement from an application to a preferred data lake or data warehouse, among others. 

The opposite of data egress is data ingress or import of data into the data storage. Generally, there are no charges involved in this process.

Data Egress Cost Analysis : AWS vs GCP vs Azure

While making a decision of choosing the preferred data storage platform, the answer to the question of how much is the data egress cost ? is answered simply by “not much” as compared to the cost of storing the data. For example, the data egress cost at AWS is only 5 to 7 cents higher than the cost of storing the data on a per GB basis. But, this becomes huge, when the data volume grows. And with any growing business, the volume of data is bound to increase resulting in a huge cost for the organization. 

Let’s take a quick look at the data egress cost of the leading cloud data storage providers, that is, Amazon Web Services, Google Cloud  Platform and Microsoft Azure.

Amazon Web Services

Google Cloud Platform 

Microsoft Azure

$0.09 per GB up to 10 TB/mo, $0.085 per GB for the next 40 TB

$0.12 per GB for the first TB, $0.11 per GB up to 10 TB, $0.08 per GB after 10 TB

$0.087 per GB for up to 10 TB, $0.083 per GB for up to 40 TB

Note: All prices are referred from the respective websites

How Sprinkle saves on Data Egress Cost ?

Sprinkle is a data platform built for cloud data warehouses and it doesn’t require any transfer of data outside your cloud infrastructure. 

Using Kubernetes, Sprinkle processes and transfers data in the same network. This is much more secured - More secured as data does not move out of network

Sprinkle Architecture


As seen in the above image, Sprinkle sits within the customer network and no data moves outside the customer network. Hence, there is no data egress cost while using the Sprinkle as compared to some other tools, who move the data outside the customer network.