Cost Analysis

How to Transfer Petabytes Without Going Broke

Moving 1 PB of data can cost anywhere from $0 to $250,000 depending on the method. Here is the math for every option, including when each one actually makes sense.

The Scale Problem

A petabyte is 1,000 terabytes or roughly one million gigabytes. In concrete terms: about 500,000 hours of HD video, 20 billion document pages, or a single large AI training dataset. As AI workloads grow, petabyte-scale transfers are becoming routine for AI data centers, research institutions, and media companies.

The problem is that most file transfer pricing was designed for gigabyte-scale usage. At petabyte scale, those per-GB fees become absurd. Let us do the math.

Cost Breakdown: Moving 1 PB

MethodCostTransfer TimeNotes
AWS S3 egress$87,040~12 days at 10 GbpsStandard egress pricing; faster with DirectConnect ($$$)
GCP egress$80,000~12 days at 10 GbpsPremium tier; standard tier slightly less
Azure egress$52,224~12 days at 10 GbpsCheapest of the big three clouds
Pay-per-GB service$250,000~20+ days (TCP-limited)$0.25/GB download; TCP throughput ceiling on long paths
Enterprise UDP tool$10,000-30,000/yr~12 days at 10 GbpsLicense + infrastructure; fast once deployed
AWS Snowball Edge$9,000-15,0005-10 days (shipping)$300/device x 13 devices + shipping; 80 TB usable each
Shipping hard drives$3,000-6,0002-5 days (shipping)20 TB drives x 50 + shipping + logistics
Handrive (P2P)$0~12 days at 10 GbpsBoth endpoints need bandwidth and availability

Method 1: Cloud Egress ($50K-$90K)

If your data is already in a cloud provider, egress fees are the most straightforward cost. The pricing is per-GB, tiered by volume:

AWS S3 Egress: 1 PB Calculation

  • First 10 TB: 10,240 GB x $0.09 = $921.60
  • Next 40 TB: 40,960 GB x $0.085 = $3,481.60
  • Next 100 TB: 102,400 GB x $0.07 = $7,168.00
  • Remaining 874 TB: 894,976 GB x $0.05 = $44,748.80
  • Total: approximately $56,320 (tiered) to $87,040 (flat $0.085)

Pricing varies by region and changes periodically. These are approximate US-region rates.

Azure is typically 30-40% cheaper for egress. GCP falls in between. But all three share the same fundamental problem: data gravity. Once your data is in one cloud, the egress fee is effectively a tax on leaving.

When cloud egress makes sense

When data is already in the cloud and needs to move to another cloud region or on-premises location. You are paying the egress whether you use a transfer tool or not. The transfer tool cost is additive.

Method 2: Pay-Per-GB Service ($250,000)

Pay-per-GB services charge $0.25/GB for downloads. The math is simple and painful:

1,000,000 GB x $0.25/GB = $250,000

This is not a typo. Per-GB pricing at petabyte scale is extraordinarily expensive.

These services also use TCP-based transfer, which means throughput is constrained by the bandwidth-delay product on high-latency links. A 10 Gbps connection between US coasts might achieve 2-5 Gbps effective throughput, extending transfer time from 12 days to 20-30 days. See our TCP limitations analysis for the full breakdown.

When pay-per-GB services make sense

Pay-per-GB services are designed for project-based transfers in the gigabyte to low-terabyte range: video deliverables, design assets, client uploads. At that scale, $0.25/GB is reasonable for the convenience. They are not designed for petabyte-scale data movement.

Method 3: Enterprise UDP Tools ($10K-$30K/year)

Enterprise UDP transfer tools use the FASP protocol for UDP-based acceleration, achieving near line-rate throughput regardless of latency. At 10 Gbps, 1 PB transfers in roughly 12 days.

The cost model is a flat annual license ($10,000-$30,000 depending on features and throughput tiers) plus the infrastructure to run dedicated transfer servers. The infrastructure cost varies widely: a single on-prem server might cost $5,000-$10,000, while a cloud-hosted deployment can run $2,000-$5,000/month.

When enterprise UDP tools make sense

When you are an enterprise with dedicated IT staff, need formal compliance certifications (SOC 2, HIPAA), and move petabytes regularly. The fixed cost amortizes well if you are transferring 10+ PB/year. For a one-time petabyte move, the setup overhead is hard to justify.

Method 4: AWS Snowball ($9K-$15K)

AWS Snowball Edge devices hold 80 TB of usable capacity. To move 1 PB, you need 13 devices. Each device costs $300 for a 10-day rental plus shipping.

Snowball Cost: 1 PB

  • Devices: 13 x $300 = $3,900
  • Shipping (round-trip): ~$100-200 x 13 = $1,300-$2,600
  • Data transfer into S3: $0 (ingress is free)
  • Staff time for loading: 2-3 days
  • Total: $5,200-$6,500 (plus labor)

The catch: Snowball is designed for getting data into AWS, not for arbitrary point-to-point transfer. It also takes 5-10 business days for the full cycle (ship, load, ship back, ingest). For data that needs to reach a non-AWS destination, Snowball adds complexity.

When Snowball makes sense

One-time bulk migration into AWS with limited network bandwidth. If your internet connection is under 1 Gbps, Snowball is faster than network transfer for anything over about 50 TB.

Method 5: Shipping Hard Drives ($3K-$6K)

The original sneakernet. Buy 20 TB drives, load them, ship them.

Drive Shipping Cost: 1 PB

  • 50 x 20 TB drives at $40-60 each = $2,000-$3,000
  • Shipping (insured, tracked): $500-$1,500
  • Enclosures/docking stations: $200-$500
  • Staff time for loading/unloading: 1-3 days
  • Total: $2,700-$5,000 (plus labor)

This is genuinely the cheapest option for a one-time bulk transfer when you do not have high-bandwidth connectivity. A box of drives shipped next-day is faster than a 1 Gbps connection for anything over about 100 TB (100 TB over 1 Gbps takes ~9.3 days).

When drive shipping makes sense

Initial bulk data migration with limited bandwidth. This is an honest recommendation: if you need to move 1 PB once and your connection is under 5 Gbps, shipping drives is likely your best option for the first transfer. Use network-based tools for ongoing incremental updates afterward.

Method 6: Handrive ($0)

Handrive is free at any scale. No per-GB fees, no license costs. The transfer uses a UDP-based protocol that achieves full bandwidth utilization regardless of latency, similar to the FASP protocol but without the license fee.

Transfer time depends entirely on your network bandwidth:

1 PB Transfer Time by Bandwidth

  • 1 Gbps: ~93 days
  • 10 Gbps: ~9.3 days
  • 25 Gbps: ~3.7 days
  • 100 Gbps: ~22 hours

The trade-off: both endpoints must be online simultaneously (no intermediate storage server). For a multi-day transfer, this means running a headless server instance on a NAS or always-on machine. The data is end-to-end encrypted and never touches a third-party server, which matters for proprietary AI training datasets and model checkpoints.

When Handrive makes sense

Ongoing transfers between fixed endpoints with reasonable bandwidth (5+ Gbps). The zero cost means you can run continuous data pipelines without worrying about the bill. For AI data centers with 10-100 Gbps interconnects, Handrive delivers enterprise-class performance at zero cost.

The Honest Recommendation

There is no single best method. The right choice depends on your specific situation:

  • One-time bulk migration, limited bandwidth: Ship drives. It is unglamorous but it works and it is cheap.
  • One-time migration into AWS specifically: AWS Snowball. Purpose-built for this.
  • Ongoing transfers, enterprise compliance required: Enterprise UDP tools. The license pays for itself if you are moving 10+ PB/year.
  • Ongoing transfers, cost-sensitive: Handrive. Zero cost at any volume with full-speed UDP protocol.
  • Ad-hoc project transfers under 5 TB: Pay-per-GB services. Simple, no setup, per-GB cost is manageable at this scale.

Many teams combine methods: ship drives for the initial bulk load, then use Handrive for ongoing incremental transfers. This hybrid approach minimizes both cost and time to first data availability.

For a detailed comparison of the network-based options, see our AI data transfer tools comparison.


Related Posts

Move Petabytes for Free

Handrive's UDP-based protocol transfers at full line rate with zero per-GB fees. Try it on your next large dataset.

Download Handrive