Selfhost or Cloud? Cost/Risk Analysis

·
Selfhost or Cloud?
  • Self-hosted storage is almost always more cost-effective for large-scale, persistent needs, specially if there is a focus on data privacy and security.
  • For GPU compute (Deepseek), cloud is often cheaper unless you maintain high utilization (>20–30%) so there is to be mindful of the utilization rate and what the goals are before investing.
  • After some research, a hybrid approach—self-hosted storage, cloud GPUs for burst analysis—is optimal for most organizations but all depends on the use case and goals.
  • See detailed cost, risk, and operational breakdowns below for informed decision-making given your own scenario.

1. Scenario Overview

  • Self-hosted build price: In the previous post, I calculated the cost of a custom build for two cases. Case 1 is a build with 2x RTX 5090 GPUs and Case 2 is a build without GPUs. The cost of the custom build for Case 1 is ~CLP$12,101,864 (≈ USD $13,000) and for Case 2 is ~CLP$3,863,084 (≈ USD $4,200).
  • Specs: the build considers 2x RTX 5090 GPUs, high-end CPU, 48TB+ local storage, 128GB RAM and a PSU of 2000W for Case 1 and for ther sake of simplicity the same PSU for Case 2 but without the GPUs.
  • Use-cases: the build intention is to provide Deepseek AI, large document analysis, enterprise file storage, future web hosting of applications all for a company of around 20-30 employees.
  • Electricity: the build consumes in the worst scenario ~USD$4,000/year in electricity bills. This is running at full capacity 24/7.

2. Cloud Options

In order to make a comparisson of the running costs of the server vs the cloud options, several providers were evaluated to provide a computer capacity similar to what the custom build provides. The providers were selected based on the computer power provided with a bias on asian providers, nonetheless, providers from the US were also selected for comparisson of prices and to have a global perspective.

The storage providers were selected based on the same criteria as the GPU providers.

2.1 GPU Providers

ProviderCountryGPU TypeTypical Price (USD/hr)Notes
DigitalOceanUSAH100~$1.57Pay-as-you-go, $1.57/hr/GPU
GPU MartSingapore4090~$1.00Asia-based, scalable, support for AI
Alibaba/TencentChinaA100/4090~$1.20–$1.50Mainland China, enterprise contracts
HyperbolicUSA/SingaporeH100/4090H100 $0.99+, 4090 $0.49+Market-based, prices vary
RunPodUSAH100/4090H100 ~$2.10, 4090 ~$1.20Market-based, prices vary
JarvisLabsIndiaH100/4090H100 ~$2.99, 4090 ~$1.29Market-based, prices vary
Vast.aiUSAH100/4090H100 ~$2.07, 4090 ~$0.36Market-based, prices vary

2.2 Storage Providers

ProviderCountry20 TB Storage (Monthly)20 TB Storage (Annual)Notes
DigitalOceanUSA$2,048$24,576$0.10/GB/month, block storage
Alibaba CloudChina$430$5,160$0.021/GB/month, OSS Standard
Tencent CloudChina$430$5,160$0.021/GB/month, COS Standard
GPU MartSingapore$400–2,000 (estimate)$4,800–24,000 (est.)Based on market averages, not public pricing
Self-HostingLocalOne-time: ~$1,500-4 × 8TB HDDs, no monthly fee, just electricity

3. Cost Analysis

In order to effectively compare the options of self host vs cloud, it is important to consider the electricity cost of running the server at full capacity. Therefore, the electricity cost is calculated for case 1 build with 2x RTX 5090 GPUs and case 2 build without GPUs and for the purpose of storing data only.

3.1 Estimated Electricity Consumption

Build OptionEstimated Power Draw (W)kWh/monthkWh/yearAnnual Electricity Cost (USD)
Case 1: 2× RTX 5090 GPUs1,6501,20514,460~$4,000
Case 2: Without GPU5003654,444~$1,260
example
Assumptions
  • Case 1: estimate is based on previous calculations for a dual RTX 5090 build (1,650W, 24/7 use, ~$4,000/year at local rates).
  • Case 2: is proportional to the power draw (500W ≈ 30% of full build), so electricity cost is estimated as 30% of $4,000 ≈ $1,260/year.
  • Both assume 24/7 operation. Actual usage will vary with workload and power-saving features.

3.2 Self-Hosting Cost Case 1

note
Case 1
  • Upfront server cost: ~USD$13,000 (includes 2x RTX 5090, 40TB storage)
  • Electricity: ~USD$335/month (~USD $4,000/year)
  • Maintenance: Requires IT skills, local support

3.3 Self-Hosting Cost Case 2

note
Case 2
  • Upfront server cost: ~USD$4200 (40TB storage, High end processor)
  • Electricity: ~USD$105/month (~USD $1,260/year)
  • Maintenance: Requires IT skills, local support

3.3 Cloud Option Cost

note
Cloud Option Cost
  • GPU rental: USD$0.99–$3/hr for H100/4090, pay-as-you-go.
  • Storage: USD$400–$2,000+/month for 20TB
  • Flexibility: Scale up/down, no hardware risk

4. Risk & Operational Assessment

Self-Hosting

When self hosting there is to consider that any hardware failure or outage will require on-site IT skills to resolve. This is a risk that most of the time is not present when using cloud services as the provider is the one that will take care of any problem with the machine.

The benefit of a self-hosting solution is that it provides full control and privacy, as the data is stored locally and under the control of the user.

Personalization of services and scalability are other benefits of self-hosting, as the user can choose the services and hardware that best suit their needs.

Cloud

Cloud services provide a more cost-effective solution for those who do not require full control and privacy, as the data is stored remotely and under the control of the provider. The cloud option assures high uptime and support, as the provider is the one that will take care of any problem with the machine.

The cloud option also provides the flexibility to scale up or down as needed, which is a benefit for those who do not have a fixed amount of resources.

Yet the downside of cloud solutions is that they are not as flexible as self-hosting, as the user is limited by the provider's policies and the hardware is not under the control of the user. The provider can also charge additional fees for certain services, such as data transfer or storage depending on the type of contract signed.


5. 1-Year & 3-Year Cost Comparison

Case 1: 2x RTX 5090 GPUs + 20TB Storage (usable)

Option1-Year Cost3-Year CostControlUptimeMaintenanceData Privacy
Self-HostingUSD$17,000USD$25,000FullLocalDIYFull
DigitalOceanUSD$33,746USD$101,238LimitedHighIncludedVariable
Alibaba CloudUSD$11,160USD$33,480LimitedHighIncludedVariable
Tencent CloudUSD$11,160USD$33,480LimitedHighIncludedVariable
GPU MartUSD$10,800–28,000USD$32,400–84,000LimitedHighIncludedVariable
JarvisLabs (H100)USD$8,760–26,280USD$26,280–78,840LimitedHighIncludedVariable
info
Info

Cloud costs assume 8 hours/day GPU use, 20TB storage. Adjust for your usage. JarvisLabs pricing is for H100 (closest available to 5090 for cloud rental), at $2.99/hr. 1-year cost is $8,760 (8hr/day) to $26,280 (24/7); 3-year cost is triple.

Case 2: 20TB Usable Storage (RAID 1)

Option1-Year Cost3-Year CostControlUptimeMaintenanceData Privacy
Self-HostingUSD$5,460USD$7,890FullLocalDIYFull
DigitalOceanUSD$24,576USD$73,728LimitedHighIncludedVariable
Alibaba CloudUSD$5,160USD$15,480LimitedHighIncludedVariable
Tencent CloudUSD$5,160USD$15,480LimitedHighIncludedVariable
GPU MartUSD$4,800–24,000USD$14,400–72,000LimitedHighIncludedVariable
info
Info

Storage-only costs for 20TB usable storage (e.g., RAID 1 with 40TB raw capacity for self-hosting). No GPU usage included in these costs.


6. Observations

  • Storage: Self-hosting is almost always justified for large-scale, persistent storage due to dramatically lower costs, full control, and no ongoing fees. The risks (hardware failure, outages) can be managed with RAID, backups, and good IT practices.
  • Deepseek GPU Compute: The high up-front cost of GPUs plus ongoing electricity means self-hosting is only cost-effective at high utilization (>20–30%). For bursty or occasional Deepseek analysis, cloud GPUs are more economical and flexible, with no maintenance or power cost. Cloud providers now offer highly competitive, market-based pricing and let you scale up or down as needed.
  • Best Practice: Many organizations use a hybrid model—self-hosting for storage and basic compute, and renting cloud GPUs for heavy, burst, or specialized analysis. This combines cost-effective storage with flexible, scalable compute.
Use CaseSelf-Hosting Justified?Cloud Recommended?
StorageYes (almost always)Only for small/short-term
Deepseek GPUOnly at high utilizationYes, for burst/intermittent

7. Practical Considerations & Utilization

  • Rule-of-thumb: If you expect to do deep learning for longer than a year at moderate or high utilization, it is cheaper to get a desktop/server. Otherwise, cloud is preferable for short-term or low-utilization needs.
  • Utilization rates:
    • Personal desktop: often <15%
    • Research cluster: >35%
    • Company-wide cluster: >60%
  • Break-even: For high utilization (e.g., >15–20%), self-hosting becomes more cost-effective after about 1 year, even considering electricity.
  • Recommendation: For teams, consider a shared cluster for efficiency. For individuals or low-utilization, cloud is more flexible and cost-effective.

8. Conclusion

Choosing between self-hosting and cloud solutions for large-scale AI and storage is not a one-size-fits-all decision. As highlighted both by industry analysis and real-world user experience, the most effective approach is often hybrid:

  • For most organizations and individuals, investing heavily in high-end GPUs for local use is only justified if you require continuous, high-utilization compute. The up-front costs, ongoing electricity, and maintenance can be substantial, and hardware quickly becomes outdated.
  • Cloud GPU rental is ideal for bursty, experimental, or short-term workloads. It offers instant access to state-of-the-art hardware, no maintenance burden, and the flexibility to scale up or down as your needs change. However, costs can add up quickly with sustained use.
  • Self-hosting remains the best value for persistent, high-capacity storage and for users who require full control over their data and infrastructure. With RAID and good IT practices, risks can be managed, but on-site skills are required for maintenance and troubleshooting.
  • A hybrid strategy—using local infrastructure for storage and basic compute, and cloud resources for heavy or specialized AI workloads—offers the best of both worlds. This allows you to optimize for cost, flexibility, and control, adapting as your needs evolve.

In summary, let your actual usage patterns and risk tolerance guide your infrastructure choices. For most, the flexibility to combine local and cloud resources will deliver the most value, future-proofing your investment while keeping costs manageable.


9. Appendices

A. Calculation Formulas

Annual GPU Cost = (Hourly Rate) × (Number of GPUs) × (Hours per Day) × (Days per Year)
Example: $1.57 × 2 × 8 × 365 ≈ $9,170/year (DigitalOcean)
Monthly = Annual / 12
Annual Electricity Cost = (Power in kW) × (Hours per Year) × (Cost per kWh)
Example: 1.65 kW × 24 × 365 × (CLP$/kWh)

B. Provider Notes

  • All prices are for typical on-demand rates, market-based and may vary by region and availability.
  • Storage prices are for standard object/block storage. Egress/download and API request costs are extra. Self-hosting cost is hardware only.

C. Sources

warning
Disclaimer

All prices and specifications as of May 2025. Please check provider pages for the latest details.