Suggest instance type NVIDIA GPU has the best cost

repost.aws › questions › QUMskv-5YRT_GvGmNYgV0eAw › suggest-instance-type-nvidia-gpu-has-the-best-cost

Hello. The following document describes the GPU-equipped instance types that can be used with EC2: https://docs.aws.amazon.com/dlami/latest/devguide/gpu.html I checked the price list below, and it looks like "g4dn.xlarge" is the cheapest option if you run it On-Demand: https://aws.amazon.com/ec2/pricing/on-demand/?nc1=h_ls Answer from Riku_Kobayashi on repost.aws

Vantage

instances.vantage.sh › aws › ec2 › g4dn.xlarge

g4dn.xlarge pricing and specs - Vantage

The g4dn.xlarge instance is in the GPU instance family with 4 vCPUs, 16 GiB of memory and up to 25 Gibps of bandwidth starting at $0.526 per hour.

AWS

aws.amazon.com › amazon ec2 › pricing › on-demand pricing

EC2 On-Demand Instance Pricing

1 week ago - On-Demand Instances let you pay for compute capacity by the hour or second (minimum of 60 seconds) with no long-term commitments. This frees you from the costs and complexities of planning, purchasing, and maintaining hardware and transforms what are commonly large fixed costs into much smaller ...
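As a rough illustration of the per-second billing with a 60-second minimum described above, the charge for a single run could be estimated as in the sketch below. The function name and parameters are hypothetical, the $0.526/hour rate is the g4dn.xlarge figure quoted by the pricing trackers on this page, and the sketch assumes the instance is billed per second rather than per hour.

```python
def on_demand_cost(hourly_rate_usd: float,
                   runtime_seconds: float,
                   minimum_seconds: int = 60) -> float:
    """Estimate the charge for one run under per-second billing.

    Assumption: runs shorter than `minimum_seconds` are billed as if
    they lasted `minimum_seconds`.
    """
    billable_seconds = max(runtime_seconds, minimum_seconds)
    return hourly_rate_usd * billable_seconds / 3600


if __name__ == "__main__":
    rate = 0.526  # $/hour, g4dn.xlarge figure quoted on this page
    for seconds in (10, 60, 900, 3600):
        print(f"{seconds:>5} s -> ${on_demand_cost(rate, seconds):.4f}")
```

A 10-second run and a 60-second run come out the same under this model, which is the practical effect of the minimum.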

AWS

aws.amazon.com › amazon ec2 › instance types › g4 instances

Amazon EC2 G4 Instances — Amazon Web Services (AWS)

1 week ago - Compared to comparable instances they offer up to 45% better price performance for graphics-intensive applications. ... G4dn instances, powered by NVIDIA T4 GPUs, are the lowest cost GPU-based instances in the cloud for machine learning inference and small scale training.

EC2 Pricing Calculator

costcalc.cloudoptimo.com › aws-pricing-calculator › ec2 › g4dn.xlarge

g4dn.xlarge Pricing and Specs: AWS EC2

The g4dn.xlarge instance is part of the g4dn series, featuring 4 vCPUs, 16 GiB of RAM, and up to 25 Gigabit of network bandwidth, in the GPU instances category. It is available at a rate of $0.5260/hour.

Bigbell

bigbell.ai › coin › aws › ec2 › g4dn.xlarge

g4dn.xlarge pricing and specs - BigBell

The g4dn.xlarge instance is in the gpu instance family with 4 vCPUs, 16.0 GiB of memory and up to 25 Gibps of bandwidth starting at $0.526 per hour.

CloudPrice

cloudprice.net › amazon web services › ec2 › g4dn.xlarge

g4dn.xlarge specs and pricing | AWS | CloudPrice

October 24, 2025 - Amazon EC2 instance g4dn.xlarge with 4 vCPUs, 16 GiB RAM and 1 x NVIDIA T4 16 GiB. Available in 23 regions starting from $383.98 per month.

Economize

economize.cloud › resources › aws › pricing › ec2 › g4dn.xlarge

g4dn.xlarge pricing: $383.98 monthly - AWS EC2

1 week ago - Last updated: December 18, 2025. The g4dn.xlarge instance is in the G4DN Accelerated computing family with 4 vCPUs and 16 GiB of memory; pricing starts at $0.53 per hour and $383.98 per month in the us-east-1 region.
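The $383.98 monthly figure is consistent with multiplying the $0.526 hourly rate by 730 hours. A minimal sketch of that conversion follows; the 730-hour month (365 days × 24 hours ÷ 12) is an assumption, since the listings do not state which convention they use, and the function name is hypothetical.

```python
HOURS_PER_MONTH = 730  # assumption: 365 days * 24 hours / 12 months


def monthly_cost(hourly_rate_usd: float, hours: float = HOURS_PER_MONTH) -> float:
    """Convert an hourly on-demand rate to an always-on monthly estimate."""
    return round(hourly_rate_usd * hours, 2)


if __name__ == "__main__":
    # Hourly rates quoted on this page for the two smallest g4dn sizes.
    for name, rate in {"g4dn.xlarge": 0.526, "g4dn.2xlarge": 0.752}.items():
        print(f"{name}: ${monthly_cost(rate):,.2f} per month")
```

Sources that assume a different number of hours per month will report slightly different monthly totals for the same hourly rate.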

Vantage

instances.vantage.sh › aws › ec2 › g4dn.2xlarge

g4dn.2xlarge pricing and specs - Vantage

The g4dn.2xlarge instance is in the GPU instance family with 8 vCPUs, 32 GiB of memory and up to 25 Gibps of bandwidth starting at $0.752 per hour.

Cloudzero

advisor.cloudzero.com › aws › ec2 › g4dn.xlarge

g4dn.xlarge Instance Specs And Pricing

CloudZero's intelligent platform helps you optimize cloud costs and improve infrastructure efficiency.

AWS re:Post

repost.aws › questions › QUMskv-5YRT_GvGmNYgV0eAw › suggest-instance-type-nvidia-gpu-has-the-best-cost

Suggest instance type NVIDIA GPU has the best cost | AWS re:Post

Key Takeaways:

-Fastest Prompt Eval Rate: AWS g5 (fastest AWS instance tested)
-Fastest Eval Rate: Home PC w/RTX 3080
-Best Cost-Performance Balance: AWS g4dn.xlarge offers a good balance of performance and cost, at $0.58/hr.
-GPU speed is the key differentiator. Within the same instance family (g4dn or g5), the evaluation rates stay consistent across sizes. If the model fits in GPU memory, there is no need for more cores or system memory.
-I did notice that the more system memory was available, the greater the number of tokens in the output.

Test Results

AWS Instances

c7g.8xlarge (Compute Instance)
•32 cores, 64GB RAM
•Prompt Eval Rate: 38.38 tokens/s
•Eval Rate: 25.07 tokens/s
•Price: $1.27/hr, $941.16/mo
r6g.4xlarge (Memory Instance)
•16 cores, 128GB RAM
•Prompt Eval Rate: 10.15 tokens/s
•Eval Rate: 8.29 tokens/s
•Price: $0.88/hr, $657.10/mo
g4dn.xlarge (GPU Instance)
•4 cores, 16GB RAM, 16GB GPU
•Prompt Eval Rate: 222.23 tokens/s
•Eval Rate: 41.71 tokens/s
•Price: $0.58/hr, $434.50/mo
g4dn.2xlarge (GPU Instance)
•8 cores, 32GB RAM, 16GB GPU
•Prompt Eval Rate: 214.25 tokens/s
•Eval Rate: 41.74 tokens/s
•Price: $0.84/hr, $621.24/mo
g5.xlarge (GPU Instance)
•4 cores, 16GB RAM, 24GB GPU
•Prompt Eval Rate: 624.29 tokens/s
•Eval Rate: 68.08 tokens/s
•Price: $1.12/hr, $831.05/mo
g5.2xlarge (GPU Instance)
•8 cores, 32GB RAM, 24GB GPU
•Prompt Eval Rate: 624.48 tokens/s
•Eval Rate: 66.67 tokens/s
•Price: $1.35/hr, $1,000.96/mo
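One way to make the "cost-performance balance" comparison concrete is to convert each result into tokens per dollar. The sketch below is only an illustration: the figures are copied from the AWS results above, the helper names are hypothetical, and it assumes the instance runs at the measured rate for the whole billed hour.

```python
# (prompt eval tokens/s, eval tokens/s, price $/hr), copied from the results above
AWS_RESULTS = {
    "c7g.8xlarge": (38.38, 25.07, 1.27),
    "r6g.4xlarge": (10.15, 8.29, 0.88),
    "g4dn.xlarge": (222.23, 41.71, 0.58),
    "g4dn.2xlarge": (214.25, 41.74, 0.84),
    "g5.xlarge": (624.29, 68.08, 1.12),
    "g5.2xlarge": (624.48, 66.67, 1.35),
}

PROMPT_EVAL, EVAL, PRICE = 0, 1, 2  # tuple positions


def tokens_per_dollar(tokens_per_second: float, price_per_hour: float) -> float:
    """Tokens processed per dollar, assuming full utilization for the hour."""
    return tokens_per_second * 3600 / price_per_hour


def rank_by(metric: int) -> list[str]:
    """Instance names sorted from best to worst tokens per dollar."""
    return sorted(
        AWS_RESULTS,
        key=lambda name: tokens_per_dollar(AWS_RESULTS[name][metric],
                                           AWS_RESULTS[name][PRICE]),
        reverse=True,
    )


if __name__ == "__main__":
    for label, metric in (("prompt eval", PROMPT_EVAL), ("eval", EVAL)):
        print(f"Ranking by {label} tokens per dollar:")
        for name in rank_by(metric):
            rate = AWS_RESULTS[name][metric]
            price = AWS_RESULTS[name][PRICE]
            print(f"  {name:<13} {tokens_per_dollar(rate, price):>12,.0f}")
```

With these numbers, g4dn.xlarge ranks first on generated (eval) tokens per dollar, while g5.xlarge ranks first on prompt tokens per dollar, so the better value depends on whether the workload is dominated by prompt processing or by generation.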

Local Machines

M2 MacMini
•M2, 8GB RAM, <8GB GPU
•Prompt Eval Rate: 66.38 tokens/s
•Eval Rate: 18.33 tokens/s
M1 MacBook Air
•M1, 16GB RAM, <16GB GPU
•Prompt Eval Rate: 71.58 tokens/s
•Eval Rate: 11.46 tokens/s
Home PC w/RTX 3080
•Intel i5, 64GB RAM, 10GB GPU
•Prompt Eval Rate: 185.67 tokens/s
•Eval Rate: 83.79 tokens/s

Oracle Ampere

Ampere 16 Core, 32GB RAM
•Prompt Eval Rate: 11.96 tokens/s (Duration: 1m34.955180835s)
•Eval Rate: 9.01 tokens/s (Duration: 1m28.461256s)
•Price: $0.1276/hr, $95/mo
Ampere 32 Core, 32GB RAM
•Prompt Eval Rate: 22.54 tokens/s (Duration: 47.93207936s)
•Eval Rate: 14.11 tokens/s (Duration: 44.423782s)
•Price: $0.2796/hr, $208/mo

Here's the data formatted in table for easier viewing - courtesy of u/sergeant113. https://www.reddit.com/r/LocalLLaMA/comments/1dclmwt/comment/l7zrgzm/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button

Top answer

1 of 5

Here's the data formatted in a table for easier viewing:

AWS Instances

| Instance Type | Cores | RAM | GPU | Prompt Eval Rate (tokens/s) | Eval Rate (tokens/s) | Price ($/hr) | Price ($/mo) |
|---|---|---|---|---|---|---|---|
| c7g.8xlarge | 32 | 64GB | - | 38.38 | 25.07 | 1.27 | 941.16 |
| r6g.4xlarge | 16 | 128GB | - | 10.15 | 8.29 | 0.88 | 657.10 |
| g4dn.xlarge | 4 | 16GB | 16GB | 222.23 | 41.71 | 0.58 | 434.50 |
| g4dn.2xlarge | 8 | 32GB | 16GB | 214.25 | 41.74 | 0.84 | 621.24 |
| g5.xlarge | 4 | 16GB | 24GB | 624.29 | 68.08 | 1.12 | 831.05 |
| g5.2xlarge | 8 | 32GB | 24GB | 624.48 | 66.67 | 1.35 | 1,000.96 |

Vs Local Machines

| Machine Type | Cores | RAM | GPU | Prompt Eval Rate (tokens/s) | Eval Rate (tokens/s) |
|---|---|---|---|---|---|
| M2 MacMini | - | 8GB | <8GB | 66.38 | 18.33 |
| M1 MacBook Air | - | 16GB | <16GB | 71.58 | 11.46 |
| Home PC w/RTX 3080 | - | 64GB | 10GB | 185.67 | 83.79 |

2 of 5

I use the Hetzner 16-core 32GB ARM instances when I need something cheap and dirty. The cost is 1/10th of even the cheapest AWS setup you have here, so if you aren't worried about speed that might be the way to go.

Cloudzero

advisor.cloudzero.com › aws › ec2 › g4dn.4xlarge

g4dn.4xlarge Instance Specs And Pricing - EC2

November 11, 2025 - CloudZero's intelligent platform helps you optimize cloud costs and improve infrastructure efficiency.

CloudPrice

cloudprice.net › amazon web services › ec2 › g4dn.2xlarge

g4dn.2xlarge specs and pricing | AWS | CloudPrice

2 weeks ago - Amazon EC2 instance g4dn.2xlarge with 8 vCPUs, 32 GiB RAM and 1 x NVIDIA T4 16 GiB. Available in 23 regions starting from $548.96 per month.

Umbrella

umbrellacost.com › home › learning center › aws cost management › amazon ec2 g4 instances

Using AWS EC2 G4dn and G4ad | Amazon EC2 G4 | Umbrella

April 10, 2025 - In December 2020, AWS released the Amazon EC2 G4ad instance subfamily — powered by AMD Radeon Pro V520 GPUs and second-generation AMD EPYC processors with up to 2.4 TB of local NVMe storage — that delivers up to 40% better price performance over comparable GPU-based instances for graphics intensive applications such as virtual workstations and game streaming. In July 2021, AWS expanded the G4ad subfamily with the g4ad.xlarge and g4ad.2xlarge sizes, which are designed to be cost-effective for workloads that don’t need the high vCPU and system memory that current larger G4ad instance sizes offer — rounding out their AMD offering and providing the lowest cost GPU instance in the AWS Cloud.

Saturn Cloud

saturncloud.io › sagemaker-pricing

Amazon SageMaker Pricing | Saturn Cloud

The Saturn Cloud price is the price per hour for the Saturn Cloud component, while the hosting price is the charge for the underlying AWS EC2 instances that the resources run on.