The g4dn.xlarge instance is in the GPU instance family with 4 vCPUs, 16 GiB of memory and up to 25 Gibps of bandwidth starting at $0.526 per hour.

costcalc.cloudoptimo.com › aws-pricing-calculator › ec2 › g4dn.xlarge

g4dn.xlarge Pricing and Specs: AWS EC2

The g4dn.xlarge instance is part of the g4dn series, featuring 4 vCPUs and Up to 25 Gigabit of RAM, with Gpu Instances. It is available at a rate of $0.5260/hour. Price / HourN.

AWS

aws.amazon.com › amazon ec2 › instance types › g4 instances

Amazon EC2 G4 Instances — Amazon Web Services (AWS)

3 days ago - Compared to comparable instances they offer up to 45% better price performance for graphics-intensive applications. ... G4dn instances, powered by NVIDIA T4 GPUs, are the lowest cost GPU-based instances in the cloud for machine learning inference and small scale training.

CloudPrice

cloudprice.net › amazon web services › ec2 › g4dn.xlarge

g4dn.xlarge specs and pricing | AWS | CloudPrice

Amazon EC2 instance g4dn.xlarge with 4 vCPUs, 16 GiB RAM and 1 x NVIDIA T4 16 GiB. Available in 23 regions starting from $383.98 per month.

Economize

economize.cloud › resources › aws › pricing › ec2 › g4dn.xlarge

g4dn.xlarge pricing: $383.98 monthly - AWS EC2

Last updated: December 18, 2025The g4dn.xlarge instance is in the G4DN Accelerated computing family with 4 vCPUs and 16 GiB of memory, pricing starts at $0.53 per hour and $383.98 per month in us-east-1 region.

Aws-pricing

aws-pricing.com › g4dn.xlarge.html

g4dn.xlarge - Amazon EC2 Instance Type

Cost and pricing across all AWS locations for Amazon Elastic Compute Cloud (EC2) instance type g4dn.xlarge with free operating system.

Economize

economize.cloud › resources › aws › pricing › ec2 › g4dn.4xlarge

g4dn.4xlarge pricing: $878.92 monthly - AWS EC2

Last updated: December 8, 2025The g4dn.4xlarge instance is in the G4DN Accelerated computing family with 16 vCPUs and 64 GiB of memory, pricing starts at $1.20 per hour and $878.92 per month in us-east-1 region.

Bigbell

bigbell.ai › coin › aws › ec2 › g4dn.xlarge

g4dn.xlarge pricing and specs - BigBell

The g4dn.xlarge instance is in the gpu instance family with 4 vCPUs, 16.0 GiB of memory and up to 25 Gibps of bandwidth starting at $0.526 per hour.

Cloudzero

advisor.cloudzero.com › aws › ec2 › g4dn.xlarge

g4dn.xlarge Instance Specs And Pricing

CloudZero's intelligent platform helps you optimize cloud costs and improve infrastructure efficiency.

CloudPrice

cloudprice.net › amazon web services › ec2 › g4dn.4xlarge

g4dn.4xlarge specs and pricing | AWS | CloudPrice

Amazon EC2 instance g4dn.4xlarge with 16 vCPUs, 64 GiB RAM and 1 x NVIDIA T4 16 GiB. Available in 23 regions starting from $878.92 per month.

Find elsewhere

Google Bing Mojeek

Vantage

instances.vantage.sh › aws › ec2 › g4dn.4xlarge

g4dn.4xlarge pricing and specs - Vantage

The g4dn.4xlarge instance is in the GPU instance family with 16 vCPUs, 64 GiB of memory and up to 25 Gibps of bandwidth starting at $1.204 per hour.

Instance-pricing

instance-pricing.com › provider=aws-ec2 › instance=g4dn.xlarge

g4dn.xlarge instance pricing of AWS-EC2

We're sorry but instance-pricing doesn't work properly without JavaScript enabled. Please enable it to continue

EC2 Pricing Calculator

costcalc.cloudoptimo.com › aws-pricing-calculator › ec2 › g4dn.2xlarge

g4dn.2xlarge Pricing and Specs: AWS EC2

The g4dn.2xlarge instance is part of the g4dn series, featuring 8 vCPUs and Up to 25 Gigabit of RAM, with Gpu Instances. It is available at a rate of $0.7520/hour. Price / HourN.

Computer Weekly

computerweekly.com › news › 252471099 › AWS-G4-aims-to-lower-cost-of-GPU-powered-AI-inference

AWS G4 aims to lower cost of GPU-powered AI inference | Computer Weekly

On-demand pricing of the g4dn.xlarge, four virtual core instance, with one GPU and 16GB of memory, starts at $0.526 per hour. The eight virtual core instance, with 32GBs of RAM, costs $0.752.

Economize

economize.cloud › resources › aws › pricing › ec2 › g4dn.12xlarge

g4dn.12xlarge pricing: $2855.76 monthly - AWS EC2

The g4dn.12xlarge instance is in the g4dn family with 48 vCPUs and 192 GiB of memory, priced from $3.91/hr or $2855.76/mo.

Economize

economize.cloud › resources › aws › pricing › ec2 › g4dn.2xlarge

g4dn.2xlarge pricing: $548.96 monthly - AWS EC2

Last updated: December 8, 2025The g4dn.2xlarge instance is in the G4DN Accelerated computing family with 8 vCPUs and 32 GiB of memory, pricing starts at $0.75 per hour and $548.96 per month in us-east-1 region.

reddit.com › r/localllama › benchmarking inexpensive aws instances

r/LocalLLaMA on Reddit: Benchmarking Inexpensive AWS Instances

June 10, 2024 -

I recently did some testing using Dolphin-Llama3 across various (inexpensive-ish) AWS instances to compare performance. The results are in line with what one might expect.

Testing was done using default settings with Ollama. I spun up a new instance on Ubuntu, installed Ollama and ran it with Dolphin-Llama3 —verbose.

Key Takeaways:

-Fastest Prompt Eval Rate: AWS g5 (fastest AWS instance tested)
-Fastest Eval Rate: Home PC w/RTX 3080
-Best Cost-Performance Balance: AWS g4dn.xlarge offers a good balance of performance and cost, at $0.58/hr.
-GPU speed is the key differentiator. Within the same family of models, such as the g4dn and g5 instances, the evaluation rates remain consistent. If the model fits in GPU memory there is no need for more cores/memory.
-I did notice that the more system memory available the greater number of tokens used in the output.

Test Results

AWS Instances

c7g.8xlarge (Compute Instance)
•32 cores, 64GB RAM
•Prompt Eval Rate: 38.38 tokens/s
•Eval Rate: 25.07 tokens/s
•Price: $1.27/hr, $941.16/mo
r6g.4xlarge (Memory Instance)
•16 cores, 128GB RAM
•Prompt Eval Rate: 10.15 tokens/s
•Eval Rate: 8.29 tokens/s
•Price: $0.88/hr, $657.10/mo
g4dn.xlarge (GPU Instance)
•4 cores, 16GB RAM, 16GB GPU
•Prompt Eval Rate: 222.23 tokens/s
•Eval Rate: 41.71 tokens/s
•Price: $0.58/hr, $434.50/mo
g4dn.2xlarge (GPU Instance)
•8 cores, 32GB RAM, 32GB GPU
•Prompt Eval Rate: 214.25 tokens/s
•Eval Rate: 41.74 tokens/s
•Price: $0.84/hr, $621.24/mo
g5.xlarge (GPU Instance)
•4 cores, 16GB RAM, 24GB GPU
•Prompt Eval Rate: 624.29 tokens/s
•Eval Rate: 68.08 tokens/s
•Price: $1.12/hr, $831.05/mo
g5.2xlarge (GPU Instance)
•8 cores, 32GB RAM, 24GB GPU
•Prompt Eval Rate: 624.48 tokens/s
•Eval Rate: 66.67 tokens/s
•Price: $1.35/hr, $1,000.96/mo

Local Machines

M2 MacMini
•M2, 8GB RAM, <8GB GPU
•Prompt Eval Rate: 66.38 tokens/s
•Eval Rate: 18.33 tokens/s
M1 MacBook Air
•M1, 16GB RAM, <16GB GPU
•Prompt Eval Rate: 71.58 tokens/s
•Eval Rate: 11.46 tokens/s
Home PC w/RTX 3080
•Intel i5, 64GB RAM, 10GB GPU
•Prompt Eval Rate: 185.67 tokens/s
•Eval Rate: 83.79 tokens/s

Oracle Ampere

Ampere 16 Core, 32GB RAM
•Prompt Eval Rate: 11.96 tokens/s (Duration: 1m34.955180835s)
•Eval Rate: 9.01 tokens/s (Duration: 1m28.461256s)
•Price: $0.1276/hr, $95/mo
Ampere 32 Core, 32GB RAM
•Prompt Eval Rate: 22.54 tokens/s (Duration: 47.93207936s)
•Eval Rate: 14.11 tokens/s (Duration: 44.423782s)
•Price: $0.2796/hr, $208/mo

Here's the data formatted in table for easier viewing - courtesy of u/sergeant113. https://www.reddit.com/r/LocalLLaMA/comments/1dclmwt/comment/l7zrgzm/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button