🌐
Economize
economize.cloud › resources › aws › pricing › ec2 › g4dn.2xlarge
g4dn.2xlarge pricing: $548.96 monthly - AWS EC2
Last updated: December 24, 2025The g4dn.2xlarge instance is in the G4DN Accelerated computing family with 8 vCPUs and 32 GiB of memory, pricing starts at $0.75 per hour and $548.96 per month in us-east-1 region.
🌐
CloudPrice
cloudprice.net › amazon web services › ec2 › g4dn.2xlarge
g4dn.2xlarge specs and pricing | AWS | CloudPrice
Amazon EC2 instance g4dn.2xlarge with 8 vCPUs, 32 GiB RAM and 1 x NVIDIA T4 16 GiB. Available in 23 regions starting from $548.96 per month.
🌐
Amazon Web Services
aws.amazon.com › machine learning › amazon sagemaker ai › pricing
SageMaker Pricing
2 weeks ago - Storage charges for a Code Editor space accrued until it is deleted. ... You will be using General Purpose SSD storage for 480 hours (24 hours * 20 days). In a Region that charges $0.1125 per GB-month: $0.112 per GB-month * 5 GB * 480 / (24 hours/day * 30-day month) = $0.373
🌐
Vantage
instances.vantage.sh › aws › ec2 › g4dn.2xlarge
g4dn.2xlarge pricing and specs - Vantage
The g4dn.2xlarge instance is in the GPU instance family with 8 vCPUs, 32 GiB of memory and up to 25 Gibps of bandwidth starting at $0.752 per hour.
🌐
Economize
economize.cloud › resources › aws › pricing › ec2 › g4dn.xlarge
g4dn.xlarge pricing: $383.98 monthly - AWS EC2
Last updated: December 18, 2025The g4dn.xlarge instance is in the G4DN Accelerated computing family with 4 vCPUs and 16 GiB of memory, pricing starts at $0.53 per hour and $383.98 per month in us-east-1 region.
🌐
Cloudzero
advisor.cloudzero.com › aws › sagemaker › ml.g4dn.xlarge
ml.g4dn.xlarge SageMaker ML Instance Specs And Pricing
CloudZero's intelligent platform helps you optimize cloud costs and improve infrastructure efficiency.
🌐
Cloudzero
advisor.cloudzero.com › aws › sagemaker › ml.g4dn.2xlarge
ml.g4dn.2xlarge SageMaker ML Instance Specs And Pricing
CloudZero's intelligent platform helps you optimize cloud costs and improve infrastructure efficiency.
🌐
Vantage
instances.vantage.sh › aws › ec2 › g4dn.xlarge
g4dn.xlarge pricing and specs - Vantage
The g4dn.xlarge instance is in the GPU instance family with 4 vCPUs, 16 GiB of memory and up to 25 Gibps of bandwidth starting at $0.526 per hour.
🌐
EC2 Pricing Calculator
costcalc.cloudoptimo.com › aws-pricing-calculator › ec2 › g4dn.2xlarge
g4dn.2xlarge Pricing and Specs: AWS EC2
Explore the EC2 g4dn.2xlarge instance type and its costs across various regions, empowering you to make informed decisions for your cloud deployment strategy.
🌐
CloudPrice
cloudprice.net › amazon web services › ec2 › g4dn.xlarge
g4dn.xlarge specs and pricing | AWS | CloudPrice
Amazon EC2 instance g4dn.xlarge with 4 vCPUs, 16 GiB RAM and 1 x NVIDIA T4 16 GiB. Available in 23 regions starting from $383.98 per month.
Find elsewhere
🌐
AWS re:Post
repost.aws › articles › AR864QSGIIRlCmxnaNRzeouw › being-mindful-of-your-spend-while-experimenting-with-ml
Being mindful of your spend while experimenting with ML. | AWS re:Post
February 7, 2024 - This instance type costs only $0.05/hour, however ml.g4dn.xlarge is $0.7364/hour or about $120 a week. The unnecessary charges can be simply avoided by stopping the instance when you’re done with your work.
🌐
AWS
aws.amazon.com › amazon ec2 › instance types › g4 instances
Amazon EC2 G4 Instances — Amazon Web Services (AWS)
2 weeks ago - G4dn instances, powered by NVIDIA T4 GPUs, are the lowest cost GPU-based instances in the cloud for machine learning inference and small scale training. They also provide high performance and are a cost-effective solution for graphics applications that are optimized for NVIDIA GPUs using NVIDIA libraries such as CUDA, CuDNN, and NVENC.
🌐
Cloudzero
advisor.cloudzero.com › aws › sagemaker › ml.g4dn.4xlarge
ml.g4dn.4xlarge SageMaker ML Instance Specs And Pricing
CloudZero's intelligent platform helps you optimize cloud costs and improve infrastructure efficiency.
🌐
Reddit
reddit.com › r/localllama › benchmarking inexpensive aws instances
r/LocalLLaMA on Reddit: Benchmarking Inexpensive AWS Instances
June 10, 2024 -

I recently did some testing using Dolphin-Llama3 across various (inexpensive-ish) AWS instances to compare performance. The results are in line with what one might expect.

Testing was done using default settings with Ollama. I spun up a new instance on Ubuntu, installed Ollama and ran it with Dolphin-Llama3 —verbose.

Key Takeaways:

-Fastest Prompt Eval Rate: AWS g5 (fastest AWS instance tested)
-Fastest Eval Rate: Home PC w/RTX 3080
-Best Cost-Performance Balance: AWS g4dn.xlarge offers a good balance of performance and cost, at $0.58/hr.
-GPU speed is the key differentiator. Within the same family of models, such as the g4dn and g5 instances, the evaluation rates remain consistent. If the model fits in GPU memory there is no need for more cores/memory.
-I did notice that the more system memory available the greater number of tokens used in the output.

Test Results

AWS Instances

c7g.8xlarge (Compute Instance)
•32 cores, 64GB RAM
•Prompt Eval Rate: 38.38 tokens/s
•Eval Rate: 25.07 tokens/s
•Price: $1.27/hr, $941.16/mo
r6g.4xlarge (Memory Instance)
•16 cores, 128GB RAM
•Prompt Eval Rate: 10.15 tokens/s
•Eval Rate: 8.29 tokens/s
•Price: $0.88/hr, $657.10/mo
g4dn.xlarge (GPU Instance)
•4 cores, 16GB RAM, 16GB GPU
•Prompt Eval Rate: 222.23 tokens/s
•Eval Rate: 41.71 tokens/s
•Price: $0.58/hr, $434.50/mo
g4dn.2xlarge (GPU Instance)
•8 cores, 32GB RAM, 32GB GPU
•Prompt Eval Rate: 214.25 tokens/s
•Eval Rate: 41.74 tokens/s
•Price: $0.84/hr, $621.24/mo
g5.xlarge (GPU Instance)
•4 cores, 16GB RAM, 24GB GPU
•Prompt Eval Rate: 624.29 tokens/s
•Eval Rate: 68.08 tokens/s
•Price: $1.12/hr, $831.05/mo
g5.2xlarge (GPU Instance)
•8 cores, 32GB RAM, 24GB GPU
•Prompt Eval Rate: 624.48 tokens/s
•Eval Rate: 66.67 tokens/s
•Price: $1.35/hr, $1,000.96/mo

Local Machines

M2 MacMini
•M2, 8GB RAM, <8GB GPU
•Prompt Eval Rate: 66.38 tokens/s
•Eval Rate: 18.33 tokens/s
M1 MacBook Air
•M1, 16GB RAM, <16GB GPU
•Prompt Eval Rate: 71.58 tokens/s
•Eval Rate: 11.46 tokens/s
Home PC w/RTX 3080
•Intel i5, 64GB RAM, 10GB GPU
•Prompt Eval Rate: 185.67 tokens/s
•Eval Rate: 83.79 tokens/s

Oracle Ampere

Ampere 16 Core, 32GB RAM
•Prompt Eval Rate: 11.96 tokens/s (Duration: 1m34.955180835s)
•Eval Rate: 9.01 tokens/s (Duration: 1m28.461256s)
•Price: $0.1276/hr, $95/mo
Ampere 32 Core, 32GB RAM
•Prompt Eval Rate: 22.54 tokens/s (Duration: 47.93207936s)
•Eval Rate: 14.11 tokens/s (Duration: 44.423782s)
•Price: $0.2796/hr, $208/mo

Here's the data formatted in table for easier viewing - courtesy of u/sergeant113. https://www.reddit.com/r/LocalLLaMA/comments/1dclmwt/comment/l7zrgzm/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button

🌐
Cloudchipr
cloudchipr.com › blog › amazon-sagemaker-pricing
Amazon SageMaker AI Pricing: Detailed Breakdown and Ultimate Guide
ml.p3.2xlarge: $3.825/hour (1 NVIDIA ... such as large datasets and feature engineering. ml.g4dn.xlarge: $0.736/hour (1 NVIDIA T4 GPU, 4 vCPUs, 16 GiB memory)...
🌐
EC2 Pricing Calculator
costcalc.cloudoptimo.com › aws-pricing-calculator › ec2 › g4dn.xlarge
g4dn.xlarge Pricing and Specs: AWS EC2
Explore the EC2 g4dn.xlarge instance type and its costs across various regions, empowering you to make informed decisions for your cloud deployment strategy.
🌐
Concurrencylabs
concurrencylabs.com › blog › sagemaker-ai-cost-savings
How To Keep SageMaker AI Cost Under Control and Avoid Bad Billing Surprises when doing Machine Learning in AWS - Concurrency Labs
Therefore, it is critical to implement processes that constantly adjust the amount of data stored for in-memory requests, otherwise the monthly cost can quickly reach thousands of dollars. ... The chart below displays a cost comparison across a subset of relevant AWS regions. There are regions where cost is >60% more expensive compared to the lower cost ones. Regions such as N. Virginia, Ohio and Oregon are the best options, from a cost perspective. Therefore, it is essential to select the right region for an ML workload and data storage, given potential higher cost in some regions and the fact that there can be substantial charges related to inter-region data transfer.
🌐
Aws-pricing
aws-pricing.com › g4dn.2xlarge.html
g4dn.2xlarge - Amazon EC2 Instance Type
Costs and pricing for Amazon Elastic Compute Cloud (EC2) instance type g4dn.2xlarge in AWS locations in which the instance type is available.
🌐
Aws-pricing
aws-pricing.com › g4dn.xlarge.html
g4dn.xlarge - Amazon EC2 Instance Type
Costs and pricing for Amazon Elastic Compute Cloud (EC2) instance type g4dn.xlarge in AWS locations in which the instance type is available.
🌐
Vantage
instances.vantage.sh › aws › ec2 › g4dn.12xlarge
g4dn.12xlarge pricing and specs - Vantage
Per Minute · Hourly · Daily · Weekly · Monthly · Annually · No Upfront (Savings Plan) Partial Upfront (Savings Plan) All Upfront (Savings Plan) No Upfront · Partial Upfront · All Upfront · No Upfront (Convertible) Partial Upfront (Convertible) All Upfront (Convertible) United States Dollar ($) Compare g4dn.12xlarge to other instances · Having trouble making sense of your EC2 costs?