6 days ago - They provide up to 8 NVIDIA T4 GPUs, 96 vCPUs, 100 Gbps networking, and 1.8 TB local NVMe-based SSD storage and are also available as bare metal instances. G4dn instances are equipped with NVIDIA T4 GPUs which deliver up to 40X better low-latency throughput than CPUs, so more requests can be ...

instances.vantage.sh › aws › ec2 › g4dn.xlarge

g4dn.xlarge pricing and specs - Vantage

The g4dn.xlarge instance is in the GPU instance family with 4 vCPUs, 16 GiB of memory and up to 25 Gibps of bandwidth starting at $0.526 per hour.

Videos

35:51

YouTube

Installing Windows 10 on AWS (g4dn + Nvidia drivers) | Cloud Gaming ...

Virtual Workstations on AWS with EC2 G4dn Instances - YouTube

November 21, 2020

08:03

YouTube

How do I attach and use an Elastic GPU to my Windows EC2 Instance?

NVIDIA GPU Cloud with AWS, Step by Step - YouTube

December 14, 2017

View all

AWS

aws.amazon.com › blogs › aws › now-available-ec2-instances-g4-with-nvidia-t4-tensor-core-gpus

Now Available – EC2 Instances (G4) with NVIDIA T4 Tensor Core GPUs | Amazon Web Services

November 3, 2022 - The instances are equipped with up to four NVIDIA T4 Tensor Core GPUs, each with 320 Turing Tensor cores, 2,560 CUDA cores, and 16 GB of memory. The T4 GPUs are ideal for machine learning inferencing, computer vision, video processing, and real-time ...

Vantage

instances.vantage.sh › aws › ec2 › g4dn.2xlarge

g4dn.2xlarge pricing and specs - Vantage

The g4dn.2xlarge instance is in the GPU instance family with 8 vCPUs, 32 GiB of memory and up to 25 Gibps of bandwidth starting at $0.752 per hour.

Umbrella

umbrellacost.com › home › learning center › aws cost management › amazon ec2 g4 instances

Using AWS EC2 G4dn and G4ad | Amazon EC2 G4 | Umbrella

April 10, 2025 - Amazon G4dn instances provide the latest generation NVIDIA T4 Tensor Core GPUs, AWS custom second generation Intel® Xeon® Scalable (Cascade Lake) processors, up to 100 Gbps of networking throughput, and up to 1.8 TB of local NVMe storage.

EC2 Pricing Calculator

costcalc.cloudoptimo.com › aws-pricing-calculator › ec2 › g4dn.xlarge

g4dn.xlarge Pricing and Specs: AWS EC2

The g4dn.xlarge instance is part of the g4dn series, featuring 4 vCPUs and Up to 25 Gigabit of RAM, with Gpu Instances.

NVIDIA Developer

developer.nvidia.com › blog › getting-the-most-out-of-nvidia-t4-on-aws-g4-instances

Getting the Most Out of NVIDIA T4 on AWS G4 Instances | NVIDIA Technical Blog

August 21, 2022 - AWS offers the G4dn Instance based on NVIDIA T4 GPUs, and describes G4dn as “the lowest cost GPU-based instances in the cloud for machine learning inference and small scale training.”

Vantage

instances.vantage.sh › aws › ec2 › g4dn.12xlarge

g4dn.12xlarge pricing and specs - Vantage

The g4dn.12xlarge instance is in the GPU instance family with 48 vCPUs, 192 GiB of memory and 50 Gibps of bandwidth starting at $3.912 per hour.

VPSBenchmarks

vpsbenchmarks.com › home › gpu plans › amazon ec2 › g4dn.metal gpu plan

g4dn.metal GPU Plan | VPSBenchmarks

Amazon EC2 G4 instances are the industry’s most cost-effective and versatile GPU instances for deploying machine learning models such as image classification, object detection, and speech recognition, and for graphics-intensive applications such as remote graphics workstations, game streaming, and graphics rendering. G4 instances are available with a choice of NVIDIA GPUs (G4dn) or AMD GPUs (G4ad).

Find elsewhere

Google Bing Mojeek

EC2 Pricing Calculator

costcalc.cloudoptimo.com › aws-pricing-calculator › ec2 › g4dn.8xlarge

g4dn.8xlarge Pricing and Specs: AWS EC2

The g4dn.8xlarge instance is part of the g4dn series, featuring 32 vCPUs and 50 Gigabit of RAM, with Gpu Instances.

Medium

nishant-parmar.medium.com › using-aws-g-and-p-series-ec2-instances-for-high-quality-rendering-cloud-gaming-and-machine-55195075334c

Using AWS G and P Series EC2 Instances for High-Quality Rendering, Cloud Gaming and Machine Learning. | by Nishant Parmar | Medium

October 7, 2024 - Using AWS g4dn.xlarge EC2 Instance with Nvidia Tesla T4 GPU for High-Quality Rendering, Cloud Gaming and Machine Learning.

CloudOptimo

cloudoptimo.com › home › blog › aws g4 vs g5 family: a detailed comparison of aws gpu instances

AWS G4 vs G5 Family: A Detailed Comparison of AWS GPU Instances

March 13, 2025 - G4 instances are a cost-effective solution for businesses that require moderate GPU power without needing the highest performance levels. They are best suited for applications that require cost-effective GPU-powered instances.

CloudPrice

cloudprice.net › amazon web services › ec2 › g4dn.xlarge

g4dn.xlarge specs and pricing | AWS | CloudPrice

Amazon EC2 instance g4dn.xlarge with 4 vCPUs, 16 GiB RAM and 1 x NVIDIA T4 16 GiB. Available in 23 regions starting from $383.98 per month.

AWS

aws.amazon.com › about-aws › whats-new › 2022 › 04 › amazon-workspaces-graphics-g4dn-bundles

Amazon WorkSpaces launches new Graphics G4dn bundles to improve performance and optimize costs - AWS

They come with the NVIDIA T4 Tensor Core GPU that features multi-precision Turing Tensor Cores and RT Cores, AWS custom second generation Intel® Xeon® Scalable (Cascade Lake) processors, and the local NVMe storage designed for applications that require fast access to locally stored data.

Vantage

vantage.sh › blog › aws-ec2-gpu-instances-g-family-vs-p-family-g4dn

EC2 GPU Instances | Vantage

Within the accelerated computing instances, two families consist of GPU-based instances, the G and P families. The P family was the first of the accelerated computing instances and was designed for general-purpose GPU compute tasks. The family has since evolved and has become widely adopted for ML workloads, with AI companies, like Anthropic and Cohere, using P family instances.

Vantage

instances.vantage.sh › aws › ec2 › g4dn.4xlarge

g4dn.4xlarge pricing and specs - Vantage

The g4dn.4xlarge instance is in the GPU instance family with 16 vCPUs, 64 GiB of memory and up to 25 Gibps of bandwidth starting at $1.204 per hour.

VPSBenchmarks

vpsbenchmarks.com › home › gpu plans › amazon aws › g4dn.xlarge gpu plan

g4dn.xlarge GPU Plan | VPSBenchmarks

EC2 Pricing Calculator

costcalc.cloudoptimo.com › aws-pricing-calculator › ec2 › g4dn.2xlarge

g4dn.2xlarge Pricing and Specs: AWS EC2

The g4dn.2xlarge instance is part of the g4dn series, featuring 8 vCPUs and Up to 25 Gigabit of RAM, with Gpu Instances.

AWSstatic

d1.awsstatic.com › events › reinvent › 2021 › How_to_select_Amazon_EC2_GPU_instances_for_deep_learning_sponsored_by_NVIDIA_CMP328-S.pdf pdf

Best single-GPU instance for developing, testing, and prototyping: g4dn.xlarge (T4, 16 GB GPU); consider

reddit.com › r/localllama › benchmarking inexpensive aws instances

r/LocalLLaMA on Reddit: Benchmarking Inexpensive AWS Instances

June 10, 2024 -

I recently did some testing using Dolphin-Llama3 across various (inexpensive-ish) AWS instances to compare performance. The results are in line with what one might expect.

Testing was done using default settings with Ollama. I spun up a new instance on Ubuntu, installed Ollama and ran it with Dolphin-Llama3 —verbose.

Key Takeaways:

-Fastest Prompt Eval Rate: AWS g5 (fastest AWS instance tested)
-Fastest Eval Rate: Home PC w/RTX 3080
-Best Cost-Performance Balance: AWS g4dn.xlarge offers a good balance of performance and cost, at $0.58/hr.
-GPU speed is the key differentiator. Within the same family of models, such as the g4dn and g5 instances, the evaluation rates remain consistent. If the model fits in GPU memory there is no need for more cores/memory.
-I did notice that the more system memory available the greater number of tokens used in the output.

Test Results

AWS Instances

c7g.8xlarge (Compute Instance)
•32 cores, 64GB RAM
•Prompt Eval Rate: 38.38 tokens/s
•Eval Rate: 25.07 tokens/s
•Price: $1.27/hr, $941.16/mo
r6g.4xlarge (Memory Instance)
•16 cores, 128GB RAM
•Prompt Eval Rate: 10.15 tokens/s
•Eval Rate: 8.29 tokens/s
•Price: $0.88/hr, $657.10/mo
g4dn.xlarge (GPU Instance)
•4 cores, 16GB RAM, 16GB GPU
•Prompt Eval Rate: 222.23 tokens/s
•Eval Rate: 41.71 tokens/s
•Price: $0.58/hr, $434.50/mo
g4dn.2xlarge (GPU Instance)
•8 cores, 32GB RAM, 32GB GPU
•Prompt Eval Rate: 214.25 tokens/s
•Eval Rate: 41.74 tokens/s
•Price: $0.84/hr, $621.24/mo
g5.xlarge (GPU Instance)
•4 cores, 16GB RAM, 24GB GPU
•Prompt Eval Rate: 624.29 tokens/s
•Eval Rate: 68.08 tokens/s
•Price: $1.12/hr, $831.05/mo
g5.2xlarge (GPU Instance)
•8 cores, 32GB RAM, 24GB GPU
•Prompt Eval Rate: 624.48 tokens/s
•Eval Rate: 66.67 tokens/s
•Price: $1.35/hr, $1,000.96/mo

Local Machines

M2 MacMini
•M2, 8GB RAM, <8GB GPU
•Prompt Eval Rate: 66.38 tokens/s
•Eval Rate: 18.33 tokens/s
M1 MacBook Air
•M1, 16GB RAM, <16GB GPU
•Prompt Eval Rate: 71.58 tokens/s
•Eval Rate: 11.46 tokens/s
Home PC w/RTX 3080
•Intel i5, 64GB RAM, 10GB GPU
•Prompt Eval Rate: 185.67 tokens/s
•Eval Rate: 83.79 tokens/s

Oracle Ampere

Ampere 16 Core, 32GB RAM
•Prompt Eval Rate: 11.96 tokens/s (Duration: 1m34.955180835s)
•Eval Rate: 9.01 tokens/s (Duration: 1m28.461256s)
•Price: $0.1276/hr, $95/mo
Ampere 32 Core, 32GB RAM
•Prompt Eval Rate: 22.54 tokens/s (Duration: 47.93207936s)
•Eval Rate: 14.11 tokens/s (Duration: 44.423782s)
•Price: $0.2796/hr, $208/mo

Here's the data formatted in table for easier viewing - courtesy of u/sergeant113. https://www.reddit.com/r/LocalLLaMA/comments/1dclmwt/comment/l7zrgzm/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button