This link has a table that compares instances' GPU memory
https://docs.amazonaws.cn/en_us/AmazonECS/latest/developerguide/ecs-gpu.html Answer from YoFayed on repost.aws
Vantage
instances.vantage.sh › aws › ec2 › g4dn.xlarge
g4dn.xlarge pricing and specs - Vantage
The g4dn.xlarge instance is in the GPU instance family with 4 vCPUs, 16 GiB of memory and up to 25 Gibps of bandwidth starting at $0.526 per hour.
AWS
aws.amazon.com › amazon ec2 › instance types › g4 instances
Amazon EC2 G4 Instances — Amazon Web Services (AWS)
5 days ago - They provide up to 8 NVIDIA T4 GPUs, 96 vCPUs, 100 Gbps networking, and 1.8 TB local NVMe-based SSD storage and are also available as bare metal instances. G4dn instances are equipped with NVIDIA T4 GPUs which deliver up to 40X better low-latency throughput than CPUs, so more requests can be ...
python - How to activate the use of a GPU on AWS EC2 instance? - Stack Overflow
I have tested it on a g5.xlarge, ... Nvidia GPU machine available (Nvidia A10G) as of December 2023. Make sure to use a US region as they are cheaper there, us-east-1 (North Virginia) was one of the cheapest when I checked, at 1.006 USD / hour, so a negligible cost for most people in a developed country. Just make sure to shutdown the VM each time to not keep paying!!! Another working alternative is g4dn.xlarge, which ... More on stackoverflow.com
Problem with gpu
According to this userguide Running Parabricks on AWS - NVIDIA Docs, the ec2 instance type should be “g4dn.4xlarge”. I followed and when executing the germline pipeline, I received this error: WARNING The system has 62 GB, however recommended RAM with 1 GPU is 64 GB. More on forums.developer.nvidia.com
Resolving GPU memory errors in databricks with PyTorch on g4dn.2xlarge instances
In my Databricks job configuration, I’ve specified node_type_id and driver_node_type_id as g4dn.2xlarge. According to the documentation, this instance type should provide up to 32GB of memory. However, when I run the job, I receive a CUDA out-of-memory error indicating that only 14GB of GPU ... More on learn.microsoft.com
Benchmarking Inexpensive AWS Instances
Here's the data formatted in table for easier viewing: AWS Instances Instance Type | Cores | RAM | GPU | Prompt Eval Rate (tokens/s) | Eval Rate (tokens/s) | Price ($/hr) | Price ($/mo) | c7g.8xlarge | 32 | 64GB | - | 38.38 | 25.07 | 1.27 | 941.16 | r6g.4xlarge | 16 | 128GB | - | 10.15 | 8.29 | 0.88 | 657.10 | g4dn.xlarge | 4 | 16GB | 16GB | 222.23 | 41.71 | 0.58 | 434.50 | g4dn.2xlarge | 8 | 32GB | 32GB | 214.25 | 41.74 | 0.84 | 621.24 | g5.xlarge | 4 | 16GB | 24GB | 624.29 | 68.08 | 1.12 | 831.05 | g5.2xlarge | 8 | 32GB | 24GB | 624.48 | 66.67 | 1.35 | 1,000.96 Vs Local Machines Machine Type | Cores | RAM | GPU | Prompt Eval Rate (tokens/s) | Eval Rate (tokens/s) | M2 MacMini | - | 8GB | <8GB | 66.38 | 18.33 | M1 MacBook Air | - | 16GB | <16GB | 71.58 | 11.46 | Home PC w/RTX 3080 | - | 64GB | 10GB | 185.67 | 83.79 More on reddit.com
Videos
35:51
Installing Windows 10 on AWS (g4dn + Nvidia drivers) | Cloud Gaming ...
02:00
Virtual Workstations on AWS with EC2 G4dn Instances - YouTube
08:03
How do I attach and use an Elastic GPU to my Windows EC2 Instance?
10:06
NVIDIA GPU Cloud with AWS, Step by Step - YouTube
Top answer 1 of 2
2
This link has a table that compares instances' GPU memory
https://docs.amazonaws.cn/en_us/AmazonECS/latest/developerguide/ecs-gpu.html
2 of 2
1
Hello Nathaniel,
You can find this information on the launch blogs here:
for G4 series: 16GB GPU Memory https://aws.amazon.com/blogs/aws/now-available-ec2-instances-g4-with-nvidia-t4-tensor-core-gpus/
for P4 ultraclusters: 320GB GPU Memory https://aws.amazon.com/blogs/aws/new-gpu-equipped-ec2-p4-instances-for-machine-learning-hpc/
Hope this helps
Cloudzero
advisor.cloudzero.com › aws › ec2 › g4dn.12xlarge
g4dn.12xlarge Instance Specs And Pricing
CloudZero's intelligent platform helps you optimize cloud costs and improve infrastructure efficiency.
Top answer 1 of 3
3
Checkout this answer for listing available GPUs.
from tensorflow.python.client import device_lib
def get_available_gpus():
local_device_protos = device_lib.list_local_devices()
return [x.name for x in local_device_protos if x.device_type == 'GPU']
You can also use CUDA to list the current device and, if necessary, set the device.
import torch
print(torch.cuda.is_available())
print(torch.cuda.current_device())
2 of 3
2
In the end it had to do with my tensorflow package! I had to uninstall tensorflow and install tensorflow-gpu. After that the GPU was automatically activated.
For documentation see: https://www.tensorflow.org/install/gpu
nOps
nops.io › blog › amazon-ec2-gpu-instances-the-complete-guide
Amazon EC2 GPU Instances: The Complete Guide | nOps
June 13, 2025 - Training Large Machine Learning Models: Training complex models like BERT, LLaMA, or ResNet requires massive GPU throughput, fast interconnects, and large memory capacity. Features like NVLink, multi-GPU configurations, and EFA on P4 and P5 instances make them the clear choice for scalable training.
Google
docs.cloud.google.com › compute › compute engine › gpu machine types
GPU machine types | Compute Engine | Google Cloud Documentation
November 15, 2025 - It is separate from the instance's memory and is specifically designed to handle the higher bandwidth demands of your graphics-intensive workloads. A4 accelerator-optimized machine types have NVIDIA B200 Blackwell GPUs (nvidia-b200) attached ...
AWSstatic
d1.awsstatic.com › events › reinvent › 2021 › How_to_select_Amazon_EC2_GPU_instances_for_deep_learning_sponsored_by_NVIDIA_CMP328-S.pdf pdf
How to select Amazon EC2 GPU instances for deep learning
Best single-GPU instance for inference deployments: G4 instance type; choose instance size g4dn.(2/4/8/16)xlarge based · on pre- and post-processing steps in your deployed application · • · I need the most GPU memory I can get for large models: p3dn.24xlarge (8 V100, 32 GB per GPU) • ·
AWS
docs.aws.amazon.com › amazon ec2 › instance types › amazon ec2 instance type specifications › specifications for amazon ec2 accelerated computing instances
Specifications for Amazon EC2 accelerated computing instances - Amazon EC2
Detailed specifications for Amazon EC2 accelerated computing instance types.
Amazon Web Services
amazonaws.cn › home › amazon ec2 › amazon ec2 g5 instances
Amazon EC2 G5 Instances
5 days ago - G5 instances deliver up to 3x higher graphics performance and up to 40% better price performance than G4dn instances. They have more ray tracing cores than any other GPU-based EC2 instance, feature 24 GB of memory per GPU, and support NVIDIA RTX technology.
EC2 Pricing Calculator
costcalc.cloudoptimo.com › aws-pricing-calculator › ec2 › g4dn.xlarge
g4dn.xlarge Pricing and Specs: AWS EC2
The g4dn.xlarge instance is part of the g4dn series, featuring 4 vCPUs and Up to 25 Gigabit of RAM, with Gpu Instances.
Cloudzero
advisor.cloudzero.com › aws › ec2 › g4dn.xlarge
g4dn.xlarge Instance Specs And Pricing
Beta Notice · CloudZero Advisor's AI is currently in BETA and may provide inaccurate information · Learn More · Switch Theme · Toggle menu · Sign In · Gathering instance details, stand by · Privacy•Terms•Support