Reddit
r/aws on Reddit: Advice Needed: SageMaker vs Bedrock for Fine-Tuned Llama Models (Cost & Serverless Options)
January 20, 2025 -

Hi all,

I’m a self-taught ML enthusiast, and I’m really enjoying my journey so far. I’m hoping to get some advice from those with more experience.

So far, I’ve successfully fine-tuned a Llama model using both SageMaker JumpStart and Amazon Bedrock. (Interestingly, in Bedrock, I had to switch to a different AWS region to access the same model I used in SageMaker.) My ultimate goal is to build a web-based app for users to interact with my fine-tuned model. However, for now, I’m still in the testing phase to ensure the model generalises well to my dataset.

I’d love some guidance on whether I should stick with SageMaker or switch fully to Bedrock. My main concern is cost management, as I’d prefer to use a serverless endpoint to avoid keeping the model “always-on.” Here’s where I’m stuck:

SageMaker: I’ve been deploying real-time endpoints on low-cost instances and deleting them after testing, but this workflow feels inefficient. I tried configuring a serverless endpoint, but I discovered it doesn’t support models requiring certain features (e.g., AWS Marketplace packages, private Docker registries, or network isolation).

Bedrock: It requires provisioned throughput ($23.50/hour per model unit) to serve fine-tuned models. While it’s fully managed, this seems expensive for my testing phase, and I’ve also noticed that Bedrock doesn’t provide detailed insights into the fine-tuning process.
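For scale, the provisioned-throughput rate quoted above works out roughly as follows (a back-of-the-envelope sketch; the $23.50/hour figure is the one mentioned in this post, and actual rates vary by model, commitment term, and region):

```python
HOURS_PER_DAY = 24
DAYS_PER_MONTH = 30  # rough approximation

def monthly_cost(hourly_rate: float, model_units: int = 1) -> float:
    """Estimate the monthly cost of keeping provisioned throughput running."""
    return hourly_rate * model_units * HOURS_PER_DAY * DAYS_PER_MONTH

# One model unit at the quoted $23.50/hour, running continuously:
print(monthly_cost(23.50))  # → 16920.0, i.e. roughly $17K/month before storage fees
```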

For a beginner like me, what would you recommend?

Should I stick with SageMaker real-time endpoints on a low-cost instance and delete them when not in use?

Would it make sense to fine-tune the model in SageMaker and then deploy it in Bedrock?

Is there another cost-effective solution I haven’t considered?

Thank you for your time and insights!

Top answer
1 of 2
4
I’ve been seeing a lot of questions about choosing between SageMaker and Bedrock for ML deployments, so let me break down what I’ve learned after working with both. The main thing to understand is that they serve different needs.

SageMaker Serverless is your friend if you’re cost-conscious and working with smaller models. It’s basically pay-per-use with no minimum fees, scales to zero when idle, and while it has some limits (6GB RAM, 200 concurrent invocations), it’s great for testing and development.

Bedrock, on the other hand, is more of a commitment. For fine-tuned models, you’re looking at $21-50 per hour per model unit with a mandatory 1 or 6-month commitment. This can add up to around $15.5K monthly plus storage fees. There’s no serverless option for fine-tuned models.

From my experience, SageMaker is best when you want control. You get to play with different instance types, really dig into model training, and optimize costs. Bedrock’s strength is simplicity: clean API integration, managed infrastructure, and it’s great for quick prototyping and scaling production workloads.

For testing, I’d strongly recommend SageMaker. You’ll learn more about the fine-tuning process, have better cost control, and more room to experiment. Plus, here’s a pro tip: consider a hybrid approach. Use SageMaker for fine-tuning but leverage Bedrock’s base models for inference where it makes sense. If you’re building a web application, think about implementing a queue-based architecture; it’ll help manage costs while keeping response times reasonable. My two cents…
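The serverless limits mentioned above (6GB memory, a cap on concurrent invocations) map directly onto the endpoint-configuration API. A minimal boto3-style sketch, assuming a model is already registered in SageMaker (the model and endpoint names here are placeholders, not anything from this thread):

```python
def serverless_endpoint_config(model_name: str,
                               memory_mb: int = 6144,
                               max_concurrency: int = 10) -> dict:
    """Build a create_endpoint_config request for a serverless endpoint.

    6144 MB is the maximum memory size mentioned above; concurrency can go
    up to the account limit (200 concurrent invocations).
    """
    return {
        "EndpointConfigName": f"{model_name}-serverless",
        "ProductionVariants": [{
            "VariantName": "AllTraffic",
            "ModelName": model_name,
            "ServerlessConfig": {
                "MemorySizeInMB": memory_mb,
                "MaxConcurrency": max_concurrency,
            },
        }],
    }

# With AWS credentials configured, you would then do something like:
# import boto3
# sm = boto3.client("sagemaker")
# sm.create_endpoint_config(**serverless_endpoint_config("my-llama-model"))
# sm.create_endpoint(EndpointName="my-llama-model-serverless",
#                    EndpointConfigName="my-llama-model-serverless")
```

Because the endpoint scales to zero, there is nothing to delete between test sessions, which is the workflow inefficiency the question describes.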
2 of 2
2
Bedrock PT is crazy expensive. I would just use Sagemaker. Especially for testing.
Reddit
r/aws on Reddit: EC2 vs SageMaker vs Bedrock for fine-tuning & serving a custom LLM?
August 25, 2025 -

Hello! I am a Computer Vision Engineer; previously I used the HPC center (basically lots of nodes with fancy GPUs) that we had a partnership with to train / run inference on DL models and build pipelines.

Recently, I started a new project, though in a slightly different domain from what I used to work in: the task is to build yet another "fancy and unique" chatbot.
Generally speaking, we want to 1) fine-tune an open-source LLM for our specific narrow domain (yes, we do want to do it), 2) design an app that will allow users to communicate with the LLM through Telegram, 3) be able to offload the weights of the trained model to our local machines.

I have never worked with AWS services before; I have spent a couple of days going through the docs and some forums. Still have some questions left to answer :(

So my questions are:

  1. For the fine-tuning purpose, should I use EC2 with GPU nodes / SageMaker / Bedrock? The EC2+GPU option looks like what I am most familiar with. However, there is also the opportunity to fine-tune on Bedrock as well as SageMaker. Why should I choose one over another? Will I be able to easily offload the weights after tuning the model? Generally speaking, I am trying to wrap my mind around the unique features of each of these services.

  2. What is the best practice / common strategy for deploying and serving custom models? E.g., running Ollama / vLLM on EC2+GPU vs. creating a SageMaker endpoint?

  3. Any potential "beginner traps" that I should be aware of when getting started with AWS?

Would like to hear about your experience. Will appreciate any advice!
Thanks in advance!

Top answer
1 of 2
3
A few thoughts about this.

Bedrock: fine-tuning is only available for certain OSS models, I believe mostly Llama, so that's probably a big limitation if you are looking to tune other models. If this option works for you, it is the easiest since it abstracts all the infrastructure: you just call the API with your model ID and prompt.

SageMaker: JumpStart is a part of SageMaker that gives you easy access to Jupyter notebooks with a lot of Hugging Face models ready to start using. That means training, deploying, doing inference, etc. It might be a good starting point.

SageMaker Python SDK: this is probably something you should get familiar with. It will let you create processing and training jobs, serve models for inference, etc., everything directly from your Python code.

SageMaker infra: jobs, inference endpoints, etc., will handle the underlying infrastructure creation for you, like nodes, storage, container registries, etc.

Inference: if you want to "self-host", I believe vLLM is the most popular option, but it is really not easy to scale, especially when you need to split your model across multiple GPUs.

EC2: this is a primitive; it is possible to host manually here, but you usually want something in the middle, like EKS. Google for "AI on EKS"; there are guidelines for running vLLM and even Ray to distribute the load between nodes.

From Bedrock to EC2, that's the order of abstraction you will have. The lower you go, the more stuff you have to handle.

Last but not least, GPUs on AWS are not cheap and they are hard to come by. I work with startups and I struggle a lot when they need GPUs. With new accounts, you won't even be able to get the required quotas if you don't have access to your account team. And even if you get the quotas, getting the capacity without making reservations is not easy. Keep it in mind.
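"Call the API with your model ID and prompt" looks roughly like this with boto3 (a sketch, not a full implementation: the model ID is a placeholder, and the request body shape shown is the one used by Meta Llama models on Bedrock; other model families use different schemas):

```python
import json

# Placeholder model ID; substitute the ID of your base or fine-tuned model.
MODEL_ID = "meta.llama3-8b-instruct-v1:0"

def build_llama_body(prompt: str, max_gen_len: int = 512,
                     temperature: float = 0.5) -> str:
    """JSON request body for Llama-family models on Bedrock."""
    return json.dumps({
        "prompt": prompt,
        "max_gen_len": max_gen_len,
        "temperature": temperature,
    })

# With AWS credentials and model access configured:
# import boto3
# bedrock = boto3.client("bedrock-runtime")
# resp = bedrock.invoke_model(modelId=MODEL_ID,
#                             body=build_llama_body("Hello, Bedrock!"))
# print(json.loads(resp["body"].read())["generation"])
```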
2 of 2
1
If you want to own the weights → EC2 or SageMaker. If you’re fine with just calling an API → Bedrock.

EC2: most familiar if you’ve done HPC before. Spin up a GPU box, run vLLM/Ollama, do your own fine-tuning. You’ll have to handle scaling, monitoring, and GPU quotas yourself (and those quotas are a pain on new accounts). Easiest to offload weights since they’re just files you can push to S3 and pull locally.

SageMaker: sweet spot for managed ML on AWS. JumpStart gives you HuggingFace models with ready notebooks. Training jobs → save artifacts in S3 → deploy to a managed endpoint. Autoscaling endpoints are nice, but watch costs since they bill 24/7. Great option if you want some infra help but still need to walk away with your tuned weights.

Bedrock: lowest effort, just API calls. But you can only fine-tune certain OSS models (mostly Llama). You don’t get to pull weights out after tuning; they stay in AWS. Good if you want “chatbot infra in 10 minutes,” bad if you need control.

Serving side: for DIY, vLLM on EC2 is fast but scaling multi-GPU gets hairy (usually EKS/Ray territory). For managed, SageMaker endpoints handle scaling + monitoring out of the box. For serverless, Bedrock is just API usage.

Beginner traps: GPU quotas, SageMaker endpoints racking up idle charges, IAM/S3 misconfig when exporting weights, and underestimating how hard scaling inference is outside of Bedrock.

TL;DR – if you need local export → SageMaker or EC2. If you want AWS to abstract it all away → Bedrock.
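The "weights are just files you can push to S3 and pull locally" step can be sketched like this (the bucket and key names are hypothetical; SageMaker training jobs write artifacts as a `model.tar.gz` under whatever output path you configure):

```python
from urllib.parse import urlparse

def split_s3_uri(uri: str) -> tuple[str, str]:
    """Split an s3://bucket/key URI into (bucket, key)."""
    parsed = urlparse(uri)
    return parsed.netloc, parsed.path.lstrip("/")

# Hypothetical artifact location from a finished training job.
ARTIFACT = "s3://my-bucket/llm-finetune/output/model.tar.gz"
bucket, key = split_s3_uri(ARTIFACT)

# With AWS credentials configured:
# import boto3
# boto3.client("s3").download_file(bucket, key, "model.tar.gz")
# then extract locally, e.g.: tar -xzf model.tar.gz
```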
Reddit
r/aws on Reddit: Anyone using Bedrock or SageMaker for production-level LLMs? Looking for insights on real-world performance.
December 16, 2024 -

Hey everyone,

I’m looking into options for deploying production-level LLMs, such as GPT, Claude, or customized fine-tuned models, on AWS. I’m weighing the benefits of using Bedrock versus SageMaker and would greatly appreciate insights from anyone who has experience with GenAI workloads in production.

Here are a few specific points I'm interested in:

- Latency and throughput in actual workloads
- Cost/performance tradeoffs
- Experiences with model customization or prompt tuning
- Challenges in monitoring and scaling

Any real-world experiences, lessons learned, or pitfalls to avoid would be incredibly valuable!

Thanks so much in advance! 🙌

Reddit
r/aws on Reddit: Newbie trying to understand sagemaker/aws and LLMs
February 25, 2024 -

The title is self explanatory. Basically I'm lost exploring ways of deploying cheap instances of models in SageMaker or AWS in general, so I can use them to serve co-workers inside my company.

Bedrock doesn't suit me because these are not private instances, and for big clients this is not a compliant solution. What every client/coworker tells me is "I don't want my data going to OpenAI's servers", which makes sense.

SageMaker can be expensive for testing deployed models, and these are "24-hour inference solutions" that will burn money while there is no activity during off-work hours.

I'm not sure if I can deploy an EC2 instance with the model running inside? I'm lost on how to do it.

Is there anything like SageMaker Serverless? I'm lost here too lol

Maybe there is a better solution outside aws

I would love to learn a "clear and easy to understand map" of how to deploy a model to SageMaker or an endpoint/Lambda for inference. From this I hope to gain more experience and learn to do it for clients.

Top answer
1 of 5
8
Bedrock is still private, it should meet pretty much any compliance in terms of data stewardship. For example, it is even HIPAA compliant. You should speak with your AWS Technical Account Manager about this, they can provide info and make sure you are operating within whatever compliance you require. I highly suggest going that way because it doesn't sound like you have the experience necessary to jump right into SageMaker or EC2. You can get there, but it's much better to start with a fully hosted solution.
2 of 5
6
Bedrock is not "public". Bedrock has both multi-tenant and single-tenant endpoints (on-demand vs. provisioned throughput). In both cases your data remains private (see AWS Service Terms 50.3). In the case of the cloud, there is no scenario where your data is not transmitted to someone's server... that is the whole point.

If you mean that the data must be completely controlled by you in your cloud account (vs. the service/escrow account of Bedrock), SageMaker is one option. SageMaker spins up resources within your Virtual Private Cloud (a logically separate network) where you have complete control. Anything else that gives you compute resources, like EC2 VMs (which SageMaker also uses), is the same.

If you have low throughput (e.g. more idle time than not), running instances 24/7 would indeed get "expensive". When talking about LLMs (like GPT), serverless features (e.g. Lambda or SageMaker Serverless Inference) are usually not enough to run these. These models are large and require an accelerator (e.g. GPU) for any reasonable latency. If you don't require real-time inference, there are options like SageMaker Async Endpoints for asynchronous processing.

If it still sounds like you want to use SageMaker, you can find out how to deploy open-source models from places like HuggingFace. But to me it sounds like you are not very familiar with AWS. It is probably a good idea to get a grasp of the fundamentals like networking and security.
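The endpoint-selection reasoning in this answer can be condensed into a small helper (purely illustrative; the labels and the two-flag decision are my simplification, not AWS terminology):

```python
def suggest_endpoint(needs_gpu: bool, needs_realtime: bool) -> str:
    """Rough decision helper following the reasoning above.

    Serverless inference has no accelerator support, so large LLMs are out;
    async endpoints fit GPU workloads that can tolerate queueing.
    """
    if not needs_gpu:
        return "SageMaker Serverless Inference"
    if not needs_realtime:
        return "SageMaker Async Endpoint"
    return "SageMaker real-time endpoint on a GPU instance"

print(suggest_endpoint(needs_gpu=True, needs_realtime=False))
# → SageMaker Async Endpoint
```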
AWS
Amazon Bedrock or Amazon SageMaker AI? - AWS Decision Guide
Amazon Bedrock offers a more accessible and straightforward way to integrate AI functionality into your projects. It’s appropriate for a broad audience, which includes developers and businesses, that has limited experience in building and training machine learning models, but wants to use ...
Reddit
r/datascience on Reddit: Seeking Advice on Amazon Bedrock and Azure
November 6, 2024 -

Hello everyone. I’m currently exploring AI infrastructure and platform for a new project and I’m trying to decide between Amazon Bedrock and Azure (AI Infrastructure & AI Studio). I’ve been considering both but would love to hear about your real-world experiences with them.

Has anyone used Amazon Bedrock or Azure AI Infrastructure and Azure AI Studio? How would you compare the two in terms of ease of use, performance, and overall flexibility? Are there specific features from either platform that stood out to you, or particular use cases where one was clearly better than the other?

Any advice or insights would be greatly appreciated. Thanks in advance!

Reddit
r/aws on Reddit: May I use Sagemaker/Bedrock to build APIs to use LM and LLM?
February 13, 2024 -

Hi,

I've never used any cloud service; I've only used Google Cloud VMs as remote machines to develop without thinking about money. Now I'm evaluating using an AWS product instead of turning a cloud VM with a GPU on and off.

My pipeline involves a chain of Python scripts using HuggingFace models (BERT-based for BERTopic, and Mistral or another free LLM) as inference models (no training needed).

I saw that SageMaker and Bedrock offer access to cloud LMs/LLMs (respectively), but the options are too many to understand which best fits my needs; I just want to create an API with models of my choice :')

Top answer
1 of 7
6
Amazon Bedrock is a fully managed service that makes foundation models (FMs) from Amazon and leading AI startups available through an API, so you can choose from various FMs to find the model that's best suited for your use case. With the Amazon Bedrock serverless experience, you can quickly get started, easily experiment with FMs, privately customize FMs with your own data, and seamlessly integrate and deploy them into your applications using AWS tools and capabilities. Agents for Amazon Bedrock is a fully managed capability that makes it easier for developers to create generative-AI applications that can deliver up-to-date answers based on proprietary knowledge sources and complete tasks for a wide range of use cases.

Amazon SageMaker is a fully managed service that helps data scientists and developers build, train, and deploy machine learning models at scale. It provides a range of features and tools to simplify the machine learning workflow, from data preprocessing and model training to model deployment and monitoring.

So in a nutshell, Bedrock is the easiest way to build and scale generative AI applications with foundation models (FMs), whereas SageMaker is a managed machine learning service in general.
2 of 7
2
SageMaker foundation models allow you more flexibility and choice than Bedrock. In SageMaker, infrastructure is provisioned on your behalf, and you can train a new model from scratch, pick from a large list of models supported by JumpStart (including HuggingFace), and, if the model can be fine-tuned, do that on SageMaker too. Note that many foundation models cannot be fine-tuned, and others are only available for research and cannot be used in commercial applications. SageMaker will give you the most flexibility but involves more setup work, and you are charged for endpoints while they are running.

Bedrock is focused on offering an API-driven and serverless experience. It offers a curated list of foundation models. You are only charged for what you use (there are no infrastructure costs involved). Only a subset of models on Bedrock allow fine-tuning, but again this will be a very simple API-driven process.

Bedrock and SageMaker offer different characteristics, and the right choice will depend on your use case, your foundation model choice (or even training your own), the need for fine-tuning, and whether you have a data science team or are more developer-oriented.
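The cost trade-off described above (SageMaker bills endpoints by the hour while they run; Bedrock bills per use) can be made concrete with a break-even sketch. All prices here are made-up placeholders, not real AWS rates:

```python
def breakeven_requests_per_hour(endpoint_hourly_cost: float,
                                cost_per_request: float) -> float:
    """Requests/hour above which an always-on endpoint beats pay-per-use."""
    return endpoint_hourly_cost / cost_per_request

# Hypothetical numbers: a $1.50/hour endpoint vs. $0.01 per Bedrock request.
print(breakeven_requests_per_hour(1.50, 0.01))  # ≈ 150 requests/hour
```

Below that sustained rate, the pay-per-use model is cheaper; above it, a running endpoint wins, which is why idle endpoints are the classic cost trap.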
Medium
AWS SageMaker vs. AWS Bedrock for Generative AI – Which to Choose? Plus, a Guide to Building RAG Applications: | by Dr. Sathiskumar Jothi | Medium
August 7, 2025 - Scalability: Both are strong, but Bedrock (9/10) edges out due to serverless auto-scaling. ... You want rapid deployment with pre-trained models (e.g., Claude, Titan). Your team has limited ML expertise or prefers serverless simplicity.
Caylent
Amazon Bedrock vs. Amazon SageMaker AI - What's The Difference? | Caylent
With the integration with Lambda, Bedrock will use the configured FM for the Agent to automatically identify the course of action and run/invoke the correct Lambda function in a proper order, so user requests and tasks are solved. Integration and deployment with other AWS Services: One other great feature is the integration and deployment with ongoing AWS tools, including SageMaker AI projects, experiments and pipelines, for a full ML workflow based on SageMaker AI features and Bedrock API.
CloudOptimo
Amazon Bedrock vs Amazon SageMaker: A Comprehensive Comparison
March 13, 2025 - Although both platforms are part ... SageMaker offers a comprehensive suite for custom model creation and training, whereas Bedrock streamlines the experience by focusing on pre-trained models for rapid deployment...
DEV Community
SageMaker Jumpstart vs Bedrock - DEV Community
September 14, 2023 - Amazon SageMaker Jumpstart empowers ... endpoints. In contrast, Amazon Bedrock is a fully managed service provided by Amazon that enables you to make API calls to access models hosted on AWS....
Reddit
r/aws on Reddit: AWS re:Invent 2024 key findings - Iceberg, S3 Tables, SageMaker Lakehouse, Redshift, Catalogs, Governance, Gen AI Bedrock
January 4, 2025 -

Hi all, my name is Sanjeev Mohan. I am a former Gartner analyst who went independent 3.5 years ago. I maintain an active blogging site on Medium and a podcast channel on YouTube. I recently published my content from last month's re:Invent conference. This year, it took me much longer to post my content because it took a while to understand the interplay between Apache Iceberg-supported S3 Tables and SageMaker Lakehouse. I ended up creating my own diagram to explain AWS's vision, which is truly excellent. However, there have been many questions and doubts about the implementation. I hope my content helps demystify some of the new launches. Thanks.

https://sanjmo.medium.com/groundbreaking-insights-from-aws-re-invent-2024-20ef0cad7f59

https://youtu.be/tSIMStJTJ8I

Swiftorial
Matchups: AWS SageMaker vs AWS Bedrock | Aws Comparison
August 22, 2025 - SageMaker’s curve is steep—train models in days, master pipelines in weeks. Bedrock’s gentler—run inferences in hours, optimize prompts in days. Communities thrive: SageMaker’s forums share training tips; Bedrock’s community covers LLMs. Example: SageMaker’s docs cover pipelines; Bedrock’s cover model selection.
SaaSworthy
Bedrock vs Amazon SageMaker Comparison | SaaSworthy.com
Which product offers a wider range of features for machine learning development? Amazon SageMaker offers a more comprehensive set of features for machine learning development, including data preparation, model training, deployment, and monitoring.
Deepchecks
Amazon Bedrock vs SageMaker AI: When to Use Each One
April 24, 2025 - Amazon Bedrock integrates with leading providers, as of the time of writing this article. Let’s look at its model catalog, which contains serverless models (which will be the main focus) as well as marketplace models. Amazon Sagemaker AI (formerly Amazon SageMaker) is a suite of tools designed to help build, train, and deploy ML models at scale.