Opinions on Amazon Redshift versus other RDBMS?
Help with data warehouse selection - on-prem Oracle vs AWS Redshift
Interactive Analytics: Redshift vs Snowflake vs BigQuery
Main takeaways:
-
Redshift has higher compute per dollar saving you more money for the same amount of total compute time
-
Snowflake's advantage is that it makes managing a distributed cloud database easier (
I think it actually uses redshift under the hood, not 100% sure). The premium is for the less overhead in maintaining a distributed database environment, the ability to elastically change your compute and storage size and being only billed for usage (at the hour level). -
BigQuery is more expensive per compute than either, but is also easier to manage than snowflake
-
Think of the three as a gradient of how much hand holding you want and paying for that.
Google BigQuery vs Amazon Redshift (my take on Quora)
Biggest distinction I've found is that you can't update schemas or do deletes in BigQuery. Depending on your data set, those things might be critical, or might not matter. In Redshift, both of these are possible (and depending what you're doing, can be very efficient)
More on reddit.comHas anyone used both Amazon Redshift and at least one other major RBDMS like PostgreSQL, MySQL, Microsoft SQL Server, or Oracle SQL? I just joined a new company and the first project that we've been tasked with is building out an ODS for our Marketing, Sales, Finance, and Product data. I'm arriving just in time to help them decide on the RDBMS and one of the suggestions from another team member is Amazon Redshift.
I've had plenty of experience working within the four db's mentioned above and I'm very comfortable in what they're capable of. I've worked inside of a large ODS built in Microsoft SQL Server that supported multiple databases with tables that had tens of millions of records. The schemas were well designed and the database rarely suffered any performance issues or hiccups. More recently, I architected a smaller marketing/sales database in MySQL, hooked it up to Zapier for data inputs and Chartio (BI tool) for reads, and it worked like a charm. I'm confident that with the data we're looking to capture and report off of, a "traditional" RBDMS would work just fine.
That said, I want to be open to Redshift though and give it a fair shot. What can Redshift bring to the table that db's like PGSQL and MySQL cannot? What would we sacrifice by choosing Redshift? How easy will it be for me to become comfortable designing and working within Redshift if I already know PostgreSQL? I could go on asking questions but generally I'm just looking to understand if Redshift has any distinct advantages over the db's I've worked with before.