# off-topic
m
really stupid question, but is supabase intended for scale?
k
Instead of users, how many requests/s do you expect? What are your requirements for availability for writes? Supabase won't scale horizontally as-is, but it can easily be used for MVP - in the worst case, you can move to self-hosted with a beefy primary and multiple read replicas
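The "beefy primary plus read replicas" path can be sketched roughly like this; assuming Postgres 12+, a hypothetical primary at `db-primary.internal`, and a `replicator` role (all names made up for illustration):

```
# clone the primary onto the replica host; -R creates standby.signal
# and writes primary_conninfo into postgresql.auto.conf
pg_basebackup -h db-primary.internal -U replicator \
    -D /var/lib/postgresql/data -R --wal-method=stream

# postgresql.conf on the replica: serve read-only queries while replaying WAL
hot_standby = on
```

Reads can then fan out across replicas, but writes still go through the single primary - which is why the write-availability question matters.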
m
I actually don't know yet. Maybe I'm overestimating, because a lot of the stuff I need might be easily solved via a CDN
I assume around 50k-100k requests/s peak
Hmm maybe I should think about my architecture more before I ask these questions.
generally I just ask so I go with something that doesn't produce scaling issues later on 😅
k
Everything I know either produces scaling issues later, or programming issues earlier ;-). Supabase is more on the "elastic, but won't force you to think about scaling from the start" side. Not-necessarily-fresh read-only, and especially cacheable read-only requests are relatively easy to scale on pretty much any platform.
50k-100k/s is a lot, wow. I've only found benchmarks for 1-2k/s >_>
s
There's a saying that you should first focus on providing a product that might bring in a million users, and then scale afterwards. Bring in the users by building a great product, and focus on scalability when you need to. You can often make an otherwise slow product appear much faster through clever use of animations and interesting loading indicators. That's not to say you shouldn't invest in a sensible architecture at the start, just that you shouldn't worry too much about scalability before it becomes necessary.

50-100K/s is a lot for a single server to handle - I'd argue too much, and I'd almost guarantee you'd need replicas and load balancing. This page (https://severalnines.com/resources/database-management-tutorials/postgresql-load-balancing-haproxy) covers using HAProxy for that, while https://www.postgresql.org/docs/current/high-availability.html has some in-depth resources on it.

It also depends on the load: 50K `SELECT`s against a single table are easier to handle than 50K `SELECT`s with `JOIN`s, `WHERE` clauses and other filters in place - in the same way that returning `hello world` is quicker than returning a computed response. Creating partitions and indexes definitely helps in those cases, and Postgres caches query plans (for prepared statements) and keeps hot data pages in shared buffers, so repeated queries of the same shape stay cheap.
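A hedged sketch of the partitioning/indexing point - the `events` table and its columns are made up for illustration, not from any real schema:

```sql
-- hypothetical events table, range-partitioned by month so scans
-- only touch the partitions that can contain matching rows
CREATE TABLE events (
    id         bigint GENERATED ALWAYS AS IDENTITY,
    user_id    bigint NOT NULL,
    created_at timestamptz NOT NULL
) PARTITION BY RANGE (created_at);

CREATE TABLE events_2024_01 PARTITION OF events
    FOR VALUES FROM ('2024-01-01') TO ('2024-02-01');

-- index the filter column so WHERE user_id = ... avoids a sequential scan
CREATE INDEX events_user_id_idx ON events (user_id);

-- check whether the planner actually uses the index / prunes partitions
EXPLAIN ANALYZE SELECT * FROM events WHERE user_id = 42;
```

`EXPLAIN ANALYZE` is the quickest way to see whether those 50K/s `SELECT`s would hit an index scan or fall back to scanning whole tables.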