When does server-side rendering beat static rendering?

Static rendering suits content that rarely changes and looks the same for everyone, such as marketing pages or blogs. Server-side rendering pays off once every response depends on login, location or live data. Many applications mix both per page rather than committing to one mode globally.

What does a CDN actually do for performance?

A CDN keeps copies of the delivery in data centres close to the user. Instead of 150 milliseconds across a continent the trip drops to a few milliseconds, and the origin server is relieved because it no longer answers most requests itself.

Why is the database often the first bottleneck?

Web servers are easy to multiply, a relational database with a single write primary is not. Under load, connections, locks and slow queries run short first. Indexes, connection pooling and read replicas push that point much further out.

Web Architecture That Grows With the Business

Static rendering, caching layers, CDN and horizontal scaling compared. Why early architecture decisions keep the later bill smaller and growth predictable.

Published on January 7, 20266 min read

Growing With Demand Is a Decision, Not Luck

An application that runs smoothly at twenty concurrent users can collapse at two thousand. The difference rarely lies in a single feature and almost always in the architecture underneath. Scalability describes how well a system absorbs more load without response times climbing or costs exploding.

The tricky part is timing. Architecture is decided early, when load is still small and everything feels fast anyway. That is exactly when the switches are set that later either allow a calm build-out or force an expensive rewrite. With the right boundaries drawn early, the later spend goes toward growth, not toward repair.

Scaling is not a feature bolted on afterwards but a property that follows from the first structural decisions.

Static Rendering Versus Server-Side Rendering

The first major switch concerns the moment data turns into finished HTML. With static rendering this happens once at build time. The finished page then exists as a file and can be served any number of times without a server computing anything per request. With server-side rendering the page is produced fresh on every request, which allows personalised or highly current content but costs compute time per call.

In practice this is not an all-or-nothing choice across the whole application. Modern frameworks let the decision be made per page or even per component. A product overview can be static and regenerated in the background every few minutes (incremental regeneration), while the shopping cart of the same application renders server-side and per user.

Strategy	Compute per request	Freshness	Typical use
Static (build)	near zero	until the next build	marketing, documentation, blog
Incrementally regenerated	low, only on expiry	seconds to minutes	catalogues, price lists
Server-side per request	high, scales with traffic	real time	account, search, personalised view
Client-side fetched	low on the server, more in the browser	real time	dashboards behind login

With Next.js this mix maps cleanly, because the App Router classifies each route on its own rather than forcing a global rule. The rule of thumb stays simple. What looks the same for everyone and rarely changes should not be recomputed on every request.

Caching Is the Cheapest Compute There Is

The fastest request is the one that never reaches the origin server. Caching means storing a once-computed result and reusing it instead of producing it again. The gain is twofold, because the answer arrives faster and the expensive part of the system is relieved.

Caching is not a single switch but a sequence of layers that interlock.

Browser cache on the user's device for files that do not change, controlled through headers such as Cache-Control.
CDN cache at the network edge for whole pages or responses that are identical for many users.
Application cache in the server's memory for prepared data structures.
Data cache (such as Redis) for expensive query results that many requests share.

The real difficulty is not storing but discarding at the right moment. A cache that holds too long shows stale content, a cache that holds too briefly saves nothing. Workable tools for this are short lifetimes for moving data, targeted invalidation on write, and a version stamp in file names so that a new deployment cleanly replaces the old state.

A CDN Brings the Content Closer to the User

Physics sets a hard limit. A signal needs time to cover distance, and a server in Frankfurt inevitably answers a user in São Paulo more slowly than a local one. A content delivery network distributes copies of the delivery across many locations worldwide, so that each request is answered from a nearby node.

The effect is measurable and large. An intercontinental round trip easily costs 150 to 250 milliseconds in travel time alone, while a hit at a nearby CDN node sits in the single-digit millisecond range. Add to that the relief effect, because every request answered by the CDN never reaches the origin server at all. For static content it is easy to absorb more than 90 percent of traffic at the edge.

This latency feeds directly into the Core Web Vitals, the metrics for user experience and visibility. Good thresholds are a Largest Contentful Paint under 2.5 seconds, an Interaction to Next Paint under 200 milliseconds and a Cumulative Layout Shift under 0.1. A CDN mainly improves the first value, because the serving distance shrinks.

The Database Is the Real Bottleneck

Web servers are the easy side of the equation. When load rises, more identical instances go up beside them and a load balancer spreads the requests. This is horizontal scaling, and it works well as long as the servers themselves hold no state. That is exactly why sessions, uploads and caches belong out of the individual server and into shared services, because a stateless server can be multiplied without risk.

The database does not follow this logic so easily. A classic relational database has one node that writes, and that node cannot be copied at will. Under load, three things run short first, namely open connections, locks on shared rows and the runtime of individual queries. A query without a fitting index, invisibly fast at a thousand rows, can stall the whole system at ten million rows.

Ways to Push the Bottleneck Out

Several levers move that point much further out before a fundamental rebuild becomes necessary.

Indexes on the columns that are actually filtered and sorted on, rather than blindly on every column.
Connection pooling, so that not every request opens a new, expensive connection.
Read replicas that hold copies of the data and take over all reading requests, leaving the write node free.
Data caching for results that many users share and that do not change by the second.

Only once these means are exhausted do heavier tools such as splitting the data across several nodes (sharding) come into question. This order is deliberate, because each step buys time at a lower cost than the next.

Why Early Decisions Save Money Later

Architecture is a bet on the future, and the cheapest correction is the one never needed at all. A stateless application, a clear line between static and dynamic parts and a database with indexes in place cost almost no extra effort at the start. Retrofitting the same properties once the system is already under load and customers are watching costs many times more and ties up the team for weeks.

The leverage lies in the order of costs. A missing index is one line, decoupling state from twenty servers after the fact is a project. Treating scaling as a property from the start replaces expensive emergencies with predictable growth. This is exactly what we mean by the movement plan further, described on the Mission page, and it applies earlier than most expect.

That does not mean building for millions of users from day one. It means making the few decisions cleanly that are hard to reverse later, and deliberately keeping everything else simple. Which of these switches matter for a specific project depends on expected growth and the data model, and that can be clarified upfront. Anyone who wants the architecture reviewed early will find the direct route through contact.

An architecture that holds today and grows tomorrow starts with the right questions. We talk it through and order the decisions for the project at hand.