Serverless Postgres: Autoscaling, Branching, and Recovery | Neon Postgres posted on the topic | LinkedIn

View organization page for Neon Postgres

18,080 followers

1mo

We talk a lot about "Serverless Postgres," but what does that actually mean for your daily workflow? By separating storage from compute, Neon gives you a database that actually behaves like modern software:

1. Autoscaling: It scales up when your traffic spikes and scales to zero when it doesn’t, so you don’t pay for idle compute.
2. Branching: Create full copies of your database in seconds. Use them for development, preview deployments, or safe AI sandboxes.
3. Point-in-Time Recovery: Treat your data like code. If something breaks, just roll back to the exact second before the error happened.

All the Postgres you love, with none of the infrastructure headaches. Check out the breakdown in the video below!
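The autoscaling behaviour in point 1 can be pictured with a toy model. This is a minimal sketch, not Neon's actual scaling algorithm: the function name `target_compute_units`, the per-unit capacity of 100 requests per second, and the cap of 8 units are all assumptions made up for illustration.

```python
import math


def target_compute_units(requests_per_sec: float,
                         capacity_per_unit: float = 100.0,
                         max_units: int = 8) -> int:
    """Return how many compute units a toy autoscaler would run.

    Hypothetical model: each unit serves `capacity_per_unit` requests per
    second, and zero traffic means zero units (scale to zero).
    """
    if requests_per_sec <= 0:
        return 0  # idle: no compute is running, so there is nothing to pay for
    needed = math.ceil(requests_per_sec / capacity_per_unit)
    return min(needed, max_units)  # never exceed the configured ceiling


# Traffic over time: idle, light, a spike, a bigger spike, cooling off, idle.
traffic = [0, 40, 250, 900, 120, 0]
plan = [target_compute_units(r) for r in traffic]
print(plan)
```

The point of the sketch is only the shape of the curve: capacity follows demand in both directions, including all the way down to zero.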

1 Comment

Transcript

What is Neon? Simple: Neon is serverless Postgres. Alright, thanks for watching.

OK, Neon is serverless Postgres, but what that actually means is that Neon offers a fully managed Postgres database that automatically scales with your traffic, supports branching so you can experiment safely, and keeps your data protected with instant point-in-time recovery. You get all of that without having to manage any servers.

But to understand why Neon can do all of that, let's first take a step back. Databases are typically composed of two main components: a storage layer and a compute layer. The storage layer is responsible for persisting your data, and the compute layer is responsible for executing queries to access or modify the data. In traditional databases, these storage and compute layers are tightly coupled, meaning they live on the same machine. That means if your application receives a lot of traffic, you'll need to upgrade to a larger machine. But when traffic decreases, you're still paying for that bigger machine, even if you're not fully using it.

Neon takes a different architectural approach by separating storage and compute, which allows each of these to scale independently for maximum flexibility. So when your app receives more traffic, Neon can automatically scale up your compute capacity, and when your traffic slows back down, Neon can scale your compute back down to reduce your costs. Neon can even scale all the way down to zero in cases when you're just getting started and you don't have any traffic yet.

Separation of storage and compute also allows Neon databases to support branching for both your schema and your data. So just like you can create Git branches to experiment with code changes, now you can create complete copies of your database in seconds. That means you can create branches for isolated development and staging environments, but it also means you can create workflows where you spin up branches for preview deployments on pull requests. Plus, in the age of AI, branches are perfect for creating safe sandbox data environments where your coding agent can make changes without affecting any data in production.

Neon's architecture also enables point-in-time recovery, meaning you can restore your database to an exact moment in time, so if you accidentally make a change that deletes some data, you can just roll back to a previous state in a matter of seconds.

And because Neon is just Postgres, it integrates seamlessly with all of your modern tools and supports many popular Postgres extensions.

And lastly, Neon also includes an auth service built right into your database. Neon Auth makes it easy to implement common authentication flows like sign-in and sign-up, email verification, account management, and a whole lot more. And all of your user data is automatically stored in your database, so you don't have to stitch together separate systems to get started.

With Neon, all you have to do is create a project, copy your connection string, paste it into your app, and you're all set. It just works. So that's what we mean when we say Neon is serverless Postgres. If you're ready to give it a try, you can head on over to neon.com to get started.
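One way to picture why branching and point-in-time recovery can be fast is to treat storage as an append-only history that branches merely point into, so nothing is copied up front. The sketch below is a toy model under that assumption, not Neon's implementation; the `Branch` class and all of its methods are hypothetical names invented for illustration.

```python
class Branch:
    """Toy branch: a pointer into its parent's history plus its own changes."""

    def __init__(self, parent=None, fork_point=0):
        self.parent = parent
        self.fork_point = fork_point  # how much of the parent's history is visible
        self.log = []                 # changes made on this branch only

    def position(self):
        return len(self.log)

    def write(self, key, value):
        self.log.append((key, value))
        return self.position()        # history position just after this change

    def delete(self, key):
        return self.write(key, None)  # a delete is recorded as a None value

    def branch(self, at=None):
        """Create a child sharing history up to `at` without copying any data."""
        fork = self.position() if at is None else at
        return Branch(parent=self, fork_point=fork)

    def state(self, at=None):
        """Rebuild the data by replaying history up to `at` (default: latest)."""
        data = self.parent.state(self.fork_point) if self.parent else {}
        end = self.position() if at is None else at
        for key, value in self.log[:end]:
            if value is None:
                data.pop(key, None)
            else:
                data[key] = value
        return data


main = Branch()
main.write("alice", "admin")
safe_point = main.write("bob", "viewer")
main.delete("alice")                    # the accidental change

dev = main.branch()                     # isolated sandbox on current data
dev.write("carol", "tester")            # does not touch main

restored = main.branch(at=safe_point)   # view of main just before the delete

print(main.state())
print(dev.state())
print(restored.state())
```

In this model a branch costs one small object regardless of database size, and "rolling back" is just reading history up to an earlier position.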


Trevor I. Lasn 1mo

been running Neon in production for months and the branching is what sold me. I spin up a full database copy for every migration test before touching prod. scale to zero is nice for dev branches but branching is the killer feature nobody talks about enough


More Relevant Posts

Anthony Giuliano
1mo
If you're new to Neon Postgres, I've put together a quick little explainer video on what exactly "Serverless Postgres" means.

Manoj Chaudhari
1mo
Post #4 in my observability series is live. Last week, we saw how poor observability affects an organization. This week, let's look at the key components of observability. In this blog post, I cover the three key pillars and a key fourth pillar, "Event". https://lnkd.in/e-7fQwKN Comment if you see any other factors! #Observability #SRE #DevOps #PlatformEngineering #EngineeringLeadership Minutus Computing

The Three Pillars of Observability minutuscomputing.com
Ruth Kegicha
1mo
We scaled a FastAPI service from 2 pods to 8. Throughput didn’t change. No errors. 😑 No alerts. 🤔 Just… no improvement. 🥲 The issue wasn’t Kubernetes. It was one early decision: synchronous database calls where async should have been used. But this isn’t an “always use async” post. Both have a place. Knowing when to use each is what separates systems that scale from ones that quietly bottleneck. Have you run into this before? https://lnkd.in/eVhQyBPW
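One way this kind of bottleneck can arise is a blocking database call made inside an async request handler, which stalls the event loop so requests are served one at a time. The sketch below simulates that with the standard library only. `FakeDatabase`, the 50 ms query time, and the handler names are assumptions for illustration and are not taken from the linked article.

```python
import asyncio
import time


class FakeDatabase:
    """Stand-in for a database whose queries take about 50 ms (assumed)."""

    def __init__(self):
        self.in_flight = 0
        self.max_in_flight = 0

    def _enter(self):
        self.in_flight += 1
        self.max_in_flight = max(self.max_in_flight, self.in_flight)

    def _exit(self):
        self.in_flight -= 1

    def query_sync(self):
        self._enter()
        time.sleep(0.05)           # blocks the whole event loop while waiting
        self._exit()

    async def query_async(self):
        self._enter()
        await asyncio.sleep(0.05)  # yields, so other requests can proceed
        self._exit()


async def handler_blocking(db):
    db.query_sync()                # sync call inside an async handler


async def handler_async(db):
    await db.query_async()


async def serve(handler, requests=10):
    """Run `requests` concurrent handlers; report peak concurrency and time."""
    db = FakeDatabase()
    start = time.perf_counter()
    await asyncio.gather(*(handler(db) for _ in range(requests)))
    return db.max_in_flight, time.perf_counter() - start


blocking_peak, blocking_secs = asyncio.run(serve(handler_blocking))
async_peak, async_secs = asyncio.run(serve(handler_async))
print(f"blocking: peak concurrency {blocking_peak}, {blocking_secs:.2f}s")
print(f"async:    peak concurrency {async_peak}, {async_secs:.2f}s")
```

In the blocking variant only one query is ever in flight per process, which is consistent with the post's observation that adding pods alone did not fix the underlying design choice.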

Sync or Async? The Question That Decides Whether Your App Scales medium.com

Data Engineering Weekly

1,527 followers
1mo Edited
S3 as a File System? The Gap Just Closed ☁️📂

For years, we’ve managed a "data friction" tax: the gap between S3’s massive scale and the local Linux file systems that many of our favorite tools require. We built complex pipelines, duplicated data to NFS filers, and managed the overhead just to keep our legacy and high-performance apps running. That era is ending.

In this week's Data Engineering Weekly, we dive into a major shift in distributed systems: S3 Files. As Werner Vogels recently highlighted, S3 is evolving to provide native NFS access, allowing you to mount buckets directly with sub-millisecond latency.

Why this changes the game for Data Engineers:

🔹Native NFS Access: Applications can now work with S3 data directly using existing tools, with zero code changes required.
🔹High Performance Caching: By using a two-tier model, small active files are cached on EFS infrastructure for speed, while large reads stream directly from S3.
🔹The Stage & Commit Model: Instead of a PUT for every byte, changes are batched every ~60 seconds. This "Git-like" approach slashes S3 request costs and prevents versioning bloat.
🔹S3 as the Source of Truth: Even with simultaneous API and NFS modifications, S3 remains authoritative, with conflicts safely moved to a lost+found directory to prevent data loss.

Whether you are running AI/ML training, processing genomics data, or migrating legacy apps, the ability to use S3 as a shared file system at scale is a massive win for infrastructure simplicity.

Read the full deep dive in Data Engineering Weekly: 👉 https://lnkd.in/gaTf3W5r
Inspired by Werner Vogels’ latest insights on the changing face of S3: 🔗 https://lnkd.in/gwUewJ-a

Which of your legacy pipelines will you simplify first with native NFS access? Let’s discuss in the comments! 👇

#DataEngineering #AWS #S3 #DistributedSystems #CloudInfrastructure #SRE #DataEngineeringWeekly
Soban Taimoor
1mo
I recently worked on optimizing a backend system where API latency had become a serious bottleneck. At peak load, response times were inconsistent, affecting user experience and overall system reliability.

Instead of scaling blindly, I focused on identifying the real issues:
• Inefficient database queries (missing indexes, heavy aggregations)
• Repeated calls to external services
• Lack of caching for frequently accessed data

Here’s what I did:
→ Optimized MongoDB queries and added proper indexing
→ Introduced caching for high-read endpoints
→ Reduced unnecessary external API calls
→ Refactored parts of the service for better separation of concerns

Result: ~40% reduction in latency and much more stable performance under load.

One thing I’ve learned: scaling isn’t always about adding more resources. Sometimes it’s about fixing what’s already there.
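The caching step for high-read endpoints might look something like the sketch below. This is an illustration only, not the author's code: `TTLCache`, `load_profile`, and the 30-second expiry are hypothetical, and the loader stands in for whatever database query the real endpoint runs.

```python
import time


class TTLCache:
    """Tiny in-process cache whose entries expire after `ttl_seconds`."""

    def __init__(self, ttl_seconds, clock=time.monotonic):
        self.ttl = ttl_seconds
        self.clock = clock
        self._entries = {}  # key -> (expires_at, value)

    def get_or_load(self, key, loader):
        now = self.clock()
        entry = self._entries.get(key)
        if entry is not None and entry[0] > now:
            return entry[1]            # fresh hit: skip the database entirely
        value = loader(key)            # miss or expired: go to the source
        self._entries[key] = (now + self.ttl, value)
        return value


db_calls = []  # records every time the (simulated) database is actually hit


def load_profile(user_id):
    """Hypothetical stand-in for an expensive database query."""
    db_calls.append(user_id)
    return {"id": user_id, "name": f"user-{user_id}"}


cache = TTLCache(ttl_seconds=30)
for _ in range(100):
    profile = cache.get_or_load(42, load_profile)

print(len(db_calls))
```

The trade-off is staleness: a reader may see data up to `ttl_seconds` old, so the expiry has to be chosen per endpoint.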
Pratham Mehta
1mo
If you want to learn how databases really work behind the scenes, check out this free YouTube series by Ben Dicken. He covers the entire "Database Internals" book in videos where you will learn:
• How databases store data using B-Trees
• How storage engines work
• LSM Trees and write optimization
• Distributed systems and leader election
• and more

Over 10 hours of free content. Highly recommended for anyone into backend development.

Check it out: https://lnkd.in/gNNSbeWZ

#Databases #BackendDevelopment #SoftwareEngineering
Mixpeek

935 followers
1mo
AWS shipped something last week that I think is a bigger deal than people are giving it credit for. It's called S3 Files. The short version: you can now mount any S3 bucket like a regular file system. ~1ms latency on cached data, the same bucket reachable through both the S3 API and POSIX, and it's already live in 34 regions. The reason this matters is that most tooling — data science notebooks, build systems, log processors, even the bash scripts you wrote five years ago — expects a normal file system. Until now you'd copy data out of S3 onto EFS or a local disk, run your job, and copy it back. S3 Files just deletes that step. And if you can co-locate it next to compute, there's no data egress at all. One nice side effect: agents get easier. A coding agent already knows ls, cat, and grep, so instead of writing a custom S3 tool wrapper, you can point it at a mount path and let it work the way it already wants to. The bigger thing I'd zoom in on is the pattern. Object storage stays the source of truth, and a faster layer sits on top as a cache. We use the same shape at Mixpeek for vectors — S3 Vectors underneath, Qdrant on top, retrieval pipeline on top of that. Same thesis, applied to two different abstractions: object storage is truth, the fast layer is a cache. mixpeek.com #AWS #S3 #CloudInfrastructure #DataEngineering #AIInfrastructure #ObjectStorage
Choudhury Abhisek Panda
1mo
𝙀𝙫𝙚𝙧 𝙏𝙝𝙤𝙪𝙜𝙝𝙩 𝘼𝙗𝙤𝙪𝙩 𝙃𝙤𝙬 𝙄𝙣𝙨𝙩𝙖𝙜𝙧𝙖𝙢 𝙃𝙖𝙣𝙙𝙡𝙚𝙨 𝙈𝙞𝙡𝙡𝙞𝙤𝙣𝙨 𝙤𝙛 𝙇𝙞𝙠𝙚𝙨 𝙖𝙩 𝙩𝙝𝙚 𝙎𝙖𝙢𝙚 𝙏𝙞𝙢𝙚?

You tap ❤️ on a post. At the exact same moment, thousands of others do the same. Yet somehow…
👉 The like count updates smoothly
👉 The app doesn’t crash

So what’s happening behind the scenes? Here’s a simplified view:

1️⃣ Distributed Architecture
Instead of a single server, the system runs across multiple machines, sharing the load of incoming requests.

2️⃣ Load Distribution
Traffic is intelligently routed so that no single server gets overwhelmed during spikes.

3️⃣ Data Partitioning (Sharding)
User and post data are divided across multiple databases to scale efficiently with massive traffic.

4️⃣ Eventual Consistency
The like count might not sync globally at the exact same moment…
👉 But it becomes consistent within milliseconds.

5️⃣ In-Memory Caching
Hot data like like-counts are stored in fast systems (like Redis) for quick reads and updates.

6️⃣ Asynchronous Processing (Queues)
Instead of processing everything instantly, actions are queued and handled reliably in the background.

💡 The real takeaway: It’s not about processing one request instantly…
👉 It’s about handling millions of actions reliably without breaking the system.

That’s where solid system design makes all the difference.

#SystemDesign #DistributedSystems #Scalability #BackendEngineering #SoftwareEngineering #TechExplained
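Points 3, 4, and 6 above (sharding, eventual consistency, and queues) can be combined in a small single-process sketch. Everything here is a simplified assumption, not a description of Instagram's real system: the shard count, the function names, and the in-memory `deque` standing in for a message queue are all hypothetical.

```python
import zlib
from collections import Counter, deque

NUM_SHARDS = 4  # assumed shard count, for illustration only


def shard_for(post_id: str) -> int:
    # crc32 gives the same result in every process, unlike Python's
    # built-in hash() for strings, so routing stays stable.
    return zlib.crc32(post_id.encode("utf-8")) % NUM_SHARDS


shards = [Counter() for _ in range(NUM_SHARDS)]  # one counter store per shard
like_queue = deque()                              # stand-in for a message queue


def tap_like(post_id: str) -> None:
    """Fast path: record the event and return immediately."""
    like_queue.append(post_id)


def drain_queue() -> None:
    """Background worker: apply queued likes to the shard that owns each post."""
    while like_queue:
        post_id = like_queue.popleft()
        shards[shard_for(post_id)][post_id] += 1


def like_count(post_id: str) -> int:
    return shards[shard_for(post_id)][post_id]


for _ in range(1000):
    tap_like("post-a")
for _ in range(250):
    tap_like("post-b")

before = like_count("post-a")  # reads lag behind until the worker catches up
drain_queue()
after = like_count("post-a")
print(before, after, like_count("post-b"))
```

The gap between `before` and `after` is the eventual consistency in point 4: the count is briefly stale, then converges once the queue is drained.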
Het Siddhapura
2mo
Most “Simple” APIs Fail the Same Way. Here’s What Actually Breaks First

I took a minimal Flask API (one endpoint + PostgreSQL) and load tested it with k6 until it started to crack. The goal wasn’t to build a high‑performance system, but to understand how a typical service degrades under load and what scaling actually means.

Setup:
• Flask app with Gunicorn (4 workers)
• PostgreSQL with a single indexed table
• k6 running on a local machine
• One endpoint: GET /data → simple SELECT query

What Broke First (and Why):
• CPU saturation - At ~350 req/s, CPU spiked to 100% and response times increased linearly. Why? Because the API logic itself (request parsing, JSON serialization) became the bottleneck before the database even saw the traffic.
• Database contention - After increasing workers, the bottleneck shifted. PostgreSQL showed high wait events and connection queueing. Why? Because concurrent connections overwhelmed the database’s pool. Even simple reads slowed down under concurrency.

Tradeoffs I Observed:
• Caching → reduced DB load by ~70% but added ~200MB memory overhead
• More workers → improved throughput until CPU became the limit again
• Database indexes → lowered per‑query latency but didn’t solve concurrency spikes

Have you load tested your own services? What was the first bottleneck you hit?

#BackendEngineering #DevOps #SystemDesign #Performance #LoadTesting
Data Engineering Weekly

1,527 followers
1mo Edited
🚀 HOW TO TAME S3 SHUFFLE AT SCALE

In the latest edition of Data Engineering Weekly, we break down the Zero-to-One Guide to optimizing your data pipelines.

🛠 What’s Holding Your S3 Shuffle Back?

🔹Over-reliance on Local Disk: Traditional shuffle relies on local storage, which leads to "Disk Full" errors and prevents independent scaling of compute.
🔹Ignoring S3 Request Limits: Without proper partitioning and pacing, you’ll hit S3 503 Slow Down errors.
🔹Poor Data Serialization: Inefficient formats like Java's default serialization increase payload size, driving up network I/O costs and slowing down transfers.
🔹Lack of Adaptive Execution: Static partitioning fails with data skew. You need Adaptive Query Execution (AQE) to handle skewed keys and coalesce small partitions.
🔹Neglecting Cost Monitoring: API calls (PUT/GET) can cost more than the storage itself. Without lifecycle policies to clean up temporary files, your AWS bill will snowball.

💡 Deep Dive: This breakdown was inspired by the insightful work of Manoj Babu and Meghanath Macha in their recent article on optimizing S3 shuffle performance.

🔗 Read the full breakdown here: https://lnkd.in/gPmjEuDv

#DataEngineering #AWS #S3 #BigData #CloudComputing #Spark #DataArchitecture #DataEngineeringWeekly
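For the serialization and adaptive-execution points, the relevant switches are standard Spark configuration properties. The sketch below only assembles them into `spark-submit` arguments. It is a starting point under the assumption that the job runs on Spark 3.x, it does not cover any S3-specific shuffle plugin settings from the linked article, and the helper name `to_submit_args` is made up for illustration.

```python
# Standard Spark properties for AQE and Kryo serialization.
spark_conf = {
    "spark.sql.adaptive.enabled": "true",                     # turn on AQE
    "spark.sql.adaptive.skewJoin.enabled": "true",            # split skewed join partitions
    "spark.sql.adaptive.coalescePartitions.enabled": "true",  # merge small shuffle partitions
    "spark.serializer": "org.apache.spark.serializer.KryoSerializer",
}


def to_submit_args(conf):
    """Render a config dict as a flat list of `--conf key=value` arguments."""
    args = []
    for key, value in sorted(conf.items()):
        args += ["--conf", f"{key}={value}"]
    return args


submit_args = to_submit_args(spark_conf)
print("spark-submit " + " ".join(submit_args) + " job.py")
```

Here `job.py` is a placeholder for the actual application; the same keys can also be set in `spark-defaults.conf` or on the session builder.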
