← All learning pathsLearning path
Advanced~87 min·4 systemsLarge-Scale Infrastructure
The systems underneath the systems: unique ID generation, distributed object storage, event streaming, and a federated social protocol — the plumbing the internet runs on.
After this path you will be able to
Reason about infrastructure-level trade-offs: ID generation at microsecond precision, erasure coding vs replication for durability, partition leadership failover, and the CAP implications of a federated architecture.
Interview approach for this path
- 1.Start with the durability and availability requirements before anything else. For infrastructure, losing data is worse than downtime.
- 2.Explain your ID generation strategy: if you need globally unique IDs at scale, say Snowflake-style IDs and explain the bit layout (timestamp, machine id, sequence).
- 3.For storage at scale, distinguish replication (copies for availability) from sharding (partitions for capacity). Address both.
- 4.Discuss leader election explicitly: who owns a partition, how does failover happen, and how long is the downtime window?
- 5.Address CAP trade-offs for each component. A broker like Kafka is CP for partition leadership, but AP for read replicas. Be precise.
- 6.Talk about operational concerns: how do you add capacity without downtime, how do you rebalance partitions, and how do you handle a slow follower?
Systems in this path
4 total1
Unique ID Generator (Snowflake)Beginner·12 min
64-bit time-sortable IDs: timestamp + worker id + sequence, minted locally with no per-request coordination.
→2
Amazon S3 (Object Storage)Advanced·25 min
Multi-AZ erasure coding, sharded metadata, strong read-after-write.
→3
Apache KafkaAdvanced·25 min
Partitioned, replicated log. Brokers, ISR, consumer groups, leader failover.
→4
Bluesky (AT Protocol)Advanced·25 min
Federated social: per-user PDS, global firehose Relay, plug-in AppViews and custom feeds.
→
Concepts reinforced throughout