@Scale: Systems & Reliability 2026

Schedule

Thu, 25 Jun, 2026 at 08:30 am

UTC-07:00
Location

Meydenbauer Center | Bellevue, WA

Advertisement
Building the advanced infrastructure necessary to power today's sophisticated AI models represents a monumental engineering challenge. This endeavor demands the creation of highly scalable, high-performance, and supremely reliable computing systems that are not just general-purpose but are meticulously tailored to the unique, often unpredictable, demands of machine learning workloads.
This specialized infrastructure includes custom silicon accelerators (like GPUs and TPUs), ultra-high-speed networking fabrics, and novel storage architectures designed for massive data throughput—all optimized for parallel processing at an unprecedented scale.
Simultaneously, the very AI that this infrastructure supports is fundamentally revolutionizing the systems themselves. AI is now being deployed to design, operate, and optimize these large-scale systems, leading to a new generation of intelligent, efficient, and resilient system management. Machine learning algorithms are used for dynamic resource allocation, predictive maintenance to prevent outages, and sophisticated anomaly detection to maintain system health.
This AI-driven optimization moves system management from reactive problem-solving to proactive, self-optimizing operation.
Register here: https://atscaleconference.com/events/systems-reliability-2026/
Advertisement

Where is it happening?

Meydenbauer Center, 11100 NE 6th St, Bellevue, WA 98004-5524, United States

Event Location & Nearby Stays:

Know what’s Happening Next — before everyone else does.
At Scale
Host or PublisherAt Scale

Ask AI if this event suits you