Principal Distributed Systems Engineer
Location: Remote (US & Canada)
Type: Full-Time
Join a senior engineering team building large-scale distributed and AI-driven systems. You'll design, scale, and optimize the core services that power real-time, intelligent decisioning across billions of events.
Responsibilities
*
Architect and scale distributed and agentic AI systems.
*
Drive best practices in scalability, consistency, and fault tolerance.
*
Lead development of core platform services and real-time analytics.
*
Ensure resilience, observability, and operational excellence.
*
Collaborate with Data, AI, Product, and Platform teams.
Requirements
*
10+ years in distributed systems engineering.
*
Strong knowledge of distributed systems theory and high-throughput architectures.
*
Experience with Kafka, Pulsar, DynamoDB, Cassandra, ClickHouse, Flink, etc.
*
Hands-on with AI agents, RAG, and durable workflows.
*
Proficient in Python (preferred), Go, Rust, or Java.
*
Deep AWS experience (S3, ECS/EKS, DynamoDB, Lambda, Bedrock).
*
Strong architectural leadership and mentorship skills.
If you're passionate about building resilient, large-scale systems and driving technical direction, we'd love to connect.
