Principal Software Architect – Observability & Data Platforms

24 Oct 2025

We’re looking for a Principal Software Architect to design and implement next-generation, AI-enabled observability and data platforms that power real-time insights and operational reliability across hybrid cloud environments.

This role reports to the Senior Director of Engineering and partners closely with Platform, Product, and SRE leadership to define the technical vision and implementation strategy for observability and data systems across the organization.

You’ll lead the architecture and design of telemetry, monitoring, and data platforms that form the backbone of our engineering ecosystem, enabling visibility, intelligence, and scalability across our services.

What you get to do in this role:

  • Define and evolve the architecture and design of AI-enabled observability and data platforms across distributed systems.
  • Shape the technical strategy and design principles for metrics, traces, logs, and events pipelines.
  • Drive the application of AI and agentic AI to enhance observability capabilities, including intelligent alerting, predictive analytics, and automated insights.
  • Partner with platform, SRE, and application teams to standardize instrumentation and telemetry frameworks.
  • Establish SLAs, SLOs, and data contracts that connect observability to system and business outcomes.
  • Lead architectural design sessions, technical reviews, and cross-team alignment on observability and AI integration.
  • Author architecture documents, design proposals, and technical playbooks to guide engineering teams.
  • Provide deep technical mentorship on distributed systems, observability design, and data architectures.
  • Drive the adoption of OpenTelemetry, modern observability standards, and AI-assisted tooling across engineering teams.
  • Oversee platform scalability, cost efficiency, and reliability from an architectural perspective.
  • Collaborate with leadership to align platform and AI roadmaps with enterprise engineering strategy.
  • Design and develop scalable, maintainable, and reusable software components with a strong emphasis on performance and reliability.
  • Collaborate with product managers to translate requirements into well-architected solutions, owning features from design through delivery.
  • Build intuitive and extensible user experiences using modern UI frameworks, ensuring flexibility for customer-specific needs.
  • Contribute to the design and implementation of new products and features while enhancing existing product capabilities.
  • Integrate automated testing into development workflows to ensure consistent quality across releases.
  • Participate in design and code reviews, ensuring best practices in performance, maintainability, and testability.
  • Develop comprehensive test strategies covering functional, regression, integration, and performance aspects.
  • Foster a culture of continuous learning and improvement by sharing best practices in engineering and quality.
  • Promote a culture of engineering craftsmanship, knowledge-sharing, and thoughtful quality practices across the team.

Platform Architecture & Strategy

  • Define the architecture and roadmap for a multi-cloud, multi-tenant observability platform.
  • Design for scale, performance, and reliability with cost-aware architecture choices.
  • Ensure systems are cloud-native, container-aware, and optimized for Kubernetes and service mesh environments.

Monitoring, Instrumentation & Developer Enablement

  • Define architectural standards for scalable telemetry systems for logs, metrics, traces, and events.
  • Design frameworks and best practices for instrumentation, monitoring, and observability adoption.
  • Ensure observability validation is embedded into CI/CD and developer workflows.

Data Platform Architecture

  • Design data pipelines for hot/cold telemetry paths and long-term retention.
  • Define governance, privacy, and access control frameworks for observability data.
  • Enable analytics and reporting across telemetry and operational data.

Technical Leadership

  • Own architectural direction and design standards across observability and data teams.
  • Champion engineering excellence, automation, and quality at scale.
  • Mentor engineers and serve as an internal thought leader for telemetry and AI-driven platform design.

  • ID: #54712744
  • State: Georgia (Atlanta 30301, USA)
  • City: Atlanta
  • Salary: TBD (USD)
  • Job type: Full-time
  • Posted: 2025-10-24
  • Deadline: 2025-12-23
  • Category: Et cetera