The Big Picture____________________________________________
The Senior Platform Engineer serves as the backbone of ClosedLoop’s engineering ergonomics, system quality, and delivery speed. You own the glue code, services, and internal tooling that make product teams fast and safe, spanning AI-first enablement, SRE/DevOps practices, AWS cloud-native infrastructure management, pipelines, ephemeral environments, and IaC. You instrument the platform with actionable dashboards, automate the tedious parts of feature delivery, drive blameless incident management, and hard-wire “start from production and work backwards” quality practices into the product development lifecycle to eliminate friction.
The Day-to-Day___________________________________________
- Platform & Glue Code: Build and maintain the libraries, CLIs, and automations that unify repos, services, and workflows.
- Engineering Ergonomics & Efficiency: Reduce cognitive load; standardize golden paths, templates, and guardrails to raise team throughput.
- SRE & DevOps: Own reliability baselines (SLOs, error budgets), incident tooling, runbooks, and continuous improvement loops.
- AWS Cloud-Native Infra: Design, provision, and operate scalable, cost-aware infrastructure using IaC (Terraform/CDK) and least-privilege IAM.
- CI/CD & Pipelines: Ship on green with progressive delivery (canary releases, feature flags) and deployment annotations; optimize build times.
- Ephemeral Environments: On-demand preview stacks for every PR to shift validation left and accelerate feedback.
- Observability & Dashboards: End-to-end traces, metrics, logs, and product analytics wired into clear dashboards that support decisions.
- Preventative Quality: Bake tests, policies, and security checks into pipelines so the path of least resistance is the path of best practice.
The Right Person for the Job Has_______________________________
- Platform-first, AI-first engineer. Owns the glue code, CLIs, and templates that make teams fast and safe; obsessed with engineering ergonomics and golden paths assisted by agentic workflows and automation.
- SRE/DevOps depth. Sets SLOs/error budgets, designs incident tooling/runbooks, and drives MTTR down via rigorous post-incident learning.
- AWS + IaC expert. Provisions and operates scalable, cost-aware stacks using Terraform/CDK, least-privilege IAM, and repeatable environments from laptop → prod.
- CI/CD authority. Builds ship-on-green pipelines, progressive delivery (canary/flags), deployment annotations, and caching strategies that cut build/test times dramatically.
- Observability native. Wires metrics, logs, traces, and product analytics into crisp dashboards that drive decisions—not vanity charts.
- Production-backwards mindset. Starts with how it will run and fail in production, then works upstream to bake in quality and safety.
- Polyglot builder. Master-level coding in at least one of Python/TypeScript/Go; writes small, composable tools and reliable automation.
- Cloud container fluency. Designs, builds, and scales Docker/K8s workloads with strong image hygiene, SBOMs, and supply-chain controls.
- Compliance by design. Practical experience with SOC 2/HITRUST/HIPAA controls embedded in pipelines (secrets, audit trails, least access).
- Partner to Product & Eng. Translates product intent into testable, observable outcomes; communicates strategy clearly and drives alignment across teams.
- Calm collaborator. Resolves cross-team friction with data and empathy; raises the bar for code quality, reviews, and operational discipline.
- Detail-oriented & proactive. Spots edge cases early, automates the boring stuff, and tackles gnarly problems head-on.
Required Education & Experience_______________________________
- 5+ years of experience in engineering roles at software companies.
- Proficiency in Python, Go, TypeScript, and/or Scala development.
- Prior experience with test automation tools (Locust, Playwright, Jest) and test case management systems.
- Experience with Docker, CI/CD systems, and compliance standards (e.g., SOC 2, HITRUST, HIPAA).