Intro call
Alignment on the role, expectations, and what you’re looking for.
Operate and improve AWS/Azure platforms under Critical Support, using Datadog as the operational foundation for unified observability, incident response, and continuous improvement.
This role is for someone who enjoys operating real production systems and making them better every week — not just keeping the lights on.
You’ll work across AWS and Azure environments for tech-led customers. Datadog is the backbone: metrics, logs, traces, security signals, cloud cost insights, and alerting all live in one place, and both we and the customer operate from that shared view.
You’ll be part of an on-call rotation and you’ll also deliver improvement engineering — automation, guardrails, and tuning — so platforms become more reliable, secure, and cost-controlled over time.
The day-to-day responsibilities of the role.
Datadog skills are essential for this role.
Must-have
Nice-to-have
A simple process designed to respect your time.
Alignment on the role, expectations, and what you’re looking for.
Real scenarios: Datadog signals, incidents, systems, trade-offs.
A realistic task. No long take-home marathons.
Working style fit, then we move quickly.
Critical Cloud is an equal opportunity employer. We value diverse perspectives and are committed to creating an inclusive environment for everyone.
Email us your CV and a short note on why this role fits you. We’ll get back to you as soon as we can.