Senior Site Reliability Engineer—
Universe opens up the possibilities of the internet to everyone. A magical grid that distills the web into simple Lego-like building blocks. With our app, anyone on earth can build a custom website or online store in seconds — without code, all from a phone.
The goal of opening up the internet to everyone brings exciting challenges of scale — and we need someone dedicated to managing our infrastructure and operations (you!). As we continue to grow, we want to make sure that our creators' sites stay performant and accessible, while also scaling up our DevOps team to continue to meet that ever-expanding challenge.
What you’ll do 🔧
As our Senior Site Reliability Engineer you will own our technical infrastructure.
This means you will make and implement plans to improve our reliability, security, scalability, and development speed, as well as expand our ability to find bugs and inefficiencies. You will design and maintain build and deployment pipelines across our teams (API, Database, Web, App) and manage the collaboration between those teams to make sure the pipelines are best addressing their needs. You will manage our various AWS services, both through the AWS portal and with CloudFormation templates, while also configuring other external services that our application relies on. (Including Cloudflare, Auth0, Stripe and Imgix, among others.)
As we grow, you will continue to scale your systems and operations by expanding (and leading) our DevOps/SRE team.
Who you are 👀
You love systems. Not just working within them, but designing them yourself and taking ownership of them to make sure they fulfill your vision.
You like working in an environment where you are empowered to come up with your own solutions to desired outcomes. You are focused on getting to the right solution — not on being right — and enjoy collaborating with others who share that focus.
You have a strong operational excellence mindset. You want to continuously improve our error rates, alerts, and downtime, and don’t want to get woken up at 5 AM by an unnecessary alarm, because the unnecessary alarm never goes off.
You are a team player. You understand that systems of people are every bit as important as technical systems. You enjoy working with other engineers and helping them improve, whether that be by working through problems together or building infrastructure to make their work faster and smoother.
You have strong experience with AWS — particularly CloudFormation templates, S3, SNS, SQS, RDS, IAM — Docker, Kubernetes, secret management and scripting. (If you also have experience with Helm, SOPS and Istio, even better!)