About the Role
Clerk is looking for product-minded Infrastructure Engineers / SREs to help us bring new functionality to the Clerk platform. This role will have you work on nimble teams supporting our mission to help other builders launch and grow their businesses. This role has a focus on Site Reliability and Database oversight.
Responsibilities:
- Infrastructure Management: Lead the design, deployment, and maintenance of scalable, efficient, and secure infrastructure on GCP
- Automation & Infrastructure as Code: Develop and maintain automation scripts and tools for provisioning, configuration, and deployment, ensuring repeatability and reliability
- Monitoring and Alerting: Establish and maintain robust ongoing monitoring and alerting systems to proactively identify and address performance bottlenecks and issues
- Incident Management: Participate in on-call rotation and work closely with the engineering team to resolve incidents, conduct post-incident analysis, and implement preventive measures
- Security: Collaborate with Security engineers to implement best practices for infrastructure security, compliance, and vulnerability management
- Documentation: Maintain detailed and up-to-date documentation for infrastructure configurations, processes, and procedures
- Mentorship: Provide guidance and mentorship to junior team members, fostering a culture of learning and improvement
- Identify and Diagnose: Support reliability by identifying emergent or future bottlenecks and work with our product teams to diagnose and implement scalability improvements
Qualifications:
- Proven experience in an Infrastructure role, with a minimum of 8 years of hands-on experience
- Proficiency in scripting and programming languages such as Golang
- Strong knowledge of GCP and infrastructure as code
- Familiarity with Cloudflare infrastructure and edge computing technology like Cloudflare Workers
- Comfort in operating with autonomy. While you’ll be on a team, much of your work will be driven by your own intuition and require you to identify issues and then work with the right people in the organization to deliver solutions
- Deep knowledge of database management and optimization preferably using our DB stack (specifically Postgres, Redis, and Cloudflare)
- Experience with containerization and orchestration technologies
- Expertise in monitoring and alerting solutions
- Understanding of network protocols, security best practices, and compliance standards
- Excellent problem-solving skills and the ability to work well under pressure in a fast-paced startup environment