Senior Site Reliability Engineer / DevOps Engineer
Senior Site Reliability Engineer / DevOps Engineer
Are you passionate about ensuring the seamless operation of large-scale, distributed, and robust systems? Do you thrive on optimizing performance, increasing reliability, and automating tasks to create more efficient processes? Are you hungry for learning? If so, we would want to chat to you!
As a Senior Site Reliability Engineer (SRE) / DevOps Engineer at our organization, you’ll play a pivotal role in combining software and systems engineering to build, maintain, and enhance our mission-critical services. You’ll be responsible for guaranteeing the reliability and uptime of both internal and external systems, all while driving continuous improvement at a rapid pace.
- Collaborate with a diverse team of software engineers, engaging in iterative processes and effective task planning to drive our projects forward.
- Take ownership of the end-to-end availability and performance of our services, proactively identifying potential issues, and implementing automation to prevent the recurrence of problems.
- Participate in an on-call rotation, ensuring our systems remain stable and responsive even during off-hours.
- Foster collaboration with other engineering teams, promoting the reuse of existing frameworks and gaining insights into their operation.
- Lead the development, implementation, and achievement of service-level objectives that are instrumental in maintaining product reliability.
- Collaborate with software engineering teams to design, implement, and maintain CI/CD pipelines, enabling rapid and reliable software releases.
- Automate and optimize our infrastructure provisioning, configuration, and management processes using industry-standard tools and best practices.
- Implement and manage containerization and orchestration technologies to enhance scalability and resource utilization.
- Maintain and enhance version control systems and repositories for codebase management.
- Steer and drive the SRE / DevOps roadmap, assuming full ownership while actively engaging in negotiation and strategic planning to ensure its successful execution.
- Stay current with industry trends, emerging technologies, and best practices in SRE, DevOps, and automation.
Why Join Us?
- Embrace a culture of diversity, intellectual curiosity, and problem-solving that is essential to our success.
- Work in a blame-free environment that encourages collaboration and innovation.
- Enjoy the freedom to take the lead on meaningful projects and opportunities for professional growth.
- Benefit from a supportive and mentorship-driven environment to continuously learn and expand your skill set.
If you’re ready to take on the exciting challenges of managing systems at scale, apply today to become a valued member of our Site Reliability Engineering team. Together, we’ll shape the future of reliability, performance, and innovation.
- Bachelor’s degree in Computer Science, a related technical field, or equivalent practical experience.
- 5 years of experience as a Site Reliability Engineer or DevOps Engineer, working with software and infrastructure.
- Experience in one of the cloud platforms: Azure, AWS, or GCP.
- Experience with Azure Cloud.
- Experience with high availability systems.
- Experience troubleshooting and debugging production code.
- Experience with application deployment and data pipelines.
- Understanding of distributed computing systems.
- Experience with Snowflake and/or relational databases
Come join an emerging tech company just as we hit our inflection point. Vantage plays in a $250BN addressable market in North America that is seeing significant disruption. Retailers are transforming their digital marketing practices to drive customer acquisition and are looking for new profit centers in retail media networks.
Vantage is uniquely positioned in this space, having established a technology platform that is custom-built for retail media. We offer the only turnkey integrated retail media network. We significantly outperform online media benchmarks by leveraging automation, machine learning, and AI. Ours is the market-leading platform and we have real traction with some of the biggest names in retail.
We are excited to expand the team and take the company to the next level. You would have the opportunity to get in early and obviously, that comes with great possible financial upside, but it also comes with an opportunity to shape the culture of the team. So, we’re picky about the people we invite to join the journey. We’re looking for true team players, not lone wolves, or temporarily hired guns. We are professionals with a passion for doing great work and driving real success for Vantage and our clients.
Headquartered in Toronto but currently working fully remotely, the Vantage team is diverse, creative, and fun. Our belief is that our firm commitment to diversity & inclusion enables Vantage to be better. We also believe that people are happiest and can accomplish the most amazing things when they have the freedom and flexibility to customize their work and life environments and can take on huge, stimulating challenges with fantastic colleagues.
Our tech stack includes Python, Django, React, AngularJS, Snowflake, MySQL, Postgres, Snowplow, Stitch Data, RabbitMQ, Redis, CircleCI, and PagerDuty; hosted mainly on Azure, with limited usage of GCP and AWS.
What We Offer
- Competitive compensation
- Employee share ownership – because Vantage wants you to truly share the success
- Flexible work/life, remote-first philosophy
- Great health benefits from Day 1
- Career development / continuing education allowance – we all want to stay sharp
- Mac laptop & other Apple equipment (we’ll give you a budget to build your perfect work-from-home environment because ergonomics matter)
- Team building activities to keep things fun: wonderland days, cottage trips (COVID permitting), virtual games & escape rooms, team lunches, and weekly nacho breaks on Fridays
Please submit a resume with a cover letter to Careers@GotVantage.com