Join Blue Signal Search as a Senior DevOps Engineer to design, deploy, and manage scalable systems for industrial automation platforms. This role offers a chance to work on cutting-edge infrastructure and impact factory operations across North America. Collaborate with Site Reliability Engineering and software development teams to develop resilient microservices and APIs.
Key Highlights
Technical Skills Required
Benefits & Perks
Job Description
About The Company
Blue Signal Search is a distinguished and award-winning executive search firm renowned for its specialization across diverse industry sectors. With a dedicated team of experienced recruiters, Blue Signal has established a reputation for delivering exceptional talent acquisition solutions tailored to client needs. The firm prides itself on its comprehensive understanding of various professional services and industry verticals, enabling it to connect organizations with top-tier professionals who drive growth and innovation. Committed to excellence and integrity, Blue Signal continuously strives to foster long-term partnerships with both clients and candidates, ensuring mutual success and organizational excellence.
About The Role
We are seeking a highly skilled DevOps Engineer to join our dynamic team remotely, supporting operations across Canada, with a preference for candidates located in the Greater Toronto Area. This role offers an exciting opportunity to build and maintain a cutting-edge infrastructure that seamlessly integrates cloud and edge computing environments. As a key member of the engineering team, you will be responsible for designing, deploying, and managing scalable, resilient systems that underpin our client’s innovative industrial automation platform. Your work will directly impact the safety, efficiency, and reliability of factory operations across North America, providing tangible results that enhance operational uptime and performance.
Qualifications
The ideal candidate will possess over five years of experience in DevOps, Platform Engineering, or Site Reliability Engineering roles. You should have practical expertise in deploying and managing Kubernetes clusters in production environments, troubleshooting containerized workloads, and implementing Infrastructure-as-Code (IaC) using tools such as Terraform, Helm, or Ansible. Strong scripting skills in Python, Go, or similar languages are essential, along with deep Linux system knowledge, networking fundamentals, and performance tuning. Experience with CI/CD pipelines—using platforms like GitHub Actions, GitLab CI, or Jenkins—is required. Exposure to distributed or hybrid cloud and edge systems, especially those involving GPU scheduling or industrial IoT, is advantageous. Excellent communication skills, a collaborative mindset, and a proactive approach to ownership and problem-solving are critical for success in this role.
Responsibilities
- Engineer and automate the cloud-plus-edge infrastructure backbone that ensures the continuous operation, security, and health of thousands of Linux-based devices deployed across various sites.
- Manage and optimize Kubernetes clusters supporting GPU-accelerated computer vision workloads in both public cloud environments and on-premise customer locations.
- Design and implement zero-touch deployment pipelines supporting blue/green, canary, and instant rollback strategies for over-the-air updates, ensuring seamless and reliable software delivery.
- Utilize Infrastructure-as-Code tools such as Terraform and Helm to standardize environment provisioning, guaranteeing consistency across deployments.
- Embed scalable observability solutions using Prometheus, Grafana, and OpenTelemetry to monitor fleet health and performance metrics in real time, enabling proactive issue resolution.
- Collaborate closely with Site Reliability Engineering and software development teams to develop resilient, easy-to-deploy microservices and APIs that meet operational demands.
- Lead post-incident reviews, analyze root causes, and implement systemic improvements to enhance system reliability and prevent recurrence of issues.
- Champion security best practices, including identity management, secrets handling, and network segmentation, across cloud and edge environments to safeguard sensitive data and infrastructure.
- Competitive base salary complemented by meaningful equity options, reflecting the importance of your contribution to the company's growth.
- Flexible working hours and a fully remote work setup, supporting work-life balance and accommodating diverse schedules.
- Annual home-office stipend to enhance your remote working environment.
- Comprehensive health, dental, vision, and mental health coverage to support your overall well-being.
- Generous paid time off, quarterly recharge Fridays, and a dedicated professional development budget to foster continuous growth and rejuvenation.
Blue Signal Search is committed to fostering an inclusive and diverse workplace. We are proud to be an equal opportunity employer and do not discriminate based on race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status. We believe that a diverse team brings unique perspectives and innovative solutions, and we are dedicated to creating an environment where all employees can thrive and contribute to our collective success.