Overview
HYBRID, 2 days/week onsite in Montreal, Quebec
3-month contract, 40 hours/week, some on-call responsibilities
SKILLS: 7+ years of DevOps, Azure, CI/CD, microservices, mobile, web, Terraform, HCL, AKS, Docker, Kubernetes, SAST, DAST, Azure Key Vault, RBAC, Prometheus, Grafana, ELK, Azure Monitor, Python, SQL, GitHub, Bitrise, Swift, Kotlin, Open Telemetry, Firebase, MongoDB, cybersecurity, Agile
Responsibilities
DevOps Strategy & Ownership
- Own the end‑to‑end DevOps vision, roadmap, and strategy for the organization.
- Define and enforce best practices across CI/CD, infrastructure management, cloud architecture, observability, and security.
- Evaluate, select, and manage DevOps tooling and services for long‑term scalability.
CI/CD Pipeline Management
- Design, build, and maintain CI/CD pipelines for microservices, mobile, and web applications to ensure DevOps maturity and reliable delivery.
- Establish deployment standards, branching strategies, and automated testing integration across all teams.
Cloud & Infrastructure Architecture
- Architect, deploy, and manage Azure cloud infrastructure using Terraform and HCL.
- Own Azure governance including subscription management, networking architecture, identity policies, cost optimization, and scalability patterns.
- Implement secure, scalable patterns for AKS, container registries, serverless components, and supporting cloud services.
Containerization & Orchestration
- Define and maintain Docker image standards, base images, build pipelines, and container security practices.
- Govern Kubernetes deployment patterns, cluster configuration, and workload management on AKS.
Security & Compliance
- Implement security best practices including SAST/DAST, dependency scanning, and secure pipeline requirements.
- Manage secrets (Azure Key Vault or equivalent), RBAC, identity policies, and secret rotation workflows.
- Lead responses to cybersecurity audits, vulnerability management, patching, and penetration test remediation.
Technical Documentation Management
- Develop and maintain comprehensive infrastructure documentation, diagrams, runbooks, and operational procedures.
- Ensure all DevOps and cloud systems are documented for maintainability and knowledge sharing.
Monitoring, Logging & Observability
- Set up and maintain monitoring systems such as Prometheus, Grafana, ELK, Azure Monitor, or similar tools.
- Implement distributed tracing, SLIs/SLOs, alerting standards, and incident dashboards following modern SRE practice.
- Continuously improve observability to reduce MTTR and enhance reliability.
Automation & Scripting
- Automate operational workflows using Bash, Python, or equivalent languages.
- Build internal tooling to support developers, reduce toil, and improve operational efficiency.
Incident Management & Reliability
- Lead response to production incidents, conduct root cause analyses, and drive long‑term reliability improvements.
- Own production uptime, on‑call responsibilities, and alerting hygiene.
- Establish disaster recovery strategies, participate in DR exercises, and ensure infrastructure resilience.
Cross‑Team Collaboration & Developer Enablement
- Work closely with mobile, backend, and web development teams to ensure seamless integration of DevOps practices across the entire SDLC.
- Provide internal training, coaching, and support to developers on DevOps tools, pipelines, and environments.
- Participate in Agile ceremonies and contribute to architectural and operational decision‑making.
Continuous Improvement
- Identify process bottlenecks and implement improvements to increase efficiency, reliability, security, and standardization.
- Proactively monitor industry trends and introduce new practices that enhance the organization’s DevOps maturity.
Reporting & Operations
- Provide clear, concise reporting on priorities, status updates, production health, and ongoing initiatives.
- Support ad‑hoc nighttime production deployments as required.
Requirements
- 7+ years of experience in DevOps, SRE, cloud infrastructure, or related roles. DevOps certification is a plus.
- Proven ability to operate independently as the sole DevOps engineer for an organization.
- Strong understanding of SDLC processes, Agile methodologies, and cross‑team collaboration.
- Proficiency in Terraform and HCL, with experience architecting cloud infrastructure.
- Deep knowledge of Microsoft Azure, including AKS, networking, identity, SQL Server, and .NET/C# microservices.
- Strong experience with Docker, container lifecycle management, and Kubernetes workload patterns.
- Experience with CI/CD tools such as Azure DevOps, GitHub Actions, and Bitrise.
- Familiarity with mobile (Swift, Kotlin) and web (Angular, TypeScript) development workflows.
- Proficient in monitoring, logging, and observability technologies (Prometheus, Grafana, ELK, OpenTelemetry).
- Experience with Firebase (remote config, Firestore, analytics).
- Experience with MongoDB (asset).
- Solid cybersecurity knowledge including vulnerability management, patching, and secure pipeline practices.
- Strong problem‑solving, communication, and documentation skills.
- Experience with production‑level deployments and on‑call responsibilities.