Job Description
About the role
The DevOps / Cloud + Streaming Infrastructure Engineer – SwishPlay designs, operates, and scales the cloud infrastructure behind SwishPlay’s video streaming platform. Reporting to the Head of SwishPlay and working closely with the Streaming, Media, and Platform Engineering teams, this role ensures that the infrastructure reliably handles video ingestion, processing, storage, and global delivery at internet scale.
This role combines cloud infrastructure engineering, DevOps practices, and streaming-specific operational expertise.
Key Responsibilities
Cloud & Streaming Infrastructure Operations
Design and operate cloud infrastructure for video ingestion, transcoding, and delivery
Ensure high availability, fault tolerance, and scalability of streaming services
Manage compute, storage, networking, and CDN integrations
DevOps, CI/CD & Automation
Build and maintain CI/CD pipelines for streaming and backend services
Automate infrastructure provisioning and deployment workflows
Support infrastructure-as-code and repeatable environments
Performance, Reliability & Cost Optimisation
Monitor performance, latency, throughput, and error rates
Optimise resource usage and cloud costs for media workloads
Support traffic spikes during viral content and live events
Security & Platform Integrity
Implement infrastructure security controls and hardening
Support secure media delivery, access control, and abuse prevention
Work with Security and Trust & Safety teams on platform risks
Incident Response & Operational Excellence
Participate in on-call rotations and incident response
Lead infrastructure-related incident troubleshooting and recovery
Drive post-incident reviews and reliability improvements
Documentation & Continuous Improvement
Maintain infrastructure documentation and runbooks
Improve observability, automation, and operational tooling
Support long-term platform scalability initiatives
Expectations
Deliver reliable, scalable streaming infrastructure
Apply automation-first and reliability-focused practices
Communicate clearly with engineering and operations teams
Operate effectively in a fully remote environment
Requirements
Experience
Proven experience in DevOps, SRE, or cloud infrastructure roles
Experience supporting media, streaming, or high-bandwidth platforms
Experience operating production cloud environments at scale
Experience working in remote or distributed teams (preferred)
Skills
Strong cloud platform expertise (AWS, GCP, Azure, or similar)
Experience with CI/CD, infrastructure-as-code, and automation
Understanding of video streaming architectures and CDNs
Strong troubleshooting and incident response skills
Clear written and verbal communication abilities
Qualifications
Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience)
Cloud, DevOps, or SRE certifications preferred but not required
Job Tags
Remote job, Full time