Site Reliability Engineer
We are hiring a Site Reliability Engineer (SRE) to join our team and work on our cutting-edge video streaming platform, StreamShark.io. As a trusted end-to-end video streaming service, StreamShark powers secure, high-profile live streams for some of the world’s largest companies and VIPs. We deliver internal (confidential) Town Halls, All Hands meetings, and large-scale public or private events to global audiences, where flawless execution is paramount.
StreamShark is engineered for scalability and reliability from the ground up. We are continuously innovating in areas like live streaming, video encoding, AR/VR integration, and cloud-based infrastructure. In this role, you’ll contribute to both customer-facing features and critical backend services, helping us continue to deliver exceptional streaming experiences for major global enterprises.
As an SRE at StreamShark, you will manage, evolve, and take significant ownership of key areas within our infrastructure and DevOps stack.
Your key responsibilities will include:
- Ensuring our VMs, cloud systems, software, and databases are highly available and performing optimally in an environment where 24/7 uptime is critical. This includes participation in an on-call rotation to address urgent issues.
- Working with our alerting tools (and customer feedback) to analyse, troubleshoot, and rapidly diagnose application and networking issues.
- Being passionate and proactive in identifying opportunities for consolidation, cataloguing our systems, and driving automation initiatives.
- Creating and maintaining internal dashboards and visualisations to provide greater visibility into the health and utilisation of our infrastructure.
- Working closely with the development team to improve processes around change, patch, and release management in accordance with our Information Security Management System (ISMS).
- Being security-aware, and helping us mitigate threats through proactive monitoring, patching, and adherence to best practices, so that we maintain our key security certifications.
- Acting as a technical escalation point for complex client issues related to platform reliability, and collaborating with account management to provide technical insights to key enterprise clients, ensuring contractual obligations around service delivery and reliability are met.
- Working with our Continuous Integration (CI) environment to ensure the integrity and stability of the system.
- Optimising our existing architecture, identifying risks, and proposing improvements through consolidation, containerisation, or serverless approaches to drive efficiencies and reduce infrastructure costs.
To be considered for this position, you must meet the following criteria:
- A Computer Science, Software Engineering, or equivalent IT degree.
- 3+ years of professional experience as an SRE.
- Expertise in Linux system administration, with solid experience in remote access and scripting (e.g., Bash or Python).
- Hands-on experience with cloud platforms such as AWS or Google Cloud, and their related compute services.
- Experience with configuration management (CM) and deployment tools such as Ansible and Terraform.
- Experience with containerisation tools (e.g., Docker, Kubernetes).
- Solid understanding of networking fundamentals (OSI model, DNS, VPCs, firewall management, routing, etc.) coupled with hands-on experience in designing and provisioning related network solutions.
- Experience with a modern version control system such as Git, and building/packaging/shipping software under UNIX-like operating systems.
- Continuous Integration (CI) experience.
- Proven experience with designing, building, and maintaining scalable distributed systems.
- Ability to work 3 days per week from our Melbourne CBD office to maintain a collaborative environment, and to occasionally accommodate early starts for global operations.
- Australian citizenship or permanent residency, as required to work legally in Australia.
Desirable Experience:
- Experience with monitoring tools, including New Relic, Pingdom, and Tenable.io.
- Experience working with CDNs, HAProxy, Varnish, and Wowza Streaming Engine.
- Building internal dashboards or visualisations with common stacks such as InfluxDB/Grafana, Elasticsearch/Logstash, or equivalent.
- Understanding of live and on-demand video streaming, and the associated software tools, video encoders, and workflows.
Employee Benefits:
- Competitive base salary + superannuation.
- Bonus pool based on team performance targets.
- Professional development budget for certifications and courses.
- Regular Brown Bag lunch sessions.
- Mac laptop provided.
About StreamShark
At StreamShark, you’ll be an integral part of a passionate and agile team where your contributions directly shape the future of our cutting-edge video streaming platform. We champion an environment where you gain broad, hands-on experience across diverse technological domains. Your work will have a visible and immediate impact on our product and its success with global enterprises.
We thrive on innovation and are committed to exploring the forefront of video technology. You’ll have the chance to experiment with and implement the latest advancements, especially in live streaming, video encoding, content delivery, and immersive AR/VR experiences.
Collaboration and mutual support are the cornerstones of our team. We foster a dynamic and inclusive atmosphere where knowledge-sharing is encouraged, and every voice is valued. We’re proud that many of our team members choose to build and advance their careers with us long-term, a testament to the supportive, challenging, and rewarding environment we’ve cultivated.
Our team is based in a friendly and modern co-working space in the heart of Melbourne’s CBD, conveniently located for public transport.
To apply for this position, email [email protected] with your resume/cover letter.