Sr Systems Reliability Engineer

The Skywalker Sound Development Group is seeking a skilled Sr System Reliability Engineer to join our team. The Skysound Development Group is developing a set of next-generation tools for audio soundtracks and media distribution. We aim, through the synthesis of institutional wisdom of creative, high-quality audio and cutting-edge software engineering, to bridge the divide between content creation and audience experience.

As a Sr Systems Reliability Engineer within the Group, you will play a key role in conceiving and developing tools to help usher in the new era of post production audio content creation, working in areas such as application development, cloud computing, database, and state-of-the-art security implementations. The Development Group works closely with master audio content creators to produce novel technology for immediate utilization. Your expertise in modern development tools, cloud infrastructure, and security practices will ensure the delivery of reliable, high-quality solutions that serve the needs of creative teams and global audiences.

This role is considered Hybrid, which means the employee will work 2-3 days onsite at our Nicasio, CA office and occasionally from home.

What You'll Do

  • Design, manage and maintain critical infrastructure for both software development and deployed global production resources.

  • Collaborate on the provisioning of cloud infrastructure in AWS using Terraform to ensure consistency and scalability.

  • Maintain and manage multiple Kubernetes clusters across both cloud and on-premise environments.

  • Implement and enforce best practices for secure software development and deployment in alignment with industry standards.

  • Monitor, troubleshoot, and optimize build and deployment processes to maximize efficiency and minimize downtime.

  • Collaborate with cross-functional teams, including developers and security experts, to ensure systems meet operational requirements.

  • Develop, maintain, and enhance CI/CD pipelines using GitLab to support build automation, unit testing, and integration testing.

  • Continuously evaluate and implement tools and technologies to improve workflows and platform reliability.

What We’re Looking For

  • BS Degree in Computer Science

  • 5+ years of experience in DevOps, Site Reliability Engineering, or a related field.

  • Extensive AWS knowledge: EC2, ECS/EKS, Lambda, ELB, ASGs, Route53, KMS, SSM, IAM, S3, ACM, VPC, RDS, Elasticache.

  • Proficiency with modern observability practices: application monitoring, tracing, and profiling tools (e.g. Datadog, New Relic, OpenTelemetry, Splunk).

  • Proficiency with GitLab CI, Terraform, Helm, and Packer 

  • Demonstrated experience designing and managing CI/CD pipelines for complex software platforms.

  • In-depth knowledge of Containers and Container Orchestration technologies: Docker, Kubernetes

  • Experience with Terraform or other infrastructure as code tooling.

  • Strong scripting skills in Python, Bash, or similar languages.

  • Familiarity with modern security practices for protecting sensitive assets in distributed systems.

  • Exceptional problem-solving skills, with a proactive and collaborative mindset.

Preferred Qualifications

  • Experience working with media and entertainment pipelines or pre-release content workflows.

  • Proficiency with Golang, Python, or C++ 

  • Experience with modern AI/ML frameworks (e.g., TensorFlow, PyTorch, Hugging Face) and their integration into operational workflows.

  • Knowledge of container security tools and systems, such as Falco or Aqua Security.

  • Experience with emerging deployment systems like ArgoCD or Flux for GitOps workflows.

  • Familiarity with serverless computing paradigms and technologies such as AWS Lambda or Google Cloud Run/Functions.

  • Understanding of high-performance computing systems in cloud environments.

  • Experience with administering VMWare vSphere clusters.


 


The hiring range for this position in Nicasio, CA is $155,400 to $208,400 per year. The base pay actually offered will take into account internal equity and also may vary depending on the candidate’s geographic region, job-related knowledge, skills, and experience among other factors. A bonus and/or long-term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and/or other benefits, dependent on the level and position offered.
Back to blog

Common Interview Questions And Answers

1. HOW DO YOU PLAN YOUR DAY?

This is what this question poses: When do you focus and start working seriously? What are the hours you work optimally? Are you a night owl? A morning bird? Remote teams can be made up of people working on different shifts and around the world, so you won't necessarily be stuck in the 9-5 schedule if it's not for you...

2. HOW DO YOU USE THE DIFFERENT COMMUNICATION TOOLS IN DIFFERENT SITUATIONS?

When you're working on a remote team, there's no way to chat in the hallway between meetings or catch up on the latest project during an office carpool. Therefore, virtual communication will be absolutely essential to get your work done...

3. WHAT IS "WORKING REMOTE" REALLY FOR YOU?

Many people want to work remotely because of the flexibility it allows. You can work anywhere and at any time of the day...

4. WHAT DO YOU NEED IN YOUR PHYSICAL WORKSPACE TO SUCCEED IN YOUR WORK?

With this question, companies are looking to see what equipment they may need to provide you with and to verify how aware you are of what remote working could mean for you physically and logistically...

5. HOW DO YOU PROCESS INFORMATION?

Several years ago, I was working in a team to plan a big event. My supervisor made us all work as a team before the big day. One of our activities has been to find out how each of us processes information...

6. HOW DO YOU MANAGE THE CALENDAR AND THE PROGRAM? WHICH APPLICATIONS / SYSTEM DO YOU USE?

Or you may receive even more specific questions, such as: What's on your calendar? Do you plan blocks of time to do certain types of work? Do you have an open calendar that everyone can see?...

7. HOW DO YOU ORGANIZE FILES, LINKS, AND TABS ON YOUR COMPUTER?

Just like your schedule, how you track files and other information is very important. After all, everything is digital!...

8. HOW TO PRIORITIZE WORK?

The day I watched Marie Forleo's film separating the important from the urgent, my life changed. Not all remote jobs start fast, but most of them are...

9. HOW DO YOU PREPARE FOR A MEETING AND PREPARE A MEETING? WHAT DO YOU SEE HAPPENING DURING THE MEETING?

Just as communication is essential when working remotely, so is organization. Because you won't have those opportunities in the elevator or a casual conversation in the lunchroom, you should take advantage of the little time you have in a video or phone conference...

10. HOW DO YOU USE TECHNOLOGY ON A DAILY BASIS, IN YOUR WORK AND FOR YOUR PLEASURE?

This is a great question because it shows your comfort level with technology, which is very important for a remote worker because you will be working with technology over time...