Senior DevOps Engineer

at Sysdig (view profile)
Location San Francisco, CA
Date Posted August 12, 2018
Category United States - H1B Visa Jobs
Job Type python


Here at Sysdig, we’re what you might call container-obsessed. It starts with our unique technology, which listens to the heart of the operating system to surface the deepest data with the least overhead. From there, we’ve created the first-ever Container Intelligence Platform, which proactively uncovers issues before they manifest, and allows for deep digging to solve the most complex problems.

We’re looking for a Senior DevOps Engineer to help us lead the container revolution. You’ll build solutions to enhance availability, performance, and stability of the Sysdig SaaS offering. Together with the engineering team you will support the On-Prem version of Sysdig through - data migration, implementation, troubleshooting, and monitoring.

This role can be based in either San Francisco, CA or Belgrade, Serbia.

Role Responsibilities:

  • Build and manage various components of the internal and production environments with a focus on configuration management, continuous integration and platform automation
  • Implement disaster recovery and reliability improvement initiatives
  • Build and manage software delivery, systems integration, and developer support tools
  • Take Kubernetes and Docker to production for all our new microservices
  • Manage Kubernetes and Cassandra clusters
  • Take ownership of features that range from services provisioning on SaaS or On-Prem
  • Enhance developer CI/CD pipeline using Jenkins and Github
  • Automate our infrastructure and EC2 deployments (CloudFormation) as well as our build automation systems (Jenkins)
  • Conduct performance tuning, load testing, and optimization of information/data processing, maintenance and support of the production environment  

Required Qualifications:

  • Proficiency with configuration management tools like CloudFormation or Terraform (or at least Puppet, Chef, or SaltStack)
  • Solid experience in monitoring cloud services using tools like Sysdig, Datadog, Prometheus, Grafana, Graphite, Nagios, or Zabbix
  • Experience in managing AWS resources including EC2, RDS, Auto Scaling groups, ALB/NLB, IAM
  • Experience in diagnosing and troubleshooting customer facing production service outages
  • Aptitude for troubleshooting complex problems in high-throughput web applications and network services
  • Command of at least one of the following : Java, Python, Bash, and Golang
  • Solid understanding of Linux systems and networking
  • Working knowledge of Git

Desired Qualifications:

  • Worked with containers such as Docker or Rocket
  • Deployed Kubernetes or OpenStack clusters
  • Managed any of these clusters - Cassandra, HBase, HDFS, Elasticsearch
  • Set up Kafka or Redis clusters
  • Used log aggregation services like Elasticsearch or Splunk
  • Familiar with CI/CD pipelines using Jenkins, Bamboo or TeamCity
  • Knowledge of ITIL terminology for incident and problem management
  • Background in PCI/HIPAA compliant infrastructure in the cloud

Why work at Sysdig?

  • We’re a well funded startup that already has a large enterprise customer base.
  • We have a pragmatic, approachable engineering culture, from the CEO down.
  • We have an organizational focus on delivering value to customers.
  • Our open source tools ( are widely used and loved by technologists & developers.
  • We have fun team and company events, beer outings, and lots of espresso (if you’re in to that).

Visa Assistance

Open to assisting the right candidate with the following Visa(s) / Work Permit(s)

1) United States - H1B Visa Jobs

Drop files here browse files ...