We are seeking a highly skilled Senior AWS SRE/DevOps Operations Engineer to join our globally distributed team. The ideal candidate will have expertise in AWS, Linux, Kafka, and Big Data technologies, as well as experience with automation, cloud monitoring, and security. The role involves provisioning, monitoring, and operating cloud services, analyzing and solving operational issues, and ensuring the integrity and security of servers and systems.
Key Highlights
Technical Skills Required
Benefits & Perks
Job Description
Role: Software Engineer Senior AWS SRE/DevOps Operations Engineer
Location – REMOTE [East Coast candidates]
Duration - 3+ years
Interview – Video(s)
Rate – Open all inclusive
*** CANDIDATE MUST HAVE RECENT PUBLIC SECTOR PROJECTS EXP ***
MUST HAVES:
- Expertise with GIT
- Expertise with Concourse including setup, management and troubleshooting of new pipelines
- Expertise with Linux specifically SUSE and Ubuntu
- Expertise with Kafka, Zookeeper and Big Data technologies
- Expert in development of automation for testing, deployment, scalability, and management cloud services
- Expertise with building, implementing, and/or supporting cloud monitoring tools
- Expert knowledge of Cloud Computing and Databases
- Expert understanding of web services, networking, virtualization, and internet protocols
- Expertise with security fundamentals as they pertain to SaaS Multitenant Application systems
- Experience with AWS Route 53, EC2, S3, CloudWatch, DynamoDB, RDS, IAM, ACM, KMS, VPC
- Experience with Cloud Foundry based environments
- Experience with Jenkins and/or Chef automation and Terraform
- Expert with Kubernetes, troubleshooting, operations, management, and configuration of complex Kubernetes services
- Exposure to and understanding of troubleshooting IP networks and application stacks
- Experience with observability tools such as Prometheus and Grafana.
- BS/BA degree in Computer Science, Management Information Systems, or related IT discipline preferred
Job Requirements:
- Provision, monitor, and operate cloud services in a globally distributed team
- Analyze and solve operational issues and respond to incidents
- Exposure to working with appropriate complex systems and database administration as well as managing landscape maintenance, upgrades, and hotfixes
- Maintaining the integrity and security of servers and systems
- Exposure to developing and operating monitoring policies and standards
- Ensure proper resource allocation related to the use of computing resources across cloud environments
- Conduct incident root cause analysis and implement continuous improvements
- Partner with product development team to design and enhance service reliability
- Exposure in developing and implementing testing strategies and documenting results
- Work in a diverse environment and cross-train with other global team members
- Willingness to Support On-call rotation schedule
- Flexible schedule which may include weekend or after-hours work
Similar Jobs
Explore other opportunities that match your interests
Bright Vision Technologies
executiveplacements.com