Spruce InfoTech is a leading information technology firm that provides varied services to help clients change, manage, and transform their businesses by means of high-quality, innovative, and cost-effective solutions. We serve companies ranging from small businesses to Fortune 500 organizations and guide them in the best possible way to maximize their IT investment and reduce the cost of acquiring new technologies.
Role – Site Reliability Lead Engineer
100% remote
1 year in length
Status: USC/GC
Skills:
1. Kubernetes container management
2. Configuration management tools (Ansible, Puppet, etc.; Ansible preferred)
3. Intermediate VMware knowledge
4. Intermediate-to-advanced vRA and vRO knowledge, including workflow and blueprint configurations
5. JavaScript (AngularJS specifically; TypeScript experience is fine if the candidate can work backward to AngularJS)
6. Programming experience (Python, Java, or Grails) with CI/CD experience
NOTE - If the candidate does not have vRA or vRO experience, that is OK, as long as the candidate can manage and work with an API and understands what needs to get done.
Also, if the candidate does not have AngularJS experience, that is OK.
Site Reliability Engineering combines software and systems engineering to manage some of our customers' most complex environments. The client's large-scale, fault-tolerant environments, deployed for our customers, run some of the most complex applications. You will be working as an SRE in our newly formed SRE managed services group.
SRE looks for creative ways to automate and secure our environments. SRE is a mindset and a set of engineering approaches to running better production systems. Much of our managed services work focuses on optimizing existing environments for our customers, building highly scalable infrastructure, and eliminating manual work through automation. You, as a software developer, are expected to use a variety of tools, including Kubernetes, Jenkins, Prometheus, Grafana, and more, to orchestrate these complex systems, ensure operational stability, and increase reliability. You will use your experience with languages and platforms such as C, C++, Java, Python, Node.js, JavaScript/TypeScript, and Go to build custom tooling and to modify application code to improve operational stability and ease of use. You will also be working in a dynamic multi-cloud/hybrid-cloud environment that includes AWS, Azure, GCP, and VMware. The ideal SRE wants to limit time spent on operational work, proactively identifies potential ways our systems can fail, and values a blameless post-mortem when incidents occur.
Reliability is at the heart of our promise to our customers, so the SRE role is at the heart of our technical team. We're always on call to keep our environments up and running, ensuring our investors reliably earn staking rewards. You will be responsible for designing, implementing and maintaining these systems alongside other members of the support organization.
Responsibilities:
-Learn new technologies
-Implement software development practices and maintain operational code implementing SSDLC concepts
Minimum Qualifications:
Preferred Qualifications:
All your information will be kept confidential according to EEO guidelines.