Senior Engineer - Software

US-CO-Littleton

Attract-careers1

Req #: 96268
Type: Fulltime-Regular
logo

EchoStar

Connect With Us:
Connect To Our Company
				Overview:

Our Technology teams challenge the status quo and reimagine capabilities across industries. Whether through research and development, technology innovation or solution engineering, our team members play a vital role in connecting consumers with the products and platforms of tomorrow.

Responsibilities:

Candidates must be willing to participate in at least one in-person interview, which may include a live whiteboarding or technical assessment session.

We are looking for a Sr. AIOps Engineer to architect and champion our "Self-Healing" infrastructure. This role is pivotal in moving our operations beyond traditional threshold-based monitoring and into the era of Observability and Predictive Intelligence. Your core mission will be to eliminate operational noise, automate Root-Cause Analysis (RCA), and ensure comprehensive 360-degree health monitoring of our end-to-end customer order journeys.

This role also involves providing enterprise-level assistance to our customers and monitoring applications deployed in AWS cloud environments. You will diagnose and troubleshoot issues related to order provisioning and device & SIM activations, utilizing email and chat applications to provide quick answers and resolution for clients' IT issues.

Key Responsibilities:

* Intelligent Observability: Design and maintain full-stack observability using Dynatrace and AWS CloudWatch, focusing on the "Boost Mobile" order journey
* Autonomous Remediation: Develop and deploy "Runbooks-as-Code" using Python and Ansible to automate the resolution of recurring Tier-1 and Tier-2 incidents
* Predictive Analysis: Leverage AI-driven anomaly detection to identify "silent failures" and performance bottlenecks before they impact the end-users
* Data-Driven Reporting: Transform raw operational logs into business-level insights for IT and Business Ops Executive leadership
* Monitoring: Take full ownership of customer-reported issues, monitoring alerts and operational dashboards to proactively identify availability, latency, performance, and capacity concerns through to resolution
* Automation: Develop and utilize automation scripts to streamline error handling, manage order fallouts, and improve operational efficiency
* Operational Support: Maintain a professional and positive support experience by communicating clearly with customers and vendors through phone, email, or chat This includes asking targeted questions and providing accurate, timely updates

Qualifications:

Education and Experience:

* Education: Bachelor's Degree in Computer Science, Information Technology or relevant field or 4+ years software engineering experience

* Experience: 4+ years in SRE or IT Operations with a proven track in AIOps. 2+ years experience in AI powered development using tools like Kiro

Skills and Qualifications:

* Strong ability to diagnose, troubleshoot, and resolve technical issues across computer systems and other technology products
* Excellent problem-solving skills with the ability to provide clear, step-by-step technical guidance both verbally and in writing
* Solid understanding of cloud infrastructure, particularly AWS, and container orchestration using Kubernetes
* Demonstrated fearlessness and curiosity to learn new technologies and tackle complex technical challenges
* Background in software development, with knowledge of modern programming languages, design principles, and coding best practices
* Proficiency in reading, writing, and debugging code across multiple programming languages to support troubleshooting and automation efforts
* Observability Stack: Proficiency in Dynatrace(Grail, Davis AI), Datadog, or similar.
* Cloud Fluency: Hands-on experience with modern AWS/GCP services (Serverless, Kubernetes, EC2)
* Automation/Scripting: High proficiency in Python or Go for API orchestration and custom tool development. Hands-on experience with Bash scripting
* AI/ML Basics: Understanding of vector databases, LLM integration (RAG), and basic data science concepts for tuning anomaly detection models
* Version Control: Strong "GitOps" mindset using GitLab for all infrastructure and alerting configurations

Visa sponsorship not available for this role
			
Share this job: