Senior AI Platform and Reliability Engineer

US-CO-Littleton

Attract-careers1

Req #: 97484
Type: Fulltime-Regular
logo

EchoStar

Connect With Us:
Connect To Our Company
				Overview:

Our Technology teams challenge the status quo and reimagine capabilities across industries. Whether through research and development, technology innovation or solution engineering, our team members play a vital role in connecting consumers with the products and platforms of tomorrow.

Responsibilities:

Candidates must be willing to participate in at least one in-person interview, which may include a live whiteboarding or technical assessment session.

This role addresses the critical challenge of transitioning from reactive, threshold-based monitoring to a proactive, self-healing infrastructure. The primary focus is bridging the gap between experimental machine learning and production-grade software by architecting resilient systems that orchestrate Large Language Models and agentic workflows. By leveraging predictive intelligence and deep observability, this position ensures the stability and efficiency of high-traffic order provisioning and device activation journeys in complex cloud environments.
What Success Looks Like (Objectives)
* Architect and deploy self-healing infrastructure that moves operations toward predictive intelligence and autonomous remediation

* Design high-scale AI orchestration workflows and RAG pipelines to solve real-world business problems while optimizing for token costs and performance

* Automate the resolution of recurring Tier-1 and Tier-2 incidents using Runbooks-as-Code to reduce manual toil and improve system reliability

* Lead the fine-tuning and evaluation of open-source models to ensure high-quality data retrieval and minimize hallucination rates in production

* Establish AI engineering best practices through comprehensive code reviews and mentorship of junior engineers transitioning to AI-native development

* Implement full-stack observability across the customer order journey to diagnose and resolve complex provisioning issues before they impact the user experience

Qualifications:
Core Skills and Competencies (What you'll bring)
* Expert proficiency in Python and Java to develop production-grade backend systems and complex AI microservices

* Deep technical expertise in Generative AI architectures, including Transformer models, prompt engineering, and the integration of Large Language Models

* Critical experience architecting Retrieval-Augmented Generation (RAG) pipelines and managing unstructured data within vector databases like Milvus

* Advanced knowledge of observability platforms such as Dynatrace and AWS CloudWatch to drive data-driven decision-making and system performance

* Demonstrated ability to design and implement automation tools using Python, Ansible, and Bash to achieve autonomous infrastructure remediation

* AI literacy and the capacity to innovate by applying emerging AI-Ops tools and predictive intelligence to traditional site reliability challenges

Additional Qualifications
* Contributions to open-source AI projects or a documented portfolio of Agentic AI applications

* Professional certifications in Cloud AI (e.g., AWS Certified Machine Learning - Specialty)

* Experience building scalable applications leveraging IBM Watsonx.ai and/or AWS Bedrock, ensuring our AI solutions are governed, secure, and performant
Minimum Requirements
* Minimum Education: Bachelor's Degree in Computer Science, Information Technology, or a relevant field (or 5+ years of equivalent software engineering experience)

* Minimum Experience: 5+ years of experience in backend or full-stack software development, with at least 2 years focused on AI-powered development

* Required Technical Skills:

* Critical experience in Python-based API orchestration and custom tool development

* Hands-on expertise with AWS cloud services, Serverless architectures, and Kubernetes

* Proven track record in AI-driven software development with 2+ years of specialized experience in architecting and deploying autonomous AI agents and intelligent bot frameworks
			
Share this job: