Data Engineer

US-VA-McLean

External

Req #: 6624
Type: Full-Time
logo

Steampunk

Connect With Us:
Connect To Our Company
				Overview:

We are looking for seasoned Data Engineer to work with our team and our clients to develop enterprise grade data platforms, services, and pipelines. We are looking for more than just a "Data Engineer", but a technologist with excellent communication and customer service skills and a passion for data and problem solving.

Responsibilities:

* Designing and leading the implementation of greenfield data solution stacks in the cloud or on premises, using the latest data services, products, technology, and industry best practices
* Architecting and leading the migration of legacy data environments with performance and reliability
* Data Architecture contributions include assessing, understanding, and implementing data sources, data models and schemas, and data workflows
* Data Engineering contributions include assessing, understanding, designing, and implementing ETL jobs, data pipelines, and workflows
* BI and Data Visualization contributions include assessing, understanding, designing, and implementing reports, selecting BI tools, creating dynamic dashboards, and setting up data pipelines in support of dashboards and reports
* Addressing technical inquiries concerning customization, integration, enterprise architecture and general feature / functionality of data products
* Experience in crafting and implementing data lakehouse solutions in the cloud (Preferably AWS. Alternatively, Azure, GCP). This includes relational databases, data warehouses, data lakes, and distributed data systems.
* Key must have skill sets - broad understanding of data exploitation lifecycle and capabilities, technical leadership in the data field
* Support an Agile software development lifecycle
* You will contribute to the growth of our Data Exploitation Practice!

Qualifications:

* 
* Must have current FPAC Clearance.  
* 8+ years industry experience coding commercial software and a passion for solving complex problems.
* 8+ years experience in Data Engineering or AI/ML with experience in tools such as:
* Big data tools: Hadoop, Spark, Kafka, etc.
* Relational SQL and NoSQL databases, including Postgres and Cassandra
* Data pipeline and workflow management tools: Azkaban, Luigi, Airflow, etc.
* AWS cloud services: EC2, EMR, RDS, Redshift 
* Data streaming systems: Storm, Spark-Streaming, etc.
* Search tools: Solr, Lucene, Elasticsearch
* Object-oriented/object function scripting languages: Python, Java, C++, Scala, etc.
* Amazon S3, Athena, Redshift Spectrum, AWS Glue, AWS Glue Catalog, AWS Functions, and Amazon EC2 with SQL Server Developer.

* Advanced working SQL knowledge and experience working with relational databases, query authoring and optimization (SQL) as well as working familiarity with a variety of databases.
* Experience with message queuing, stream processing, and highly scalable 'big data' data stores.
* Experience manipulating, processing, and extracting value from large, disconnected datasets.
* Experience manipulating structured and unstructured data for analysis.
* Experience constructing complex queries to analyze results using databases or in a data processing development environment.
* Experience with data modeling tools and process.
* Experience architecting data systems (transactional and warehouses).
* Experience aggregating results and/or compiling information for reporting from multiple datasets.
* Experience working in an Agile environment.
* Experience supporting project teams of developers and data scientists who build web-based interfaces, dashboards, reports, and analytics/machine learning models.
* Experience with SAP.
			
Share this job: