Senior Data Engineer

US-OH-Columbus

Global Data Consultants

Req #: 3912
Type: Hourly
logo

GDC IT Solutions

Connect With Us:
Connect To Our Company
				Overview:

GDC IT Solutions is currently seeking a Senior Data Engineer to work in a hybrid capacity, with 2 days on-site per week at the Ohio Department of Medicaid in Columbus, Ohio. This position requires candidates to be located within commuting distance of Columbus and offers a collaborative work environment focused on managing and automating public health data using modern cloud and machine learning platforms.

Responsibilities:

* Collect and manage public health data from national sources including US Census and CDC via APIs.

* Ingest and curate public health data from State of Ohio agencies using the Innovation Ohio Platform (IOP).

* Develop and maintain git-based projects within the IOP Cloudera Machine Learning Environment (CML) to manage and version control data pipelines.

* Build and automate data collection and ingestion workflows using Python-based Jupyter Notebooks and CML services.

* Create and update Apache HIVE tables for curated public health data sets to support analytics and reporting.

Qualifications:

* Proven experience collecting and managing public health data from public sources such as US Census and CDC using APIs.

* Experience working with public health data from State of Ohio agencies through the Innovation Ohio Platform.

* Hands-on experience with git-based project management within the Cloudera Machine Learning Environment (CML).

* Proficient in Python programming, especially developing automation jobs with Jupyter Notebooks.

* Experience with Apache HIVE or similar data warehouse technologies.
			
Share this job: