Data Engineer

logo

Data Engineer

Ryde

icon Fairfax, VA, US, 22030

icon28 June 2024

Apply Now

Position: Data Engineer

Location: Fairfax, VA

Clearance: TS/SCI

Education: Bachelor’s Degree Applied Data Engineering or Science, engineering, and Computer Science fields of study, such as statistics, mathematics, data analytics, data science, computer science or other technology related fields.

Outcomes:

The successful candidate is expected to accomplish the following outcomes in the first year on the position:

  • Develop and implement data engineering eco-systems and architectures that enable data Extraction, Transformation and Loading (ETL) operations for predictive and prescriptive data modeling.
  • Ability to perform root cause analysis on external and internal processes and data to identify opportunities for improvement and answer questions.
  • Build networking connections and opportunities for data acquisition.

Responsibilities:

The Combatant Command Intelligence Enterprise Management Support Office (CCI EMSO) requires a Data Engineer to implement methods to improve data reliability and quality. The Data Engineer will combine raw data from different data sources to create consistent and machine-readable formats that will be passed to the Data Science Team for Data Science Analysis and Studies, with a specific focus on creating and engineering data pipelines that can transfer the necessary Data Sets via Application Programming Interfaces (APIs), Resilient Distributed Datasets (RDD) and direct interface with the data source to the CCI EMSO data repository. The Data Engineer shall develop and implement data engineering eco-systems and architectures that enable data Extraction, Transformation and Loading (ETL) operations for predictive and prescriptive data modeling. Data Engineering builds will include, but not be limited to, establishment of a Data Lake and Eco-System which can move data from the DevSecOps to Data Science Production and Analysis Environment. The primary responsibilities of the CCI EMSO Data Engineer will be analyzing raw data, developing, and maintaining a repository of CCI EMSO specific data sets, and improving data quality and efficiency. CCI EMSO provides OSD senior leaders with data driven analyses using the scientific method to enable senior leadership decision making to deliver modernized capabilities that ensure the Combatant Commands are equipped with modern systems and the Joint Force is aligned with future Joint Warfighting Concepts.

Specific responsibilities include but are not limited to:

  • Develop, construct, and deploy data lake and ecosystems infrastructure that will include a DevSecOps and production environment.
  • Analyze and organize raw data for Data Science analysis.
  • Build Data Systems and data transfer pipelines to ensure the CCI EMSO Data Science Team has the requisite data to answer Combatant Command, ISREC and ISPR Data Science Requests for Information (RFI).
  • Evaluate CCI EMSO mission and vision needs and objectives and codify a data governance plan that supports them.
  • Prepare data for prescriptive and predictive modeling.
  • Build algorithms and prototypes to meet the objectives and requirements set by the Combatant Commands, ISREC and ISPR.
  • Combine raw data from different sources and normalize into import ready data sets to pass to the CCI EMSO Data Science Team for data science analysis.
  • Draft, develop and formalize a data governance plan, in conjunction with the CCI EMSO data scientists, that enhances data quality and reliability.
  • Identify networking connections and opportunities for data acquisition.
  • Work with stakeholders including data, design, product, and executive teams and assisting them with data-related technical issues.
  • Identifying, designing, and implementing internal process improvements including re-designing infrastructure for greater scalability, optimizing data delivery, and automating manual processes.

The candidate must have the following qualifications:

  • A minimum of 10 years of experience (including demonstrated expert ability to use statistical software programs (e.g., R, Python, Weka, Apache Spark, and SQL).
  • Ability to build and optimize data sets, ‘big data’ data pipelines and architectures as required by the CCI EMSO mission and vision and ISREC and ISPR Strategic Guidance.
  • Ability to perform root cause analysis on external and internal processes and data to identify opportunities for improvement and answer questions.
  • Excellent analytic skills associated with working on unstructured datasets.
  • Experience in supporting technical, managerial, or operational fields and mature judgement required to interface with external stakeholders and senior government personnel.
  • Strong organizational, oral, and written communication skills are required.
  • Ability to build processes that support data transformation, workload management, data structures, dependency, and metadata.
  • Strong interpersonal skills and the ability to build consensus, work effectively and independently, and demonstrate a consultancy mindset towards customer engagement.
  • Experience navigating experimental environments and making recommendations on risk management while maximizing innovation.
  • Thorough knowledge of research designs, particularly systems thinking and/or design thinking.
    • Have a working knowledge and experience in working in an Organizational Program Maturity business model, with the ability to move between Agile and standard Defense Acquisition University (DAU) and Project Management Institute (PMI) Program Management processes and methodologies.
  • Ability to effectively communicate complex, multi-disciplinary ideas, and insights.
  • Ability to translate complex, technical findings into an easily understood narrative (i.e., tell story with data).
  • Analytical and critical thinking skills, including superior ability to think strategically.
  • Demonstrated experience using analytic methods and methodological tools in mathematics or statistics.

The following qualifications are desired:

  • Data Science Council of America (DASCA) Associate Big Data Engineer (ABDE).
  • Databricks Certified Data Engineer Professional
  • PMI Agile Certified Practitioner (PMI-ACP)
  • PMI Project Management Professional (PMP) (or DAU Equivalent)

Travel: Some travel may be required