Ryde Technologies is looking for a Sr. Data Architect to support our government customer located in Albuquerque, NM.
The role of the Sr. Data Architect is to collaborate with data scientists, researchers, users, modeling and simulation engineers, user experience designers, software engineers, digitizers, cataloguers, and system administrators to design a system to collect, manage, and convert raw data from the DTRIAC archive into usable information for U.S. deterrent missions.
The Sr. Data Architect will answer:
- What variables should be stored?
- What data quality issues might arise?
- What database options are best for near term and long term?
- What is the design of the data management system?
- What system design will meet customer requirements?
Our customer maintains a repository of over 20,000 films, 2,000,000 photographs, and 60,000,000 pages of documents dating back to the Manhattan Project. They're working to make the repository digitally accessible to DTRA deterrent mission researchers in a way that provides knowledge and meaning, using Machine Learning (ML), Artificial Intelligence (AI), and data science to return relevant search results to user queries.
DESCRIPTION OF RESPONSIBILITIES:
- Design tools and methodologies to process the digital collection in a production mode.
- Develop and implement methods for, and configurations of, the Data Lake to support navigation, search, insertion, or extraction of information or files by the government or other performers without requiring proprietary software, tools, or data other than widely available commercial-off-the-shelf (COTS) tools, and software that can be authorized for use on government IT systems.
- Develop, maintain, and improve capabilities, such as scripting, to efficiently perform maintenance, synchronization, and production processing of data in the Data Lake on Windows- and Linux-based IT systems, including HPCMP clusters.
- Implement, configure, perform functional testing, and operate the data and applications of the Advanced Search and Discovery (ASD) environment as a hosted capability on government IT systems.
- Leverage the collection, capabilities, and team to perform targeted analyses and studies and to provide dedicated support to missions and end users.
- Create documentation or training materials for Project Products.
- Support integration or hosting of capabilities or products on government IT systems.
- Hold and participate in Gate Reviews.
- Other duties as assigned.
REQUIRED DEGREE/EDUCATION/CERTIFICATION:
Current Security+ Certification or equivalent required.
REQUIRED SKILLS AND EXPERIENCE:
- 5+ years relevant experience.
- Experience with databases, especially NoSQL and Elasticsearch.
- Experience with DevSecOps.
- Experience building and working with data pipelines and large data sets.
- Experience with schema design and data modeling.
- Deep understanding of algorithms and efficient data structures.
DESIRED SKILLS AND EXPERIENCE:
- Experience using revision control software such as Git.
- Experience using CI/CD tools such as GitHub.
- Experience with containerization using Docker, Podman, or Apptainer/Singularity.
- Experience with ML/DL, particularly NLP.
- Experience with OCR.
- Experience with computer vision/image processing.
- Proficiency in Python.
REQUIRED CITIZENSHIP AND CLEARANCE:
- Must be a U.S. citizen.
- Active Secret or DOE Q Clearance required.