Responsibilities:-
- Create and maintain optimal data pipeline architecture for enterprise big data and analytics in a cloud based big data environment
- Design and assemble large, complex data sets that meet functional / non-functional business requirements.
- Implement internal process improvements in data analytics: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
- Build the infrastructure and workflows required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and 'big data' technologies.
- Build analytics applications that utilize the data pipeline to provide actionable insights into key business performance metrics.
- Create tools and processes to keep sensitive data, including PHI and PII data separated and secure across clients.