Principal Data Architect-Remote, USA
Ambry Genetics
- USA
- $156,000-180,000 per year
- Permanent
- Full-time
- Develop and maintain the overall data architecture strategy, roadmap, and implementation plans to support the company's data-driven initiatives and business objectives.
- Design and implement scalable, secure, and high-performance data architectures, including data warehouses, data lakes, and data pipelines, leveraging both on-premises and cloud technologies.
- Establish data governance policies, standards, and best practices for data management, data quality, data security, and data privacy across the organization.
- Lead the development and implementation of real-time data streaming solutions, including event-driven architectures, data ingestion, transformation, and consumption using technologies like Apache Kafka, Apache Flink, and AWS Managed Streaming for Kafka (MSK).
- Oversee the creation and maintenance of Business Intelligence (BI) platforms, data visualization tools, and self-service analytics capabilities to enable data-driven decision-making across the organization.
- Collaborate with cross-functional teams, including software engineers, data scientists, and business stakeholders, to understand data requirements, define data models, and provide technical guidance and mentorship.
- Lead and manage a team of data engineers, database administrators, and data analysts, fostering their professional growth, promoting best practices, and ensuring adherence to organizational standards and processes.
- Stay up-to-date with emerging technologies, industry trends, and best practices in data architecture, data engineering, and data management, and drive the adoption of new technologies and approaches within the organization.
- Ensure compliance with relevant regulations, such as HIPAA, and industry standards for data management and privacy.
- Participate in the review and approval of data architecture designs, technical specifications, and project plans to ensure alignment with organizational goals and best practices.
- Other duties as assigned
- Bachelor's or master’s degree in computer science, Information Systems, or a related field, or equivalent professional experience.
- Minimum of 8-10 years of experience in data architecture, data engineering, and data management, with a strong background in designing and implementing enterprise-level data solutions.
- Proven experience in leading and managing teams of data professionals, including data engineers, database administrators, and data analysts.
- Extensive knowledge of data warehousing, data lake architectures, data modeling, and data integration techniques, including ETL/ELT processes and tools.
- Strong proficiency in data streaming technologies, such as Apache Kafka, Apache Flink, and AWS Managed Streaming for Kafka (MSK).
- Experience with cloud platforms and services, such as AWS, Azure, or GCP, for data management and analytics solutions.
- Familiarity with data governance principles, data quality management, and data security best practices.
- Excellent problem-solving, analytical, and communication skills, with the ability to effectively communicate complex technical concepts to both technical and non-technical audiences.
- Experience in an Agile or Scrum development environment is preferred.
- Knowledge of genomics, healthcare data, or bioinformatics is a plus.
- Experience with Change Data Capture (CDC) techniques and tools for data replication and synchronization.
- Hands-on experience with AWS services like AWS Glue, Amazon Elasticsearch Service, Amazon Redshift, and data queuing systems like Amazon MQ (RabbitMQ) or Amazon SQS.
- Experience with managing and securing Protected Health Information (PHI) and other sensitive data in compliance with regulations like HIPAA.
- Knowledge of distributed systems architectures, including multi-region and cross-account infrastructure designs.
- Experience with caching technologies like Redis for improving application performance and scalability.