ETL AWS Data Engineer

Remote
Full Time
Mid Level

 

About BAO Systems

BAO Systems is an industry leader in digital data solutions for health and development. We empower our partners to implement scalable and sustainable solutions that uncover data-driven insights to improve livelihoods, strengthen health systems, and achieve equitable human development.

 

Our team comprises passionate public health and development practitioners, information system experts, software engineers, system engineers, monitoring and evaluation advisors, and data scientists. We excel in providing a broad spectrum of services and products. For more information, please visit www.baosystems.com

 

Purpose of the Role

We are looking for a motivated Data Engineer to join our team. The ideal candidate is a domain expert who combines broad technical skills with deep industry knowledge and business acumen, and who designs and implements cloud-native technical solutions to deliver data and analytics platforms to end users.

Core Tasks and Responsibilities

  • Build, test, and maintain ELT/ETL data pipelines
  • Develop custom scripts to automate repetitive tasks
  • Develop/assess solutions for automated data validation
  • Understand and support cloud storage and cloud computing tools
  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
  • Build the infrastructure required for extraction, transformation, and loading of data from a wide variety of data sources with emphasis on increased data automation (automated cataloging, tagging, statistics, lineage...)
  • Design and implement flexible, open-source ETL/ELT patterns and solutions to support customers looking for alternatives to AWS and Azure
  • Build analytics tools that utilize the data pipeline to provide actionable insights into customer behaviors, operational efficiency and other key business performance metrics
  • Develop solutions that favor:
    • Reduced data latency
    • Reduced data movement
    • Traceability
    • Enhanced security, PII protection
    • Reusability
    • Reproducibility
  • Assess and recommend the use of processes, technologies, and frameworks (e.g., differential security, NLP, graph databases)

 

Required Qualifications

  • Bachelor's degree and 5+ years of experience in monitoring and evaluation (M&E), data analysis, data engineering, or a related field
  • Proven programming skills in SQL, Python, and Java
  • Understanding of data set management, JSON, schema definition languages, and serialization libraries and patterns
  • Understanding of enterprise schema registries
  • Experience using AWS technologies, including S3 and Redshift
  • High-level experience in methodologies and processes for managing large-scale databases
  • Demonstrated experience in handling large data sets and relational databases
  • Ability to work with diverse stakeholders to assess potential risks
  • Ability to translate business requirements into technical specifications

 

Desired Qualifications

  • Programming in R and Scala; experience with NoSQL databases
  • Experience using Azure and off-the-shelf technologies

 

Classification

Full-time

 

This position is contingent on contract award.

 

BAO Systems provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws.
