Education:
• Candidates with a degree in computer science or related field; or a degree in the chemistry disciplines with strong programming capabilities.
Experience
4-6 years of relevant experience
Must have/required skill
Python 3.9+ software development, including:
• Cloud Services – AWS (Lambda Functions, S3, Cloud Formation Templates, RDS, ECR)
• Development of ETL Processes / Data Workflows / Data Pipelines / Data Wrangling / Data Ingestion.
- Packages: Boto3, Pandas, pyodbc, openpyxl
- Virtual environments: conda
- IDEs: Visual Studio Code or PyCharm
- Experience with software design, development, and testing (unit and system testing).
- Proficiency with version control (Git, GitHub) and CI/CD workflows (GitHub Actions).
- Strong database skills (relational databases, SQL, data modeling and design).
- Familiarity with multiple file formats (XLSX, YAML, JSON, CSV, TSV).
- Excellent verbal and written communication skills.
- Ability to work independently and collaboratively in team settings.
- Demonstrated drive for continuous improvement and innovation in data workflows.
Preferred:
- Additional AWS cloud services experience: SQS, DLQ, SNS, EventBridge, API Gateway.
- Python packages (Cerberus, PyYAML, logging), linters, type hints, and regular expressions.
- Experience with data pipeline tools such as Dataiku or Trifacta.
- Previous IT or data engineering experience in pharmaceutical research.
- Analytical or genomics experience related to scientific data generation and interpretation.
Education:
Bachelor’s degree in Computer Science or related field; OR
Bachelor’s degree in Chemistry (or related discipline) with strong programming capabilities.