At Oracle’s Health Data Intelligence and Life Sciences organization, we are focused on transforming healthcare and improving the health of people globally. Whether it is helping bring new therapies to market faster or enhancing care delivery, we are transforming the experience of providers, payers, and life sciences organizations. The data platform team (Oracle Health Data Services - OHDS) is foundational to the software products being built and serves the critical role of transforming, enriching, persisting, and serving large-scale health and life sciences data for AI, agentic, and application use cases.
The OHDS team is looking for a Principal Software Developer to drive the data platform forward. In this role, you will collaborate with software architects on system design, serve as the technical leader responsible for component design, and implement and deliver projects involving scalable data processing, storage, and retrieval for agentic and traditional workloads. Partnering with product management, applied science, and multiple development teams, you will be responsible for building software systems and for end-to-end delivery that improves customer outcomes. You will drive operational excellence and best practices that enable us to deliver highly scalable and reliable cloud-hosted software services.
Responsibilities
Implement key components in a multi-tenant data platform serving AI, agentic, and traditional analytics workloads, including ingestion, transformation, indexing, storage, and serving layers.
Design components, and implement and deliver features that increase adoption of the data platform by AI-first and agent-based architectures.
Apply strong technical knowledge to resolve complex issues and diagnose problems inherent in the design or implementation of technologies.
Ensure platform architectures and operations meet standards for scale, reliability, resilience, security, compliance, and cost efficiency in regulated healthcare environments.
Qualifications
- 7+ years of relevant experience; BS or MS degree in CS, or equivalent experience relevant to the functional area.
- Highly proficient in Java, Python, or similar languages.
- Strong understanding of cloud concepts and cloud-native services on OCI, AWS, Azure, or GCP, and the ability to apply this knowledge to developing and running cloud-hosted software solutions.
- Strong knowledge of distributed storage systems including data warehouses and lakehouse/table formats.
- Hands-on experience with streaming, change data capture (CDC), and large-scale batch processing (Spark, Flink, etc.).
Preferred Qualifications
Experienced with:
Multi-modal persistence patterns such as relational, document, vector, and graph storage.
Search (lexical, vector, hybrid) and retrieval (MCP) for AI use cases.
Applied LLM/NLP for extraction, entity resolution/linking, and enrichment.
Data governance and compliance: classification/tagging, lineage, retention, audit logs, access controls, and secure handling of PHI/PII.