Shape the Future of Healthcare AI with Azra
Are you passionate about building AI that saves lives? Do you want to push the boundaries of machine learning by creating systems that act, learn, and adapt in the real world?
Azra AI is looking for a Principal Data Scientist to lead the next generation of healthcare AI—expanding beyond NLP into imaging, time-series signals, and eventually agentic intelligence. We’ve already transformed how cancer is identified by analyzing millions of pathology and radiology reports. Now, we’re deepening our impact in oncology and expanding into cardiology, neurology, and medical imaging.
If you’re experienced in fine-tuning foundation models, deploying production-grade AI in complex clinical environments, and excited by the potential of agentic AI systems, this is your opportunity to drive innovation with real-world purpose.
What You’ll Do
Lead and Architect AI Models
- Own the end-to-end development of deep learning and machine learning models for healthcare use cases.
- Design and fine-tune transformer-based NLP models (e.g., MEGA, BERT, BioBERT) for NER, information extraction, classification, summarization, and generative tasks.
- Build state-of-the-art models for medical image analysis (X-ray, CT, MRI) using CNNs, ViTs, and 3D architectures (e.g., YOLO, EfficientNet, VGG, ViT).
- Incorporate semi-supervised, weakly-supervised, and transfer learning techniques to improve performance with limited labeled data.
- Implement advanced strategies such as RAG, GraphRAGs, and long-context modeling for clinical reasoning.
Build Towards Agentic AI
- Collaborate with leadership to define the roadmap for agentic AI systems—autonomous agents that make decisions, take action, and learn continuously from real-world data.
- Innovate in prompt engineering, autonomous task orchestration, and clinical decision frameworks.
Expand into Multi-Modal Healthcare Data
- Work with structured and unstructured data from multiple domains:
- Pathology and radiology reports
- Cardiology: both clinical narratives and potentially ECG signals (requiring signal processing knowledge)
- Neurology: text-based reports as well as EEG signals and potentially fMRI scans
- Support multimodal modeling combining text, image, and signal data.
Drive Production-Grade MLOps
- Own robust CI/CD pipelines for ML/DL model training, validation, deployment, and monitoring.
- Ensure scalability, observability, and compliance across hospital deployments.
- Build model monitoring, quality control, and orchestration workflows to manage complex inference pipelines.
Work Cross-Functionally
- Collaborate with engineering, product, and clinical teams to turn models into intuitive, trustworthy features.
- Contribute to a high-performing culture driven by curiosity, speed, rigor, and impact.
What We’re Looking For
Required Experience
- 8+ years in machine learning, deep learning, or data science, with a focus on healthcare or clinical applications.
- Strong experience with transformer-based NLP models and medical image models.
- Hands-on experience with semi-supervised, weakly-supervised, and transfer learning techniques.
- Proficient in Python, especially with PyTorch, Hugging Face, spaCy, scikit-learn, and pandas.
- Production experience in GCP (Vertex AI, BigQuery, Cloud Storage) or similar cloud platforms.
- Background in MLOps: model deployment, lifecycle management, and monitoring in production.
- Experience working with clinical data (radiology, pathology, EHR/EMR), including HIPAA compliance and data privacy.
Bonus Points
- Experience with agentic AI systems or autonomous task agents.
- Experience preparing CI/CD pipelines for ML/DL models.
- Familiarity with Streamlit or lightweight UIs for internal model monitoring and visualization.
- Strong foundation in medical imaging, including 2D, 2.5D, and 3D processing techniques.
- Signal processing expertise for ECG, EEG, or other biomedical time-series data.
- Familiarity with Docker, GitLab, and relational databases (PostgreSQL, MySQL).
Qualification:
Bachelor’s or Master’s degree in computer science or related field.
PhD in data science is a plus.
Why Join Azra AI?
You won’t just be training models—you’ll be advancing the frontiers of healthcare. You’ll join a mission-driven team where your ideas matter, your models are used in real hospitals, and your work can directly save lives. Join us. Build AI that matters.