Skip to main content

Data Scientist/Engineer

Full-Time

Remote

Apply Using Form Below

 

Overview:

Join Azra AI on its mission to improve healthcare through innovative applications of natural language processing (NLP). At Azra AI, we enable health systems to enhance clinical workflows by analyzing pathology and radiology reports in real-time, identifying the presence and type of cancer, and automating registry abstraction through text extraction. These reports are presented to clinicians in an intuitive workflow tool, allowing them to provide timely care to patients while focusing on what they do best—saving lives.

Your Adventure at Azra AI:

The Data Scientist/Engineer will combine data science expertise with data engineering skills to build scalable data pipelines and perform complex data analysis. This hybrid role requires strong data wrangling capabilities, along with the ability to create and optimize AI/ML models. The role involves working on healthcare data, ensuring clean, structured data is available for AI model training and real-time analytics.

Key Responsibilities:

  • Build and maintain robust data pipelines for processing large healthcare datasets.
  • Perform data analysis and feature engineering to support AI/ML model development.
  • Design and implement AI models to solve specific healthcare problems (e.g., oncology, radiology).
  • Collaborate with MLOps and engineering teams to deploy models into production.
  • Ensure data integrity, security, and scalability in all engineering processes.
  • Work with structured and unstructured data from diverse sources (e.g., EMRs, clinical notes, radiology reports).
  • Monitor and improve model performance based on real-world data.
  • Use data visualization techniques to present insights and recommendations to stakeholders.

Qualifications:

  • Bachelor’s or Master’s degree in Data Science, Engineering, or a related field.
  • 5+ years of experience in data science and data engineering.
  • Expertise in Python, SQL, and cloud platforms (preferably GCP).
  • Strong understanding of data processing frameworks (Apache Beam, Spark) and database systems (BigQuery, Postgres).
  • Experience in building and deploying machine learning models.
  • Familiarity with healthcare data formats and data governance is a plus.
  • Strong problem-solving skills and attention to detail.