View Our Website View All Jobs

Natural Language Processing Data Scientist

Passionate about making a difference in the world of cancer genomics?

With the advent of genomic sequencing, we can finally measure and process our genetic makeup. We now have more data than ever before but providers don't have the infrastructure or expertise to make sense of said data, let alone use their extensive patient charting to complement the data achieved through genome sequencing. Here at Tempus, we believe that the wholistic approach for the detection and treatment of cancer lies in the deep understanding of molecular activity coupled with the ability to use the latest NLP and predictive modeling techniques to extract information and insights from the patient’s chart.

Our Natural Language Processing Data Scientist will use state of the art techniques to process and analyze vast amounts of clinical data in a way it has never been done before. They’ll also help create a highly scalable infrastructure to house billions of records from the ground up. We’re looking for someone who will collaborate with product, research, and business development teams to develop the most advanced data fusion platform in cancer care.

What you'll do:

  • Design and develop clinical NLP methods that ingest large unstructured clinical data sets, separate signal from noise, and provide personalized insights at the patient level that directly improve our analytics platform
  • Develop innovative methods for processing and storing data
  • Interrogate analytical results to resolve algorithmic success, robustness and validity


  • PhD or equivalent work experience in statistics, computer science, bioinformatics or related field
  • Experience with a variety of NLP methods for information extraction, topic modeling, parsing, and relationship extraction
  • Familiarity with developing, deploying, and maintaining production NLP models with scalability in mind
  • Experience with knowledge databases and language ontologies
  • Quantitative training in probability, statistics and machine learning
    • Classical statistical tools, machine learning algorithms, ensemble methods
  • Analytical development and programming skills
    • Python, R, Javascript, or Lua
    • Reproducible research methods
  • Experience in genomics is a plus, especially experience with next-generation sequencing data processing and modeling
  • Goal-oriented thinking
  • Great problem solving skills
  • Self-driven and works well in an interdisciplinary team with minimal direction
  • Experience with communicating insights and presenting concepts to a diverse audience
Read More

Apply for this position

Apply with Indeed
Attach resume as .pdf, .doc, or .docx (limit 5MB) or Paste resume

Paste your resume here or Attach resume file