## 🌟 Data Science in Biotech Research Eric J. Ma Moderna --- ## 📝 About me > "Sr. Principal Data Scientist, DSAI (Research) @ Moderna" - Sr. Principal Data Scientist: Data science team lead - DSAI: Computer nerd - Research: Science nerd --- ## 📅 Today > Explore what it takes to build a successful data science team in a biotech/pharma research organization. --- ## 🔍 Scope - Data science team - Biotech/pharma - Research organization --- ## 🎨 Format ```python themes = ["mission", "problem classes", "value delivery", "challenges"] for theme in themes: discuss() ``` --- ## 🌀 Format (cont'd) ```python def discuss(): preface() small_group_discussion() large_group_discussion() ``` --- ## 💡 Exchange of ideas - Discussion over lecturing - Pragmatism over ideology - Clarity over sophistry - Commonalities and uniqueness --- ## 🤝 I am here to engage in two-way learning with everyone --- ## 📋 Agenda 1. Mission 2. Problem classes 3. Value delivery 4. Challenges --- ## 📒 Notes hackmd.io/
@ericmjl
/
ds-biotech-org
--- ## 🎯 Mission > Why does your team of data scientists exist? ---- > Why does your team of data scientists exist? - Charter? - North star? --- ## ❓ Problem classes > Biotech/pharma operates on the holy trinity of indication-target-molecule - Indication: disease - Target: hypothesized gene/protein/RNA/cell/tissue that, if perturbed, will reverse the course of disease - Molecule: the entity that we introduce to perturb the target. --- > What problems does your team solve? ---- > What problems does your team solve? - In scope? - Out of scope? ---- ## 💊 Therapeutic hypothesis > {{ molecule }} will hit {{ target }} to reverse the course of {{ indication }} ---- > What methodologies? ----
domain
$\times$
methodology
----
domain
$\times$
methodology
| |
mRNA
|
Proteins
|
LNPs
| |---|------|----------|------| |
AI library design
| ✅ | ✅ | ✅ | |
Computer vision
| ✅ |❌ | ✅ | |
Probabilistic models + custom algorithms
| ✅ | ✅ | ✅ | --- ## 🌐 Value delivery > What is the value of the problems that your team solves? ---- > What is the value of the problems that your team solves? - Return on investment (ROI)? - Priority level within the company? What examples do you have, to the extent that it can be revealed? ---- > What frameworks can we use to estimate the ROI of our work? - Time saved by laboratory scientists vs. time needed to build solution - Cost of experiments that would otherwise needed to be done - Replacement of vendor software costs What other questions would you ask? --- ## 🚧 Challenges > What challenges will one face when building out a new data science team within a new biotech's research organization? ---- > Laboratory research processes are unstable, generating non i.i.d. data. Examples: - Condition scouting in antibody evolution. - Cell type changes in protein efficacy assays. - Change in vendor supplier for molecules. How do we overcome this challenge? ---- > Do we need a team of unicorns? Or do we need a team to cover the unicorn skillsets? - Scientific domain knowledge - Mathematical modelling skill - Software development skills - Communication skills - People skills Reference [essay on hiring](https://ericmjl.github.io/essays-on-data-science/people-skills/hiring/). ---- > Time is ticking; how do we demonstrate value early on? ---- > Research is a revenue-consuming organization that takes years to deliver a new product. How do we demonstrate that there is value delivered by the data science team in the interim? --- ## ⭐️ Thank you