Ehud Karavani

Modified

May 4, 2024

Email LinkedIn Scholar Medium GitHub

About me

Highly skilled in causal inference, machine learning, (Bayesian) statistics, and data visualization. An applied researcher and data scientist, I spend my time between building reusable tools for research and putting them into use. Advocating Clean Code for research code. Strong preference for eclectic, collaborative environments.

Experience

2017 – present

Research Staff Member
Causal Machine Learning for Healthcare and Life Science, IBM-Research, Israel

  • Creator of causallib – a one-stop-shop open-source Python package for flexible causal inference modeling.

    • Received an IBM Research Accomplishment award (2023)
  • Individual Contributor (IC)
    Causal inference consultant for Research labs globally

  • Project leader: drug repurposing by applying high-throughput causal inference to observational healthcare data

    • Managing a team of researchers.
      Leading the scientific pipeline, system design, and visualization app

    • Generating 100s of hypotheses in minutes

    • Served several external engagements with pharma clients, bringing millions in revenue

  • Organized the 2018 Atlantic Causal Inference Conference Data Challenge

  • Mentored interns and students.
    Onboarding lead

  • Published papers and issued patents

2022

Applied Statistician
Laboratory for Gait & Neurodynamics, Ichilov Hospital

  • Bayesian hierarchical/multilevel models and causal inference for gait analysis in multiple sclerosis patients
2016 – 2017

Teaching Assitant
The School of Computer Science, Hebrew University

  • Introduction to Data Science

  • Workshop in Computational Bioskills

2015 – 2016

Research Associate / Computational Biologist
Institue for Medical Research Israel-Canada, Hebrew University, Faculty of Medicine

  • Large-scale RNA analysis for finding high-resolution protein-RNA interactions

Education

2016 – 2019

M.Sc. in Computer Science and Computational Biology
Faculty of Science, the Hebrew University of Jerusalem, Israel

Thesis: quantifying the utility of embryo selection using genomic prediction of traits
published in Cell

2013 – 2016

B.Sc. in Computer Science and Computational Biology
Faculty of Science, the Hebrew University of Jerusalem, Israel

  • Dean’s List of Academic Excellence (2016)

Bachelor’s thesis published in Nucleic Acids Research

Skills

Programming skills
  • Python scientific stack (fluent)

    • Scikit-Learn, Pandas, Statsmodels, Seaborn (objects), Matplotlib, Altair, Streamlit, Pytorch, Keras, cvxpy, PyMC, Bambi, Arviz…
  • R (when needed)

  • Git + GitHub

  • Continuous integration (Travis, GitHub Actions)

  • Linux and remote development (Cloud/AWS + Jupyter lab / VS Code)

Languages
  • Fluent English

  • Native Hebrew

General
  • Data enthusiast: past DataHack mentor and judge

  • Musician 🎸, hiker / backpacker 🏔️ | | |

  • Friendly 🙂

Awards

2023

IBM-Research Accomplishment

For my work on causallib and research engagement with the Cleveland Clinic Foundation.

2019

Best of RSNA

For the paper Predicting Breast Cancer by Applying Deep Learning to Linked Health Records and Mammograms, published in Radiology.

2019

Best Talk: Israeli Population Genetics Meeting

For the paper Screening Human Embryos for Polygenic Traits has Limited Utility.

2019

Featured Theory of the issue (Cell)

For the paper Screening Human Embryos for Polygenic Traits has Limited Utility.

Publications

Date Title Venue DOI
2024 Improving Inverse Probability Weighting by Post-calibrating Its Propensity Scores Epidemiology https://doi.org/10.1097/ede.0000000000001733
2023 Causalvis: Visualizations for Causal Inference CHI: Conference on Human Factors in Computing Systems https://doi.org/10.1145/3544548.3581236
2023 FairPRS: adjusting for admixed populations in polygenic risk scores using invariant risk minimization Pacific Symposium on Biocomputing https://doi.org/10.1142/9789811270611_0019
2021 Trends in clinical characteristics and associations of severe non-respiratory events related to SARS-CoV-2 MedRxiv https://doi.org/10.1101/2021.03.24.21251900
2019 Screening human embryos for polygenic traits has limited utility Cell https://doi.org/10.1016/j.cell.2019.10.033
2019 A discriminative approach for finding and characterizing positivity violations using decision trees Arxiv https://doi.org/10.48550/arXiv.1907.08127
2019 Predicting breast cancer by applying deep learning to linked health records and mammograms Radiology https://doi.org/10.1148/radiol.2019182622
2019 An evaluation toolkit to guide model selection and cohort definition in causal inference Arxiv https://doi.org/10.48550/arXiv.1906.00442
2019 Comment: causal inference competitions: where should we aim? Statistical Science https://doi.org/10.1214/18-STS679
2018 In vivo cleavage rules and target repertoire of RNase III in Escherichia coli Nucleic Acids Research https://doi.org/10.1093/nar/gky816
2018 Benchmarking Framework for Performance-Evaluation of Causal Inference Analysis Arxiv https://doi.org/10.48550/arXiv.1802.05046
No matching items

May go out of date. Please see my Google Scholar page for the most up-to-date information.