University of California, San Francisco: Doctor of Philosophy (PhD) | Distinguides Dissertation Award; Cognates in Machine Learning and Precision Health (2018); Fellow: Big Data Coursework for Computational Medicine (BDC4CM) in NYC, program supported by NIH grant R25-EB 020381
University of California, San Francisco: Training in Clinical Research (TICR) | School of Medicine, Department of Biostatistics & Epidemiology | Advanced coursework in biostatistics and supervised machine learning (2017)
University of California, San Francisco: Master of Science in Health Systems Leadership (2013)
DataCamp | Data Scientist in Python Certificates (2021-22)
Training in data management, importing, cleaning, manipulating, and visualizing data. Hands-on coding with many Python libraries, including pandas, NumPy, Matplotlib, and many more. Analyzing real-world datasets to apply statistical and machine learning such as train decision trees, natural language processing (NLP), and other methods.
Harvard University | edX Data Science in R Professional coursework; program supported by NIH grant R25-GM114818 (2019)
Training in probability, inference, regression, and machine learning. Skills: R programming, R studio, data wrangling with dplyr, data visualization with ggplot2, predictive modeling, machine learning with caret; file organization with Unix/Linux, version control with git and GitHub.
Board Certification in Informatics | ANCC (2015-2025)
Performance Improvement Advisor Certification | Kaiser Permanente (2014)
National Manager, Data Science & Advanced Analytics | Kaiser Permanente Program Offices (2022 – present)
Leading a national data science team of 4 direct reports and 12 cross-team data scientists to deliver business-critical insights on consumer experience; Customer business audience: National and regional business executives serving 10.6M members across eight U.S. markets.; analytic capabilities include development and validation of AI/ML algorithms, prediction of service demands, topic modeling, anomaly detection, NLP, and business intelligence services (dashboards and reports). Oversee and lead large-scale analytic efforts using high-performance computing across an integrated data lake. Coach to junior data scientists and analysts. Partnering with internal data engineers on data quality and validity.
Program Manager, Research & Data Science | Kaiser Permanente Northern California (2018 – 2022)
Led and advanced the operating strategy of a regional program serving 21 hospitals and regional divisions. Served as board member and scientific reviewer on the regional IRB; Regional Section Chief, Clinical Operations in COVID-19 Regional Command Center; Interim Regional Analytics Lead for large program evaluations and data visualization projects; led the development 5-year business case with millions in operational benefits for a regional program; Led AHRQ 2019 Data Science in Healthcare Predictive Modeling Challenge team.
Research Fellow and Sr. Data Consultant | Kaiser Permanente Northern California, Division of Research, Systems Research Initiative (2016 – 2018)
Informatics Consultant & Data Visualization Developer | Kaiser Permanente National IT, Innovation and Informatics (2013 – 2016)
Clinical Analyst | Sepsis Performance Improvement Program, University of California, San Francisco Medical Center, Quality & Patient Safety (2012 – 2013)
Python | DataCamp Data Scientist in Python Certificate program (2021-2022): Anaconda/Spyder IDE; import, clean, manipulate, and visualize data; pandas, NumPy, Matplotlib, etc.; statistical and machine learning techniques; train models and use natural language processing (NLP)
R | Harvard University DS coursework (2016-2019): R Studio IDE, git, dplyr, ggplot2, caret, statistical and machine learning programming, UCSF Machine Learning coursework, Software Carpentry, Kaiser Permanente Data Science workshop
Stata | UCSF School of Medicine, Department of Biostatistics and Epidemioology (2015-2018): Data acquisition and management, data transformation, statistical programming and testing, derivation and validation of multivariable and hierarchical time-series regression models
Tableau | Kaiser Permanente National IT (2013-present): Data acquisition, data management, customized coding, visual analysis, development of interactive electronic dashboards, visual design, publishing to Tableau sever
SQL | Kaiser Permanente National IT (2014-2018): Training in data acquisition from Epic© Clarity and other relational databases; data management