Data scientist with a strong academic background and expertise in
statistical analysis and modeling. With a Master’s degree in Statistics
and coursework covering advanced statistical methodologies, I am
equipped with the skills to handle large volumes of data, analyze
complex datasets, and provide evidence-based insights. Proficient in
various programming languages such as R, Python and SQL, I apply my
statistical expertise to contribute to meaningful projects that shape
public policy and benefit the most vulnerable residents of
California.
Education
- Statistics M.S., University of California, Irvine,
March 2023
- Coursework in Probability and Statistical Theory, Statistical
Methods in Linear Models, Categorical Data and Longitudinal Data,
Bayesian Data Analysis, Infectious Disease Modeling, Spatial Statistics,
Social Network Data, Survival Analysis
- Designed the
ggomoku
R package and Shiny app as a pair
project
- Statistical Science B.S., University of California,
Santa Barbara, June 2018
- Concentration in Applied Statistics
- Coursework in Data Mining, Regression Analysis, Stochastic
Processes, Non-Parametric Methods, Time Series, Advanced Statistical
Methods, SAS Programming, Statistical Data Science
Skills
- Software Tools and Languages
- R, Python, SQL
- Windows, Linux
- Bash, Git, Markdown, LaTeX
- Snowflake, Oracle
- Languages
- Spanish (B1), Portuguese (B1)
Research Experience
- University of California, Irvine, Mar 2022 - Jun
2022
- Collaborated with Dr. Veronica Berrocal to analyze and visualize the
spatial distribution of immune system cells relative to cancer outcome.
Employed advanced statistical techniques and designed custom R functions
to handle large-scale spatial data.
Work Experience
- Research Data Specialist, State of California, Department of Social
Services, Dec 2023 - Present
- Co-developing a data-imbued methodology to counter electronic
benefits transfer (EBT) theft.
- Utilizing Python, R and SQL and data warehousing platforms such as
Snowflake to clean and analyze data and build machine learning
models.
- Developed the
canectar
R package which generates
hexagonal choropleth (hexbin) plots for county-level data in the shape
of the state of California.
- Authoring documentation on the long-term adoption of replicable
coding standards.
- Water Equity Lab Project Policy Analyst, Water Equity Lab, UC Irvine, Jul 2022 -
Apr 2023
- Analyzed and processed vector and raster data to identify
disparities in access to water infrastructure among minority
demographics.
- Developed over 50 R functions utilizing the sf package, resulting in
streamlined productivity for the team.
- Data Analyst, Valeo Networks, Sep 2018 - Sep 2020
- Implemented predictive analytics solutions to identify trends and
patterns in large datasets, contributing to data-driven
decision-making.
- Created a multiple linear regression pricing model in R, resulting
in improved pricing strategies for a client’s business.
- Developed interactive dashboards using Power BI, SAP Analytics
Cloud, and BrightGauge to visualize key performance indicators and track
business metrics.