LinkedIn Profile | GitHub Projects
Data Scientist/Analyst with 8+ years of hands-on experience using Python | VS Code | Azure AI | SQL | R | R Markdown/Shiny | Alteryx | Snowflake | Git | Jira | MS Office for data science, analytics, machine learning & actuarial modeling, applied research, data prep & database management (ETL flow), developing R packages & interactive apps.
Career interests in data science, ML, analytics, and personal finance.
4. Credit Default Logistic Regression (w/ Python (pandas, sklearn) + Jupyter NoteBook)
– Jupyter Notebook with Python code analyzing credit default data, how load duration, installment rates, living arrangements & other parameters impact probability of a default.
– created Python code analyzing advertising experiment on Treatment group, using Control group as a baseline showing how T-tests can be used to see statistical difference in outcomes (conversion rate) before and after the advertising.
–
6. Isolation Forest (w/ Python sklearn.ensemble + seaborn)
– Detecting Inliners and Outliers based on salary and FTE, assuming 10% of population are outliers (Unsupervised learning)
7. Pension Trends Linear Regression (w/ R Shiny)
– Interactive app that allows to build boxplots and linear regression by picking 2 variables and Type of public employee.
8. Candy Data Logistic Regression (w/ R + Jupyter NoteBook)
– Jupyter Notebook with R code analyzing candy data, price-to-sugar relationship & why some win more often when matched than others.
9. Seoul Bike Share Analysis (w/ R + R Markdown)
– Analyzing 2018 data on bikes rented in Seoul and parameters (like season, temperature, and hour) that impact the number of bikes rented.
[2.1 Data workflow (Palantir Foundry)]
– Interactive data pipeline using Palantir Foundry that cleans, sorts, and joints two tables.
2.2 Pension Database (App w/ PostgreSQL + R Shiny)
– Interactive app showing 2001-2021 data for 110+ U.S. pension plans from internal database & linked public sources. User can filter data by state, by plan, by starting year, select columns, view interactive charts, and download filtered data in csv format.
2.3 Invesment Returns (App w/ R Reactable + Shiny)
– Interactive table & histogram/line chart showing distribution of 2020-2021 pension plan investment returns, and 2001-2021 pension debt.
User can filter data by state, plan, and add percentiles to distribution histogram
2.4 Invesment Returns 2022 (App w/ R + datawrapper)
– Interactive table & line chart showing distribution of 2022 pension plan investment returns built using R for data manipulation & datawrapper for front-end
3. US Electricity Supply/Demand (Viz w/ Python pandas)
– Line Charts showing share of US electricity supply at Peak and Low-load hours during each day in 2020
4. Compound Change in Debt (App w/ Shiny + HTML + Netlify)
– Interactive Waterfall chart showing how pension plan’s debt compounded (by category) over 2000-2021 period. Slider resets starting year.
5. R Package (w/ R + Roxygen2 + GitHub)
– “pensionviewr” R package contains functions to pull, filter, manipulate, and visualize pension data from internal database & public sources.
1. PowerPoint Slides (w/ RMarkdown + “pensionviewr” R pkg)
– RMarkdown files allow to automate pulling, analyzing and visualizing data, then knit charts/text/tables into set of powerpoint slides.
1. Pension Benefit Projection (App w/ R + Shiny)
– Benefit model (w/ interactive interface and custom R funcitons) that allows to recalculate total present value of public employees’ pension benefits.
Functionality includes Defined Contribution plans benefits, different employee tiers, custom discount rates and other
2. Pension Benefit Model (Interactive Doc w/ RMarkdown + Shiny)
– RMarkdown document w/ interactive Benefit model elements that allows to write text, bullet points, automate numbers, and interact w/ benefit model.
– R code (with for-loops and custom functions) that projects pension plan’s funding and contributions. Functionality includes stress-testing, changing parameters and amortization method.
1. ChatGPT API (Jupyter Notebook w/ openai pkg)
– Code for ChatGPT prompts, allowing to set install, set up, and execute API-based connection w/ openai Python package
1. New Normal in Investment Returns (w/ Excel & R)
2. Alaska TRS Employee Separation Effects (w/ R & Stata)
*– Peer-reviewed academic research and analysis involving data collection and ID joints, descriptive and regression (ML) analysis, probability analysis (Monte Carlo simulations), etc. *
Page template forked from evanca