This is my data science portfolio, with a sampling of my projects and the articles I have written on data science and analysis.
Recent Projects
PyTorch Influence Functions
Determining which images have the greatest positive and negative impact on a Deep Neural Network model’s predictions.
Link to GitHub repository
Using PyTorch to Identify Model Influences
This project is based on the nimarb reimplementation of Influence Functions from the ICML 2017 best paper, "Understanding Black-box Predictions via Influence Functions" by Pang Wei Koh and Percy Liang. The reference implementation can be found here: link.
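To give a feel for what an influence score is, here is a minimal sketch on a toy ridge regression model rather than the deep network used in the project. It follows the definition from the Koh and Liang paper, where the influence of upweighting a training point z on the loss at a test point is the negative of grad L(z_test) transposed, times the inverse Hessian, times grad L(z). The function name, the synthetic data, and the l2 parameter are all assumptions made for illustration; the project itself relies on the nimarb code, which approximates the inverse-Hessian product stochastically instead of solving for it exactly.

```python
import numpy as np


def influence_on_test_loss(X, y, x_test, y_test, l2=1e-3):
    """Influence of each training point on one test loss (toy ridge regression).

    Hypothetical helper for illustration only. The model minimises
    (1/n) * sum(0.5 * (x @ theta - y)**2) + 0.5 * l2 * ||theta||**2,
    so the Hessian is available in closed form.
    """
    n, d = X.shape
    # Exact Hessian of the regularised training objective.
    hessian = X.T @ X / n + l2 * np.eye(d)
    # Fitted parameters: solve the normal equations.
    theta = np.linalg.solve(hessian, X.T @ y / n)
    # Per-example loss gradients at the fitted parameters, shape (n, d).
    grads_train = (X @ theta - y)[:, None] * X
    # Gradient of the test loss, shape (d,).
    grad_test = (x_test @ theta - y_test) * x_test
    # Inverse-Hessian-vector product, computed once for the test point.
    s_test = np.linalg.solve(hessian, grad_test)
    # Negative score: upweighting that point lowers the test loss (helpful).
    # Positive score: upweighting that point raises the test loss (harmful).
    return -grads_train @ s_test


# Synthetic data, assumed purely for demonstration.
rng = np.random.default_rng(0)
X_train = rng.normal(size=(50, 3))
y_train = X_train @ np.array([1.0, -2.0, 0.5]) + 0.1 * rng.normal(size=50)
x_new = rng.normal(size=3)
y_new = x_new @ np.array([1.0, -2.0, 0.5])

scores = influence_on_test_loss(X_train, y_train, x_new, y_new)
print("most helpful training index:", int(np.argmin(scores)))
print("most harmful training index:", int(np.argmax(scores)))
```

Sorting the scores gives the training examples with the greatest positive and negative impact on that one prediction, which is the same ranking idea the project applies to images.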
The Dataset
The project uses the CIFAR-10 image dataset, which consists of 50,000 training images and 10,000 test images in 10 classes.
read more
Pneumonia X-ray
Link to GitHub repository
Chest x-ray image of normal lungs
This project builds a deep neural network trained on a large dataset for a non-trivial classification task: using x-ray images of patients to determine whether or not they have pneumonia.
The Dataset
The dataset originates from Kermany et al. on Mendeley.
The particular subset used for this project is sourced via Kaggle.
read more