Agnes McFarlin Data Scientist
mountains

My Expertise

Data Scientist with experience building and testing Machine Learning Models.

-------------------

-----------------

----------------------

Featured Projects

mountains

Second place Prize for the AI Data Readiness Challenge For the NCI and NIH

  • Data mining, Data quality assessment, Machine Learning, Pytorch, Python

I performed this project for the NIH, as part of the AI data readiness challenge. The purpose of this project was to assess the AI readiness of publicly available data from The National Lung Screening Trial (NLST) dataset. NIH Link

Check it out
mountains

Statistical Analysis of Population data

  • Data Cleaning, Statistical Analysis, Working with Large datasets

Performed Statistical Analysis on a large amount of population data in Python in order to determine whether specific areas had higher incidences of certain diseases. For the purpose of budget allotment.

Check it out
mountains

Pytorch Tutorial

  • Pytorch, Pytorch Ignite, Python

A project providing instructions on how to build a custom data loader and model in Pytorch. The model is a neural Network classifier. And it performs binary classification. The model is also tested against out of sample data, as part of first steps toward creating something more scalable.

Check it out
mountains

Medical Image Classification

  • Python, Keras, Machine Learning, Medical Image analysis

A project to optimize a machine learning model that was trained to identify Cancerous lung nodules in slide images.

Check it out
mountains

Forecasting Model

  • Python, Sci-kit learn, prediction,

The purpose of this project was to use Machine learning models to cast a prediction for the future spending in the residential sector on electricity for the year 2021. Their results will then be compared and a report will be drafted.

Check it out
mountains

Clustering Project to automate bird identification

  • Python, sci-kit learn, K-means

I attempted to identify birds by their skeletal structure using machine learning, namely K-Means.

Check it out