top of page

DATA SCIENCE PROJECTS 

DS Projects

PRE-PROCESSING PIPELINE DEVELOPMENT FOR DBDP

BIG IDEAS Lab, Duke University

August 2020 - Present

  • Obtaining data for analysis in the desired and right format is a tedious job.

  • Built pre-processing functions for Digital Biomarker Pipeline Discovery (DBDP) to obtain pandas and/or .csv files for data from wearable sensors.

  • The pipeline preprocess modifies column names, changes timestamps, calculates elapsed time and converts the file type to accessible .csv file for further analysis.

  • The pipeline is developed using Python Programming languages. Libraries used pandas, matplotlib, numpy, seaborn, datetime, pytz, os, sys, json, rowingdata, mne, re

  • Skills acquired : Writing pre-processing functions, Reading data from different file types like .tcx, .EDF, etc. and Blog writing

  • https://github.com/DigitalBiomarkerDiscoveryPipeline/Pre-process/tree/master/pipeline

Screenshot%202021-01-23%20140855_edited.

PREDICTION OF BLINDNESS IN GLAUCOMA PATIENTS

Data Science Intern, Data+ Program, Rhodes Information Initiative, Duke University

May 2020 - July 2020

  • Used data from Electronic Health Records and Durham Neighborhood Compass to predict blindness in glaucoma patients and analyze progression of blindness.

  • The project involved analysis using R programming in the PACE environment. 

  • Used multi-variate statistical model, random forest, support vector machine algorithm for analysis and performed survival analysis to study progression of blindness. 

  • Skills acquired : Preprocessing data to create blindness labels using medical definition of blindness, exploratory data analysis in R, modeling, poster presentation.

RNFL.jpg

CENTER OF MASS DETECTION OF LEFT VENTRICLE OF THE HEART

Cardiac Ultrasound Imaging & Function (BME 543), Duke University

August 2020 - November 2020

  • Used Echocardiograms to detect the center of mass of the left ventricle of the heart. 

  • The goal of the project was to find a pattern in change of center of mass.

  • The class also provided the opportunity to understand heart anatomy through detailed dissection sessions and lectures in the Duke medical school.

  • The analysis was done in MATLAB.

  • Skills acquired : Image processing, Pig heart dissection, Algorithm development, Understanding 3D images, Using Philips QLabs software

echocardiogram_1280.jpg

MIMIC DATASET : MORTALITY PREDICTION

Data Science & Health (BME 590), Duke University

January 2020 - April 2020

  • The goal of the project was to predict mortality in patients being admitted in the hospital.

  • Data was downloaded from Physionet MIMIC website and the data was accessed directly from the database.

  • The project was implemented using Python programming. Libraries used : pandas, numpy, string, re, sql, sqlite3, sklearn, matplotlib

  • Skills acquired : Accessing data from database, Exploratory data analysis, Data preprocessing, feature extraction, modeling (logistic regression, random forest classifier) using lasso.

logo%20(1)_edited.png

PREDICTION OF ACTIVITY & CALORIES BURNT USING WEARABLES DATA

Biomedical Data Science (BME 590), Duke University

August 2019 - December 2019

  • Built a classification & regression model to predict activity and calories burnt respectively, using Apple Watch Dataset.

  • The model was tested using Fitbit dataset.

  • The type of activity was classified using classification algorithms - Support Vector machines and Decision Trees, whereas, the amount of calories burnt was predicted using regression models - Multiple Linear regression and Multivariate adaptive regression splines (MARS).

  • The project was implemented in R using libraries like dplyr, tidyverse, caTools, e1071.

  • Skills acquired : Data pre-processing, Classification (support vector machines and decision trees) and regression (multiple linear regression and multivariate adaptive regression splines) modeling, cross validation, test-train validation.

Data

KAGGLE's WiDS DATATHON 2020

Remote

January 2020 - February 2020

  • The aim of the project was to create a model to predict patient survival using data from first 24 hours of intensive care.

  • We achieved a Area under Receiver Operating Characteristic Curve of 0.88296.

  • Skills learnt & Models used : Data imputation, Data Normalization, Generalized Boosted Regression Modeling (GBM), Lasso Regression, Logistic Regression, AUC score, Sensitivity & Specificity

WiDS_Logo_edited.jpg

COVID-19 ICU PHYSIOLOGICAL DATA ANALYSIS

BIG IDEAS Lab, Duke University

August 2020 - Present

  • Nutritional and metabolic needs can result in under or over-feeding which can in turn result increased probability of mortality or problems after discharge.

  • To understand metabolic profile of COVID patients using data collected using PhysioFlow (wearable device) and BIA devices from patients in the ICU.

  • Determine effect of COVID on muscle physiology and body composition.

  • Data for the study is obtained under the LEEP COVID study being conducted at Duke University.

  • Analysis is being done using Python programming language. Libraries used till now : Pandas, Numpy, Plotly, Scikit-learn, Seaborn, Datetime, Missingno, Matplotlib

  • Skills acquired : Extensive Exploratory Data Analysis, Development of Pre-processing functions for Physioflow and InBody BIA device data, Statistical decisions regarding outliers, Imputation and modeling, Research - oriented thinking

thumbs_b_c_05921e97eec4b76ed409ab0fd3937

ASSESS ACCEPTABILITY OF DATA SHARING

BIG IDEAS Lab, Duke University

January 2020 - Present 

  • The goal of the project is to understand acceptability of devices from wearables and mobile phones (health apps) data sharing in Duke patients through surveys.

  • The project is on-hold due to the COVID-19 pandemic and alternate methods of survey deployment is being considered.

  • Skills acquired : Filing IRB application, Building survey questions in collaboration with behavioral scientists, doctors and professors, Using Qualtrics survey building resources.

Image by Celpax

FETAL HEAD CIRCUMFERENCE MEASUREMENT USING ULTRASOUND IMAGES

Machine Learning in Imaging (BME 590L), Duke University

January 2020 - April 2020

  • Used Unet Segmentation algorithm and Convolutional Neural Network for segmentation and fetal head measurement respectively in TensorFlow.

  • The goal of the project was to measure fetal head circumference to monitor fetal health during the gestation period.

  • Ultrasound images were used from publicly available HC-18 dataset on the Grand Challenge website.

  • Segmentation method using Unet was used to detect fetal head. Regression algorithm VGG-16 was used to calculate the fetal head circumference using ultrasound images.

  • I used Python programming language and tensorflow to complete the project.

  • Skills acquired : Use of tensorflow library, implementation of Unet segmentation algorithm, Convolutional neural network, VGG-16 algorithm and Resizing images.

Ultrasounds

INDIAN LIVER PATIENT DATASET ANALYSIS

Computational Linear Algebra (BME 790L), Duke University

August 2019 - December 2019

  • Obtained data from UCI Machine Learning Repository.

  • Used unsupervised learning to cluster individuals with liver illness using R. The goal was to identify similarities in a set of patient population with liver disease, this could be used to further identify the type of liver disease.

  • Used labels of the data to build a classification model to classify people with and without liver illnesses using MATLAB.

  • The project was completed using R and MATLAB. Libraries such as : dplyr, tidyverse, Amelia, mlbench, Rcpp, Hmisc, lattice, caTools, e1071, 

  • Skills acquired : K-means clustering, Principal component analysis, Preprocessing data in R, Exploratory data analysis in R, modeling (linear regression, support vector machine).

6a831600-9a0d-429f-9d34-d957c45b9517.png

MIT COVID19 CHALLENGE - BEAT THE PANDEMIC II (HACKATHON)

MIT Hackathon, Remote

May 2020

  • Ideated a non-invasive device to detect biomarkers to predict infection.

  • The team consisted of 7 members and the project pitch was presented for the panel.

ojdJ1XiQ_400x400.jpg

SOFTWARE PROJECTS 

Software Projects

HEART RATE SENTINAL SERVER

  • Built a centralized heart rate sentinel server that receives GET and POST requests from patient heart rate monitors in Python.

  • Developed patient side (upload information) and monitoring side GUI (view information). Server sends email to the physician if tachycardic heart rate occurs.

TSH ANALYSIS

  • Use data from text files to generate diagnosis data based on thyroid values, store patient information in python dictionaries and create JSON files for each patient.

ECG ANALYSIS

  • Used ECG data from a text file to calculate the duration of ECG, number of beats, heart rate and time of beat occurrence and save JSON file for each patient.

QUANTITATIVE PATHOLOGY AGENT BASED MODELING

  • Built a NETLOGO models to show role of the immune system in AIDS, windkessel model and Luo Rudy model. Explored heart dysfunction during heart attack.

  • Developed model of a myelinated fiber connected to a muscle fiber and further developed the model for Chronic Inflammatory Demyelinating Polyneuropathy.

DESIGN PROJECTS 

Design Projects

K-SENSE GAIT AND KNEE FLEXION MONITORING DEVICE

  • Designed a device to provide feedback to children with disrupted gait cycles due to conditions such as cerebral palsy, down syndrome, and lower-limb paralysis.  

  • Utilized design thinking principles to create a device that can give support to child’s physiotherapist in devising exercises along with improving the child’s gait. 

  • Included a sound alert that alerts the client when he/she correctly performs the correct gait pattern: a heel strike first (knee extended) followed by foot flat (knee extended) and then toe touch (knee flexed).

  • Provides real-time feedback in the form of entertaining music to encourage the child to walk right without continuous supervision from the PT.

  • The device includes two parts : Ankle and knee part.

  • Award: People's Choice Award, BME Design Symposium

  • The goal of the device is to improve gait of the child.

  • Skills acquired : Autodesk Fusion 360, Eagle, Arduino programming, 3D printing, Design thinking principles

Screenshot%202021-01-24%20132137_edited.

SKILL BUILDING - WEATHER STATION

  • Build a weather station which can measure Light intensity, temperature & humidity in the atmosphere.

  • The casing of the circuit was designed using Autodesk Fusion 360 and 3D printed using Ultimaker 3 printer.

  • The device passed IPX2 rating and survived 3ft drop test.

  • Skills acquired : Designing using Fusion 360, 3D printing, Different device ratings, Eagle

IMG-2832.PNG

ELECTRONIC CODE LOCK USING MICROCONTROLLER (AT89C51) 

  • A digital code lock with a default password. Major features include set user-desired password and determination of number of people accessing the lock

  • Microcontroller was programmed using an 8051 programmer and WLPRO software.

  • Circuit was simulated using Proteus Professional software and the hex file was created using Keil μVision software (Coded in C language).

  • Skills acquired: C Programming, Proteus Professional software, Keil μVision software, Integrating LCD, Assembly programming

Image by Vishnu Mohanan

PORTABLE MUSCULOSKELETAL SUPPORT AND SIMULATION DEVICE

  • Developed a portable device (crutches) that automatically adjusts the height of the crutches on stairs and provides support while sitting and ascending and descending stairs

  • Prototyped a muscle stimulator to include in the design that reduces muscular pain.

  • Skills acquired : PCB soldering, Product designing in workshop, literature review

musculoskeletal device3.jpg

SKILL BUILDING - PHONE STAND (FOLDABLE)

  • Designed a foldable phone stand as a single body using Autodesk Fusion 360. The design was 3D printed using Ultimaker 3.

  • Skills learnt : 3D printing, Designing using Autodesk Fusion 360

Screenshot 2021-01-24 140903.png

DC MOTOR SPEED & LED BRIGHTNESS CONTROL USING IC (555 TIMER)

  • Controlled LED brightness using a potentiometer, i.e. by changing the resistance of the circuit.

  • The light intensity of the LEDs and speed of the motor was controlled.

  • Skills acquired : Basics of electronics, PCB soldering

Circuit Board
linkedin.png
github.jpg
lab.png

Website best viewed in Desktop / Laptop

bottom of page