Python Developer | Web Development | Data Engineering | Data Analytics | Big Data | Machine Learning
Book sharing platform for users to share their used books. Exploratory data analysis(EDA) done on data from Gooogle Books and GoodReads API and cleaned data loaded(ETL) into online database hosted by Heroku. A content based collaborative filtering model based on Nearest Neighbors (KNN model) was also incorporated to provide recommendations to users.
ToolKit: Python3, Pandas, PostgreSQL, Flask, HTML/CSS/JavaScript, D3.js, Seaborn, scikit-learn, Leaflet, Google Co-lab, Heroku
More details can be found here
Interactive visualizations and dashboard to explore the Belly Button Biodiversity dataset, which catalogs the microbes that colonize human navels.
ToolKit: HTML/CSS/JavaScript, Plotly.js, D3.js
Dashboard Link
More details can be found here
A web application that scrapes various websites (NASA MARS News Site, JPL Mars Images, Mars Weather Twitter Account, Mars Space Facts, USGS Astrogeology Site ) for data related to the Mission to Mars and displays the information in a single HTML page
ToolKit: Python3, Pandas, Flask, BeautifulSoup, Selenium/Splinter, ChromeDriver,MongoDB, HTML/CSS/JavaScript
More details can be found here
Performed the ETL process completely in the cloud on the Amazon's product reviews dataset and uploaded a DataFrame to an RDS instance. Used PySpark and SQL to perform a statistical analysis of selected data to analyze whether reviews from Amazon's Vine program are trustworthy.
ToolKit: Python3, Google Co-lab, AWS S3, PySpark, PostgreSQL
More details can be found here
Repositories for other projects and assignments done as part of self development and learning. Updated on regular basis.
ToolKit: Python3, PostgreSQL, Jupyter Notebooks, HTML/CSS/JavaScript, Python Libraries, VBA, Excel
More details can be found here
GPA: A+
GPA: 8.64
Looking to Hire or to collaborate on projects?
Feel free to connect with me: