This repository contains coursework, assignments, and projects completed during my Master of Professional Studies in Data Analytics at Penn State University. The courses span various topics such as data collection, cleaning, databases, predictive analytics, and data-driven decision-making.
Covers methods for collecting, cleaning, and preparing data for analysis.
- Web scraping, APIs, and data automation
- Data wrangling and transformation
- Handling missing or incomplete data
Key Projects:
- Web Scraping Financial Data
- Data Cleaning and Transformation of Survey Data
Focuses on the design and management of large-scale databases and data warehouses.
- RDBMS concepts and SQL
- ETL processes
- Database performance optimization
Key Projects:
- Relational Database Design and Optimization
- Designing a Data Warehouse for Retail Data
Explores techniques for leveraging data to make informed business decisions.
- Decision-making frameworks
- Predictive modeling and analytics
- Simulation and scenario analysis
Key Projects:
- Predictive Model for Business Decision-Making
- Scenario-Based Decision Simulations
Provides fundamental knowledge of predictive analytics using machine learning techniques.
- Regression, classification, and clustering methods
- Feature engineering
- Model evaluation and validation
Key Projects:
- Predictive Model for Customer Churn
- Classification of Medical Data
Covers foundational concepts in database architecture, design, and administration.
- ER modeling
- Normalization and database schema design
- Advanced querying
Key Projects:
- ER Model for Hospital Management System
- Query Optimization in a Large-Scale Database
Introduces data mining concepts and techniques for extracting valuable insights from data.
- Association rule mining
- Clustering, classification, and outlier detection
- Text and web mining
Key Projects:
- Market Basket Analysis using Association Rules
- Sentiment Analysis on Social Media Data
Focuses on programming techniques and libraries for analytics using Python.
- pandas, NumPy, and scikit-learn
- Data manipulation and preprocessing
- Model development and testing
Key Projects:
- Building a Predictive Model in Python
- Data Preprocessing and Feature Engineering for a Kaggle Competition
Teaches techniques for creating effective data visualizations to convey insights.
- Dashboard design and storytelling with data
- Data visualization tools: Tableau, matplotlib, and seaborn
Key Projects:
- Interactive Dashboard for Business KPIs
- Visualization of Economic Indicators with Python
Capstone course focusing on end-to-end analytics project design and deployment.
- End-to-end analytics lifecycle
- Agile analytics development
- Deployment and implementation strategies
Capstone Project:
- Design and Implementation of an Analytics Solution for a Real-World Business Problem
|-- DAAN_822/
|-- DAAN_825/
|-- DAAN_881/
|-- IE_575/
|-- INSC_521/
|-- SWENG_545/
|-- DAAN_862/
|-- DAAN_871/
|-- DAAN_888/