Practical Big Data Analytics

Syllabus:

  • Data Reduction (Python Dictionaries and Pandas DataFrames)
  • Data Visualization
  • Prediction and Classification Problems
  • Recommender Systems
  • Cluster Analysis
  • Big Data and Distributed Computing

Projects Ideas:

Data Visualization

Learn how to use Altair or Bokeh to visualize your data

Interactive web-based data visualization

Learn how to use Dash to create Interactive web apps

Natural language processing

Email Spam Filtering

Fake news

Yelp reviews

Time Series Forcasting

sktime library

Anomaly Detection Algorithms

Implement and apply an anomaly detection algorithm like Isolation Forest or Local Outlier Factor

Fraud Detection Datasets

Association Analysis

Implement the apriori anomaly and apply it on a dataset

Be creative! Here you have some famous dataset repositories