This project simulates a streaming data pipeline for a fictional eCommerce website. It generates fake data and publishes it into Apache Kafka topics. Apache Spark then consumes, processes, and analyzes the streaming data before loading it into a MongoDB database.
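The data-generation step described above might look something like the following minimal sketch. It uses only the Python standard library and stops at the point where a Kafka producer would take over. The topic name, product list, and event fields are assumptions made for illustration and do not come from the project itself.

```python
import json
import random
import uuid
from datetime import datetime, timezone

# Hypothetical topic name and product catalogue, chosen for illustration only.
TOPIC = "ecommerce.orders"
PRODUCTS = ["laptop", "headphones", "keyboard", "monitor"]


def make_order_event(rng: random.Random) -> dict:
    """Build one fake order event; every field name here is an assumption."""
    return {
        "order_id": str(uuid.UUID(int=rng.getrandbits(128))),
        "product": rng.choice(PRODUCTS),
        "quantity": rng.randint(1, 5),
        "unit_price": round(rng.uniform(5.0, 500.0), 2),
        "created_at": datetime.now(timezone.utc).isoformat(),
    }


def serialize(event: dict) -> bytes:
    """Encode an event as UTF-8 JSON, a common shape for a Kafka message value."""
    return json.dumps(event).encode("utf-8")


def generate_batch(n: int, seed: int = 0) -> list[bytes]:
    """Generate n serialized events from a seeded random generator."""
    rng = random.Random(seed)
    return [serialize(make_order_event(rng)) for _ in range(n)]


if __name__ == "__main__":
    for payload in generate_batch(3):
        print(TOPIC, payload.decode("utf-8"))
```

In a real pipeline each payload would be handed to a Kafka producer client instead of being printed, and Spark would read the same JSON from the topic on the consuming side.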
This project aims to build an ETL pipeline that extracts YouTube comments and Reddit headlines related to the newly launched iPhone 16. The pipeline cleans and transforms the data, conducts sentiment analysis to assess public opinion, loads the processed data into a MongoDB database, and queries the database to generate visualizations for reporting.
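As a rough illustration of the clean, transform, and score steps, here is a minimal sketch in plain Python. It swaps in a toy word-list scorer for whatever sentiment method the project actually uses; the word lists, function names, and record fields are all hypothetical.

```python
import re

# Toy word lists, assumed for illustration; a real pipeline would more likely
# use a trained model or an established sentiment lexicon.
POSITIVE = {"love", "great", "amazing", "good", "fast"}
NEGATIVE = {"hate", "bad", "slow", "overpriced", "boring"}

URL_RE = re.compile(r"https?://\S+")
NON_WORD_RE = re.compile(r"[^a-z0-9\s]")


def clean(text: str) -> str:
    """Lowercase, drop URLs and punctuation, and collapse whitespace."""
    text = URL_RE.sub(" ", text.lower())
    text = NON_WORD_RE.sub(" ", text)
    return " ".join(text.split())


def score(text: str) -> int:
    """Count positive tokens minus negative tokens in the cleaned text."""
    tokens = clean(text).split()
    return sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)


def label(text: str) -> str:
    """Map the numeric score to a coarse sentiment label."""
    s = score(text)
    return "positive" if s > 0 else "negative" if s < 0 else "neutral"


def transform(raw: dict) -> dict:
    """Shape one raw record (hypothetical 'source' and 'text' fields) for loading."""
    return {
        "source": raw["source"],
        "text": clean(raw["text"]),
        "sentiment": label(raw["text"]),
    }
```

Each dictionary returned by `transform` is already in a document shape, so it could be passed to a MongoDB insert call in the load step.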