This repository serves as a beginner-friendly introduction to data science and machine learning. The content is organized into Jupyter Notebooks covering various topics.
-
Python Basics 1.ipynb
- Perform basic Python operations.
-
Python Basics 2.ipynb
- Perform basic Python operations using Python data structures.
-
Numpy.ipynb
- Perform basic array operations using the NumPy library.
-
Pandas.ipynb
- Perform operations on series and dataframes using pandas.
-
Data Visualization.ipynb
- Read raw data and visualize them using various visualization techniques.
-
PreProcessing.ipynb
- Load data, analyze, rescale, and transform the data.
-
Classification.ipynb
- Perform classification using Decision Tree, Naive Bayes, SVM, and KNN algorithms.
-
Clustering.ipynb
- Perform clustering on raw data using k-means algorithm and DBSCAN.
-
Spark_Wordcount.ipynb
- Use Spark and Hadoop with map reduce techniques to obtain word count from big data.
A Python demonstration of a simple neural network as an educational introduction to the fundamental functions of a neural network
Make sure you have Jupyter Notebook installed to run the notebooks. Additionally, install required Python libraries using:
pip install numpy pandas matplotlib seaborn scikit-learn findspark