
DEPT
.
OF COMP
.
ENGG
.
(
NOT FOR SALE
)
TOOLS AND LIBRARIES
1. NumPy is a Python library used for scientific computing. It provides support for
multi-dimensional arrays and matrix data structures, along with a large collection of high-level
mathematical functions to operate on these arrays.
2. Pandas stands for Python Data Analysis Library. It is a fast, powerful, flexible, and easy-to-use
open-source data analysis and manipulation tool built on top of Python. Pandas introduces the
DataFrame (a two-dimensional, size-mutable, tabular data structure with labelled axes), which is
used throughout the labs to load, clean, explore, and manipulate datasets .
3. Matplotlib is one of the most popular Python libraries for data visualisation. It provides a
MATLAB-like interface for creating static, animated, and interactive plots. In these labs,
Matplotlib is used to plot histograms, training and validation loss/accuracy curves, regression
and cluster scatter plots, genetic-algorithm convergence curves, and confusion-matrix heatmaps,
helping students visualise data distributions and model performance.
4. Seaborn is a Python data visualisation library built on top of Matplotlib. It provides a high-level
interface for drawing attractive and informative statistical graphics. In these labs, Seaborn is used
to generate pair plots, correlation heatmaps, box plots, count plots, and confusion-matrix
visualisations, supporting exploratory data analysis across the experiments.
5. Scikit-Learn is one of the most useful libraries for machine learning in Python. It contains a
large collection of efficient tools for machine learning and statistical modelling, including
classification, regression, clustering, and dimensionality reduction. In these labs, Scikit-Learn is
used to implement linear and multiple linear regression, logistic regression, K-nearest neighbours
(KNN), support vector machines (SVM), decision trees, random forests, and clustering and
anomaly-detection methods (K-Means, DBSCAN, Isolation Forest, Local Outlier Factor, and
One-Class SVM), as well as preprocessing tools such as StandardScaler and LabelEncoder and
evaluation metrics including accuracy, precision, recall, F1-score, and confusion matrices.
6. TensorFlow is an open-source library for numerical computation and large-scale machine
learning, developed by Google. It provides a comprehensive ecosystem of tools and libraries for
building and deploying machine learning and deep learning models. In these labs, TensorFlow is
used to build and train artificial neural networks (ANNs) for HVAC fault detection,
convolutional neural networks (CNNs) and transfer-learning models for image pattern
recognition, and bidirectional LSTMs for sentiment analysis.
7. Keras is a high-level deep learning API built on top of TensorFlow 2. It is a powerful and
easy-to-use open-source Python library for developing and evaluating deep learning models.
Keras provides intuitive building blocks such as the Sequential model and Input, Dense, LSTM,