This repository contains a deep learning project implementing three powerful CNN architectures layer by layer for multi-class image classification.
-
Models implemented:
- GoogleNet
- EfficientNet
- Vision Transformer (ViT)
-
Dataset: 2,484 images categorized into 7 classes
-
Achieved 97.423% accuracy
-
Ranked 1st (tied) on the Ain Shams AI department Kaggle leaderboard
βββ notebooks/
β βββ cnn_classification.ipynb # Main Jupyter notebook with model training and evaluation
βββ testing_scripts/
β βββ test.py # Script for testing trained models on new data
β βββ output/ # Folder where CSV results will be saved
β βββ models/ # Folder where trained models are saved
β βββ Test/ # Folder where test photos should be placed
βββ README.md
Achieved 97.423% accuracy on 2484 unseen test samples and ranked 1st (tied) on the Ain Shams CS department Kaggle leaderboard
- Open and run the notebook in the
notebooks/
folder to train and evaluate models. - Use the testing script in
testing_scripts/
to run inference on new images.