Computer Vision · Medical Imaging · ECE 5524
Multi-class MRI brain tumor classification (Glioma, Meningioma, Pituitary, No Tumor) across 7,023 images. Compared 8 CNN architectures. VGG16 achieved 98% accuracy, ROC-AUC 1.0. Grad-CAM confirms tumor localization for clinical interpretability.
Brain tumors are life-threatening neurological conditions requiring prompt and accurate diagnosis. MRI provides detailed structural insight but manual interpretation is time-consuming and error-prone. This project automates four-class tumor classification using state-of-the-art CNN architectures with Grad-CAM visualization for clinical interpretability.
The four classes: Glioma (aggressive cancerous), Meningioma (non-cancerous, slow-growing), Pituitary Tumor (hormone-affecting), No Tumor.
Combined Figshare (3,064 images — Glioma, Meningioma, Pituitary) and BR35H datasets (No Tumor class). Images normalized to [0,1], resized to 128×128, batched at 32.
Simple sequential structure. Best performing overall — simple yet highly effective.
Solves vanishing gradient via residual blocks. Highly robust performance.
Slightly deeper than VGG16. Balanced performance.
Feature reuse across layers. Compact and efficient. Sharpest Grad-CAM.
Parallel convolutions capturing multi-scale MRI features.
Best efficiency-accuracy tradeoff for resource-constrained environments.
Deeper variant. Slightly lower due to overfitting on dataset size.
PyTorch baseline: 3 conv layers (32→64→128 filters) + 2 FC layers.
All pretrained models achieved ROC-AUC 0.99–1.00
Grad-CAM applied to all architectures to visualize which MRI regions drive each model's prediction. Critical for clinical trust.
Real Grad-CAM outputs: Red/orange = high activation · Blue = low activation · Smaller hotspot = more precise localization
All pretrained CNN architectures significantly outperformed the custom baseline. VGG16 leads on accuracy (98%) while DenseNet121 produces the most clinically useful Grad-CAM visualizations. Future work: ensemble approaches combining VGG16 + DenseNet121, 3D volumetric analysis using full MRI scan sequences, and clinical validation with radiologist feedback.