← Back to Projects

Computer Vision · Medical Imaging · ECE 5524

Brain Tumor Classification
using Deep Learning

Multi-class MRI brain tumor classification (Glioma, Meningioma, Pituitary, No Tumor) across 7,023 images. Compared 8 CNN architectures. VGG16 achieved 98% accuracy, ROC-AUC 1.0. Grad-CAM confirms tumor localization for clinical interpretability.

Medical ImagingGrad-CAM 98% AccuracyVGG16/19 ResNet50/152DenseNet121 Transfer LearningPyTorch

Overview

Brain tumors are life-threatening neurological conditions requiring prompt and accurate diagnosis. MRI provides detailed structural insight but manual interpretation is time-consuming and error-prone. This project automates four-class tumor classification using state-of-the-art CNN architectures with Grad-CAM visualization for clinical interpretability.

The four classes: Glioma (aggressive cancerous), Meningioma (non-cancerous, slow-growing), Pituitary Tumor (hormone-affecting), No Tumor.

Dataset

7,023
MRI Images Total
4
Tumor Classes
70/15/15
Train/Val/Test Split

Combined Figshare (3,064 images — Glioma, Meningioma, Pituitary) and BR35H datasets (No Tumor class). Images normalized to [0,1], resized to 128×128, batched at 32.

Architectures Compared

VGG16

16 layers · Uniform 3×3 filters

Simple sequential structure. Best performing overall — simple yet highly effective.

98% acc ★

ResNet50

50 layers · Skip connections

Solves vanishing gradient via residual blocks. Highly robust performance.

97% acc

VGG19

19 layers · Extended VGG

Slightly deeper than VGG16. Balanced performance.

97% acc

DenseNet121

121 layers · Dense connections

Feature reuse across layers. Compact and efficient. Sharpest Grad-CAM.

96% acc

InceptionV3

48 layers · Multi-scale

Parallel convolutions capturing multi-scale MRI features.

96% acc

EfficientNetB0

Compound scaling · Edge-ready

Best efficiency-accuracy tradeoff for resource-constrained environments.

94% acc

ResNet152

152 layers · Deep extraction

Deeper variant. Slightly lower due to overfitting on dataset size.

94% acc

Custom CNN

Baseline · 3 conv layers

PyTorch baseline: 3 conv layers (32→64→128 filters) + 2 FC layers.

90% acc

Quantitative Results

VGG16 ★ Best
98%
ResNet50
97%
VGG19
97%
DenseNet121
96%
InceptionV3
96%
ResNet152
94%
EfficientNetB0
94%
Custom CNN
90%

All pretrained models achieved ROC-AUC 0.99–1.00

Grad-CAM Visualization

Grad-CAM applied to all architectures to visualize which MRI regions drive each model's prediction. Critical for clinical trust.

Grad-CAM Visualization — all architectures on meningioma MRI
Grad-CAM image — Grad-CAM visualization — 8 architectures on meningioma MRI

Real Grad-CAM outputs: Red/orange = high activation · Blue = low activation · Smaller hotspot = more precise localization

Grad-CAM Finding: VGG16 and DenseNet121 produce the sharpest, most localized heatmaps focused exclusively on the tumor region. ResNet and EfficientNet maintain strong tumor focus but are slightly broader. VGG19 and InceptionV3 occasionally capture non-tumor regions.

Conclusion

All pretrained CNN architectures significantly outperformed the custom baseline. VGG16 leads on accuracy (98%) while DenseNet121 produces the most clinically useful Grad-CAM visualizations. Future work: ensemble approaches combining VGG16 + DenseNet121, 3D volumetric analysis using full MRI scan sequences, and clinical validation with radiologist feedback.

← Previous
CaféVision: Real-Time Space Monitoring