Brain Tumor Classification

Overview

Brain tumors are life-threatening neurological conditions requiring prompt and accurate diagnosis. MRI provides detailed structural insight but manual interpretation is time-consuming and error-prone. This project automates four-class tumor classification using state-of-the-art CNN architectures with Grad-CAM visualization for clinical interpretability.

The four classes: Glioma (aggressive cancerous), Meningioma (non-cancerous, slow-growing), Pituitary Tumor (hormone-affecting), No Tumor.

Dataset

7,023

MRI Images Total

4

Tumor Classes

70/15/15

Train/Val/Test Split

Combined Figshare (3,064 images — Glioma, Meningioma, Pituitary) and BR35H datasets (No Tumor class). Images normalized to [0,1], resized to 128×128, batched at 32.

Architectures Compared

VGG16

16 layers · Uniform 3×3 filters

Simple sequential structure. Best performing overall — simple yet highly effective.

98% acc ★

ResNet50

50 layers · Skip connections

Solves vanishing gradient via residual blocks. Highly robust performance.

97% acc

VGG19

19 layers · Extended VGG

Slightly deeper than VGG16. Balanced performance.

97% acc

DenseNet121

121 layers · Dense connections

Feature reuse across layers. Compact and efficient. Sharpest Grad-CAM.

96% acc

InceptionV3

48 layers · Multi-scale

Parallel convolutions capturing multi-scale MRI features.

96% acc

EfficientNetB0

Compound scaling · Edge-ready

Best efficiency-accuracy tradeoff for resource-constrained environments.

94% acc

ResNet152

152 layers · Deep extraction

Deeper variant. Slightly lower due to overfitting on dataset size.

94% acc

Custom CNN

Baseline · 3 conv layers

PyTorch baseline: 3 conv layers (32→64→128 filters) + 2 FC layers.

90% acc

Quantitative Results

VGG16 ★ Best

98%

ResNet50

97%

VGG19

97%

DenseNet121

96%

InceptionV3

96%

ResNet152

94%

EfficientNetB0

94%

Custom CNN

90%

All pretrained models achieved ROC-AUC 0.99–1.00

Grad-CAM Visualization

Grad-CAM applied to all architectures to visualize which MRI regions drive each model's prediction. Critical for clinical trust.

Real Grad-CAM outputs: Red/orange = high activation · Blue = low activation · Smaller hotspot = more precise localization

Grad-CAM Finding: VGG16 and DenseNet121 produce the sharpest, most localized heatmaps focused exclusively on the tumor region. ResNet and EfficientNet maintain strong tumor focus but are slightly broader. VGG19 and InceptionV3 occasionally capture non-tumor regions.

Conclusion

All pretrained CNN architectures significantly outperformed the custom baseline. VGG16 leads on accuracy (98%) while DenseNet121 produces the most clinically useful Grad-CAM visualizations. Future work: ensemble approaches combining VGG16 + DenseNet121, 3D volumetric analysis using full MRI scan sequences, and clinical validation with radiologist feedback.

Brain Tumor Classificationusing Deep Learning

Overview

Dataset

Architectures Compared

VGG16

ResNet50

VGG19

DenseNet121

InceptionV3

EfficientNetB0

ResNet152

Custom CNN

Quantitative Results

Grad-CAM Visualization

Conclusion

Brain Tumor Classification
using Deep Learning