A state-of-the-art deep learning project for CIFAR-100 image classification using transfer learning with ResNet-50, achieving 84.35% test accuracy through progressive fine-tuning and advanced training techniques.
This project implements a robust image classification pipeline for the CIFAR-100 dataset, featuring:
- Transfer Learning: Leveraging ResNet-50 pretrained on ImageNet
- Progressive Fine-tuning: Three-stage unfreezing strategy for optimal convergence
- Advanced Augmentation: Comprehensive data augmentation with Mixup/CutMix
- Interactive Web Interface: Streamlit-based application for real-time predictions
- Production-Ready: Complete training pipeline with early stopping and model checkpointing
- ✨ High Accuracy: 84.35% test accuracy on CIFAR-100
- 🎨 Rich Augmentation: ColorJitter, RandomRotation, GaussianBlur, RandomErasing, Mixup/CutMix (see the transform sketch after this list)
- 📊 Interactive UI: Upload images and get instant predictions with confidence scores
- 🔄 Progressive Training: Three-stage fine-tuning strategy
- 📈 Learning Rate Scheduling: OneCycleLR for optimal convergence
- 🛡️ Early Stopping: Prevents overfitting with patience-based monitoring
- 📉 Comprehensive Logging: Track training metrics across all epochs
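As a rough illustration of the augmentation listed above, a training-time transform stack could look like the sketch below. This is a minimal sketch, not the exact pipeline from train.ipynb: the resize step, the horizontal flip, and all parameter values are assumptions, and Mixup/CutMix are applied at the batch level (e.g., via timm's Mixup utility) rather than inside transforms.Compose.

```python
from torchvision import transforms

# Illustrative training-time augmentation (values are assumptions, not the project's exact settings)
train_transform = transforms.Compose([
    transforms.Resize(224),                    # upscale 32x32 CIFAR images for the ImageNet-pretrained backbone
    transforms.RandomHorizontalFlip(),
    transforms.RandomRotation(15),
    transforms.ColorJitter(brightness=0.2, contrast=0.2, saturation=0.2),
    transforms.GaussianBlur(kernel_size=3),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],   # ImageNet statistics
                         std=[0.229, 0.224, 0.225]),
    transforms.RandomErasing(p=0.25),          # tensor-level, so it comes after ToTensor
])
```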
- Model Architecture
- Training Strategy
- Installation
- Usage
- Results
- Dataset
- Project Structure
- Customization
- Contributing
- License
- Acknowledgments
- References
- Contact
- Architecture: ResNet-50
- Pretrained Weights: ImageNet1K-V2
- Total Parameters: 23,712,932
- Final Layer: Modified FC layer (2048 → 100 classes)
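A minimal sketch of the backbone setup with torchvision (assuming torchvision ≥ 0.13 for the weights enum):

```python
import torch.nn as nn
from torchvision import models

# ResNet-50 pretrained on ImageNet (ImageNet1K-V2 weights)
model = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V2)

# Swap the 1000-class head for a 100-class CIFAR-100 classifier (2048 -> 100)
model.fc = nn.Linear(model.fc.in_features, 100)
```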
Stage 1: FC layer training
- Only the final fully connected layer is trainable
- Learning Rate: max_lr=0.01
- Warmup: 30% of cycle
- Purpose: Adapt the classifier to CIFAR-100 without disrupting pretrained features
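A sketch of the Stage 1 setup, assuming an SGD optimizer and an existing train_loader (both are assumptions; the notebook may differ):

```python
import torch
from torch.optim.lr_scheduler import OneCycleLR

# Freeze the backbone; train only the classifier head
for param in model.parameters():
    param.requires_grad = False
for param in model.fc.parameters():
    param.requires_grad = True

optimizer = torch.optim.SGD(model.fc.parameters(), lr=0.01, momentum=0.9)  # optimizer choice is an assumption

# OneCycleLR: 30% of the steps warm up to max_lr=0.01, the rest anneal back down
scheduler = OneCycleLR(
    optimizer,
    max_lr=0.01,
    epochs=10,                          # Stage 1 covers epochs 1-10
    steps_per_epoch=len(train_loader),  # train_loader: assumed DataLoader over the training split
    pct_start=0.3,
)
```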
Stage 2: Deep layers fine-tuning
- Unfreeze: Layer 3, Layer 4, and FC
- Differential Learning Rates:
- Layer 3: 0.0005
- Layer 4: 0.001
- FC: 0.005
- Purpose: Fine-tune deeper layers for CIFAR-100 specific features
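Stage 2 can be expressed as selective unfreezing plus per-block parameter groups; a sketch (the optimizer choice is an assumption):

```python
# Unfreeze layer3, layer4 and the classifier head; keep earlier layers frozen
for name, param in model.named_parameters():
    param.requires_grad = name.startswith(("layer3", "layer4", "fc"))

# Differential learning rates per block
optimizer = torch.optim.SGD([
    {"params": model.layer3.parameters(), "lr": 0.0005},
    {"params": model.layer4.parameters(), "lr": 0.001},
    {"params": model.fc.parameters(),     "lr": 0.005},
], momentum=0.9)
```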
Stage 3: Full model fine-tuning
- All layers trainable
- Layer-wise Learning Rates:
- Layer 1: 0.00005
- Layer 2: 0.0001
- Layer 3: 0.0003
- Layer 4: 0.0008
- FC: 0.002
- Purpose: End-to-end fine-tuning with discriminative learning rates
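Stage 3 follows the same pattern with every layer trainable; a sketch with discriminative learning rates (how the stem conv1/bn1 is grouped and the optimizer choice are assumptions):

```python
# Unfreeze everything for end-to-end fine-tuning
for param in model.parameters():
    param.requires_grad = True

layer_lrs = {"layer1": 0.00005, "layer2": 0.0001, "layer3": 0.0003,
             "layer4": 0.0008, "fc": 0.002}

param_groups = [{"params": getattr(model, name).parameters(), "lr": lr}
                for name, lr in layer_lrs.items()]

# Assumption: train the stem (conv1/bn1) with the smallest learning rate
param_groups.append({
    "params": list(model.conv1.parameters()) + list(model.bn1.parameters()),
    "lr": 0.00005,
})

optimizer = torch.optim.SGD(param_groups, momentum=0.9)
```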
- Python 3.10
- CUDA-capable GPU (recommended)
- 8GB+ RAM
- Clone the repository
- Create virtual environment
- Install dependencies
- Download the CIFAR-100 dataset: get cifar-100-python.tar.gz from the releases page and extract it to ./data
Run the training notebook (train.ipynb) to train your own model, or convert it to a script and run it from the command line.
Training will automatically:
- Split the official 50,000 training images into train/validation sets (90%/10%), keeping the 10,000-image test set separate
- Apply progressive fine-tuning
- Save best model based on validation loss
- Implement early stopping (patience=10)
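A minimal sketch of the checkpointing and early-stopping logic, assuming hypothetical train_one_epoch/evaluate helpers and the 100-epoch budget:

```python
import math
import torch

best_val_loss = math.inf
patience, bad_epochs = 10, 0

for epoch in range(100):
    train_one_epoch(model, train_loader, optimizer)   # hypothetical training helper
    val_loss = evaluate(model, val_loader)            # hypothetical helper returning mean validation loss

    if val_loss < best_val_loss:
        best_val_loss = val_loss
        bad_epochs = 0
        torch.save(model.state_dict(), "model_checkpoint.pth")  # keep only the best weights
    else:
        bad_epochs += 1
        if bad_epochs >= patience:
            print(f"Early stopping at epoch {epoch + 1}")
            break
```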
Launch the interactive web application; it opens in your browser at http://localhost:8501. A minimal inference sketch is shown after the feature list below.
Features:
- Upload images (JPG, JPEG, PNG)
- Real-time predictions
- Confidence scores for all 100 classes
- Probability distribution visualization
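A minimal sketch of what such a Streamlit app can look like; model, inference_transform, and class_names are assumed to be defined elsewhere, and the actual app may differ:

```python
import streamlit as st
import torch
import torch.nn.functional as F
from PIL import Image

st.title("CIFAR-100 Image Classifier")

uploaded = st.file_uploader("Upload an image", type=["jpg", "jpeg", "png"])
if uploaded is not None:
    image = Image.open(uploaded).convert("RGB")
    st.image(image, caption="Uploaded image")

    model.eval()                                    # model: assumed to be the trained ResNet-50
    x = inference_transform(image).unsqueeze(0)     # inference_transform: assumed eval-time preprocessing
    with torch.no_grad():
        probs = F.softmax(model(x), dim=1).squeeze(0)

    top_probs, top_idx = probs.topk(5)
    for p, i in zip(top_probs.tolist(), top_idx.tolist()):
        st.write(f"{class_names[i]}: {p:.2%}")      # class_names: assumed list of the 100 fine labels
    st.bar_chart(probs.numpy())                     # full probability distribution over all classes
```

Assuming the app lives in a file such as app.py, it would typically be started with `streamlit run app.py`.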
| Metric | Value |
|--------|-------|
| Test Accuracy | 84.35% |
| Test Loss | 0.7622 |
| Best Validation Loss | 0.9729 |
| Training Epochs | 66/100 (early stopping) |
| Total Parameters | 23,712,932 |
| Training Time | ~15.5 hours (NVIDIA GeForce GTX 1650 GPU) |
| Stage | Epochs | Accuracy | Description |
|-------|--------|----------|-------------|
| Stage 1 | 1-10 | ~40% | FC layer training |
| Stage 2 | 11-25 | ~73% | Deep layers fine-tuning |
| Stage 3 | 26-66 | ~79% | Full model fine-tuning |
The model achieved steady convergence with:
- Consistent training loss reduction
- Validation loss stabilization around epoch 56
- Early stopping triggered at epoch 66
- Total Images: 60,000 (32×32 RGB)
- Training Set: 45,000 images (90% split)
- Validation Set: 5,000 images (10% split)
- Test Set: 10,000 images
- Classes: 100 fine-grained categories
- Superclasses: 20 coarse categories
All classes are balanced with 500 training images and 100 test images per class.
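A sketch of the data loading and 90%/10% split, assuming torchvision's CIFAR-100 dataset and the train_transform from the augmentation sketch above (test_transform and the seed value are assumptions):

```python
import torch
from torchvision import datasets
from torch.utils.data import random_split

# Official CIFAR-100 splits: 50,000 training and 10,000 test images
full_train = datasets.CIFAR100(root="./data", train=True, download=True, transform=train_transform)
test_set = datasets.CIFAR100(root="./data", train=False, download=True, transform=test_transform)

# 90%/10% train/validation split of the official training set (45,000 / 5,000)
train_size = int(0.9 * len(full_train))
val_size = len(full_train) - train_size
train_set, val_set = random_split(
    full_train, [train_size, val_size],
    generator=torch.Generator().manual_seed(42),  # seed value is an assumption
)
```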
Note: You can download model_checkpoint.pth and cifar-100-python.tar.gz from the release page.
Edit train.ipynb to adjust hyperparameters such as learning rates, number of epochs, early-stopping patience, and the augmentation settings.
Contributions are welcome! Please feel free to submit a Pull Request.
- Fork the repository
- Create your feature branch (git checkout -b feature/AmazingFeature)
- Commit your changes (git commit -m 'Add some AmazingFeature')
- Push to the branch (git push origin feature/AmazingFeature)
- Open a Pull Request
This project is licensed under the MIT License - see the LICENSE file for details.
- CIFAR-100 Dataset: Learning Multiple Layers of Features from Tiny Images, Alex Krizhevsky, 2009
- ResNet Architecture: Deep Residual Learning for Image Recognition, He et al., 2015
- PyTorch Team: For the excellent deep learning framework
- Streamlit Team: For the intuitive web app framework
- TIMM Library: Ross Wightman for data augmentation utilities
- Krizhevsky, A. (2009). Learning Multiple Layers of Features from Tiny Images.
- He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep Residual Learning for Image Recognition. CVPR.
- Zhang, H., Cisse, M., Dauphin, Y. N., & Lopez-Paz, D. (2018). mixup: Beyond Empirical Risk Minimization. ICLR.
- Yun, S., Han, D., Oh, S. J., Chun, S., Choe, J., & Yoo, Y. (2019). CutMix: Regularization Strategy to Train Strong Classifiers with Localizable Features. ICCV.
- Smith, L. N., & Topin, N. (2019). Super-convergence: Very Fast Training of Neural Networks Using Large Learning Rates.
For questions or feedback, please open an issue on GitHub.
© 2025 Amirali Soltani Rad - GitHub
⭐ If you find this project helpful, please consider giving it a star!
Happy Classifying! 🎨🤖