DINOv2 flower classification code

14 Jan 2024 • Junuk Cha

Introduction

As you may know, DINOv2 possesses a powerful ability to extract semantic features from input images (please refer to this post). Using these extracted features, I tackled a flower classification problem.
This post covers the following:

A comparison of classification results when using a fully connected classifier on top of DINOv2 and ResNet50 backbones.
A TSNE visualization of the extracted features.

Dataset

Before discussing the results, I will briefly describe the dataset I used.

17 Category Flower Dataset

It has 17 categories of flowers, with 80 images per class. I have implemented code to process this dataset: dataset code.
Download data
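To make the dataset layout concrete, here is a minimal sketch of how labels and a train/validation split could be derived. It assumes the images are numbered consecutively from 1 to 1360 and grouped by class (80 per class), and it uses a hypothetical split of 16 validation images per class. The helper names `label_for_index` and `split_indices` are illustrative and not taken from the linked dataset code.

```python
import random

NUM_CLASSES = 17
IMAGES_PER_CLASS = 80


def label_for_index(image_index: int) -> int:
    """Map a 1-based image index to a 0-based class label.

    Assumption: images are numbered consecutively and grouped by class,
    so indices 1..80 belong to class 0, 81..160 to class 1, and so on.
    """
    total = NUM_CLASSES * IMAGES_PER_CLASS
    if not 1 <= image_index <= total:
        raise ValueError(f"image_index must be in [1, {total}]")
    return (image_index - 1) // IMAGES_PER_CLASS


def split_indices(val_per_class: int = 16, seed: int = 0):
    """Return (train, val) lists of image indices, stratified by class.

    The 64/16 split per class is a hypothetical choice for illustration.
    """
    rng = random.Random(seed)
    train, val = [], []
    for c in range(NUM_CLASSES):
        start = c * IMAGES_PER_CLASS + 1
        indices = list(range(start, start + IMAGES_PER_CLASS))
        rng.shuffle(indices)
        val.extend(indices[:val_per_class])
        train.extend(indices[val_per_class:])
    return train, val


train_idx, val_idx = split_indices()
print(len(train_idx), len(val_idx))  # 1088 272
```

Splitting per class keeps every category equally represented in both sets, which matters when each class has only 80 images.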

The result comparison

I compared ResNet50 + Classifier and DINOv2 + Classifier, using the same classifier for both backbones.

The classifier is a simple head consisting of two FC layers. classifier code.
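As a rough illustration of such a head, here is a minimal PyTorch sketch of a two-layer fully connected classifier that takes backbone features as input. The hidden size of 256, the ReLU between the layers, and the input dimensions (2048 for pooled ResNet50 features, 384 for the CLS token of the DINOv2 ViT-S/14 variant) are assumptions for illustration. The post does not state which DINOv2 variant or hidden size was used, so the linked classifier code may differ.

```python
import torch
from torch import nn


class Classifier(nn.Module):
    """Two fully connected layers mapping backbone features to class logits."""

    def __init__(self, in_dim: int, hidden_dim: int = 256, num_classes: int = 17):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_dim, hidden_dim),   # first FC layer
            nn.ReLU(),                       # assumed non-linearity
            nn.Linear(hidden_dim, num_classes),  # second FC layer -> 17 logits
        )

    def forward(self, features: torch.Tensor) -> torch.Tensor:
        return self.net(features)


# Same head design for both backbones; only the input dimension changes.
resnet_head = Classifier(in_dim=2048)  # assumed: pooled ResNet50 features
dino_head = Classifier(in_dim=384)     # assumed: DINOv2 ViT-S/14 CLS token

# Random tensor standing in for a batch of 4 feature vectors from the backbone.
features = torch.randn(4, 384)
logits = dino_head(features)
print(logits.shape)  # torch.Size([4, 17])
```

Keeping the head identical across backbones means any accuracy difference comes from the features rather than from the classifier.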

ResNet50 + Classifier

DINOv2 + Classifier

The performance with DINOv2 is better than with ResNet50: DINOv2’s accuracy on both the training set and the validation set is dramatically higher. You can train the model here.

The visualization of TSNE

You can reproduce the results with this code.

TSNE of ResNet50

TSNE of DINOv2

The TSNE results for DINOv2 show that the features are more tightly clustered within each class.
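For readers who want to try the projection step on their own features, here is a minimal sketch using scikit-learn's `TSNE`. The feature matrix is synthetic (random class centers plus noise) and only stands in for real backbone features. The perplexity, feature dimension, and number of samples per class are assumptions, not the settings used for the figures above.

```python
import numpy as np
from sklearn.manifold import TSNE

rng = np.random.default_rng(0)
num_classes, per_class, dim = 17, 10, 64

# Synthetic stand-in for backbone features: one random center per class plus noise.
centers = rng.normal(scale=5.0, size=(num_classes, dim))
labels = np.repeat(np.arange(num_classes), per_class)
features = centers[labels] + rng.normal(size=(labels.size, dim))

# Project the features to 2D; perplexity must be smaller than the number of samples.
tsne = TSNE(n_components=2, perplexity=30, init="pca", random_state=0)
embedding = tsne.fit_transform(features)
print(embedding.shape)  # (170, 2)
```

Plotting `embedding[:, 0]` against `embedding[:, 1]`, colored by `labels`, gives a scatter plot of the same kind as the figures above. With real features, tighter same-color groups indicate more coherent class features.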

Interactive

I implemented an interactive figure. You can interact with it using your mouse and check how similar the images in the same cluster are. interactive code. GIF

For more detailed code, please click on this link. If you have any questions, feel free to leave a comment.