torchvision.datasets.ImageFolder is giving me a 3x3 grid of images intead of 1 image

Question

I can't figure out why it's giving me 9 gray images in a 3x3 grid instead of just one color image (original image is not gray and has RGB channels). I have spent 5 hours on this. Thanks for the help.

Here is my code

test_path = "asl_data/test/" #path to the folder
test_data = torchvision.datasets.ImageFolder(test_path, transform=torchvision.transforms.ToTensor())
def test32():
    for x, y in test_data:
        print(x.shape)
        x = x.reshape(533,800,3)
        plt.axis("off")
        plt.imshow(x)
        plt.show()
        plt.axis("off")
        plt.imshow(x[:176,:267,:])
        break
test32()

does it happen on any image folder? – Shai Mar 02 '22 at 22:11 — Shai, Mar 02 '22 at 22:11
@shai yes, all images in every folder – Altaïr Ibn-La'Ahad Mar 02 '22 at 22:13 — Altaïr Ibn-La'Ahad, Mar 02 '22 at 22:13
what version of `torchvision` are you using? – Shai Mar 02 '22 at 22:15 — Shai, Mar 02 '22 at 22:15
@shai, version is '0.11.3+cu113' – Altaïr Ibn-La'Ahad Mar 02 '22 at 22:17 — Altaïr Ibn-La'Ahad, Mar 02 '22 at 22:17

Shai · Accepted Answer · 2022-03-02T22:26:41.403

1

Classic.

You reshape instead of permute.

See this thread on the crucial difference between the two.

Fix:

x = x.permute((1, 2, 0))
plt.imshow(x)

A simple visual example:

x, y = test_data[0]  # take one image
x.shape  # torch.Size([3, 223, 320])

# see the difference
fig, ax = plt.subplots(1,2)
ax[0].imshow(x.numpy().reshape(223, 320, 3))
ax[0].set_title('Wrong reshape instead of permute')

ax[1].imshow(x.permute((1,2,0)))
ax[1].set_title('correctly permuting')

edited Mar 02 '22 at 22:26

answered Mar 02 '22 at 22:19

Shai

111,146
38
238
371

ah, Thank you for the answer. I spent so much time reading the documentation of torchvision.dataset and torchvision.transform. I feel so stupid now. – Altaïr Ibn-La'Ahad Mar 02 '22 at 22:23

torchvision.datasets.ImageFolder is giving me a 3x3 grid of images intead of 1 image

1 Answers1

Linked