Skip to main content

Working with Grayscale Images using OpenCV and Python3


Hi Guys! Welcome to the PART-2 of Getting Started with OpenCV - Working with Grayscale Images using OpenCV and Python3. In this step by step tutorial we are going to learn about

  • What are grayscale images ?
  • How to convert image from RGB to Grayscale ?
  • How to do the depth analysis of an image ?

What are Grayscale Images in Image Processing ?

A Grayscale Image is a 1D image whose every pixel represents a single color plane, which is know as gray. Gray color plane is a single intensity value plane. It is very different from RGB color plane which has 3 color planes Red(R - 0 -> 255), Green(G - 0 -> 255) and Blue(B - 0 -> 255) but in Grayscale images has only one color plane which is gray and intensity values goes from 0(Black) to 255(White) and the values lies in between has different shades of gray.

1. How Grayscale Images Work

  • Each pixel in a grayscale image holds a single value that defines the brightness.
  • The range of values depends on the bit-depth:
    • 8-bit grayscale (most common): Pixel values range from 0 to 255.
    • 16-bit grayscale: Pixel values range from 0 to 65,535.
    • 32-bit grayscale: Used in high-dynamic-range (HDR) imaging.

2. Why Use Grayscale Images?

Grayscale images are widely used in image processing due to their simplicity and efficiency. Some benefits include:

  • Faster Processing → Grayscale images require less computation than color images.
  • Reduced Memory Usage → Instead of three channels (RGB), only one channel is stored.
  • Better Feature Detection → Many computer vision techniques (e.g., edge detection, thresholding) work better on grayscale images.
  • Noise Reduction → Grayscale images eliminate color variations, making processing more robust.

4. Grayscale Conversion Formula

The conversion from RGB (or BGR in OpenCV) to grayscale is done using a weighted sum: $$Y = 0.299R + 0.587G + 0.114B$$ where:

  • R (Red) contributes 30% to brightness.
  • G (Green) contributes 58% (human eyes are more sensitive to green).
  • B (Blue) contributes 11%.

Prerequisites and Installation

We will start with Prerequisites and Installation for OpenCV Python library -

  1. You should have Windows or Linux or Mac OS installed on your PC.
  2. Make sure you should have internet up and running for the installations.
  3. Install any IDE of your choice (Like VS Code, Jupyter Lab/Notebook, Pycharm etc...). For OpenCV my personal choice is Jupyter Lab/Notebook.
  4. Install OpenCV Python library with pip - pip install opencv-python.

Once all the above mentioned requirements are fulfilled, we are good to go.

import cv2
from matplotlib import pyplot as plt
  1. cv2 (OpenCV)

    import cv2
    
    • OpenCV (Open Source Computer Vision) is a popular library used for image and video processing.
    • The cv2 module provides functions to read, manipulate, and analyze images and videos.
    • Example functionalities include reading an image (cv2.imread()), displaying an image (cv2.imshow()), applying filters, and performing edge detection.
  2. pyplot from Matplotlib

    from matplotlib import pyplot as plt
    
    • pyplot is a submodule of matplotlib, used for creating plots and visualizations.
    • It provides a MATLAB-like interface for generating graphs and displaying images.
    • The alias plt is a common convention to shorten the module name for easier usage.

Purpose of this Code Block:

  • It allows using OpenCV for image processing and Matplotlib for visualizing the results.
  • OpenCV's cv2.imshow() is often used for displaying images, but it doesn't work well in Jupyter Notebooks. Matplotlib's plt.imshow() is an alternative to display images more flexibly.
# Load our input image
image = cv2.imread('./image/fruits.webp')

This line of code reads an image file (fruits.webp) from the specified path (./image/) using OpenCV's cv2.imread() function.

Breakdown:

  • cv2.imread(filepath)

    • This function loads an image from the given file path.
    • The file format can be .jpg.png.bmp.tiff.webp, etc.
    • If the file is not found, cv2.imread() returns None.
  • './image/fruits.webp'

    • './' refers to the current directory.
    • 'image/fruits.webp' means the file fruits.webp is inside the image/ folder.
  • Assignment to image

    • The variable image now holds the image as a NumPy array.
    • The array shape depends on the color mode:
      • Colored Image(height, width, 3) → Three channels (BGR).
      • Grayscale Image(height, width) → Single-channel.
# Define our imshow function 
def imshow(title = "Image", image = None, size = 6):
    w, h = image.shape[0], image.shape[1]
    aspect_ratio = w/h
    plt.figure(figsize=(size * aspect_ratio,size))
    plt.imshow(cv2.cvtColor(image, cv2.COLOR_BGR2RGB))
    plt.title(title)
    plt.show()

imshow("Lena", image)

Explanation of the imshow():

What Does This Code Do?

This defines a function imshow() to display an image using Matplotlib while ensuring proper aspect ratio. It then calls imshow() to display an image (e.g., image containing Lena's picture).

Breaking It Down:

1. Function Definition

def imshow(title="Image", image=None, size=6):
  • Defines a function named imshow().
  • Parameters:
    • title="Image" → Default title for the image.
    • image=None → Holds the image to display.
    • size=6 → Controls the figure size.

2. Get Image Dimensions

w, h = image.shape[0], image.shape[1]
  • Retrieves width (w) and height (h) of the image.
  • image.shape returns:
    • (height, width, channels) for colored images.
    • (height, width) for grayscale images.

3. Compute Aspect Ratio

aspect_ratio = w / h
  • Aspect ratio ensures the image retains its original proportions when displayed.
  • It is calculated as width ÷ height.

4. Set Figure Size

plt.figure(figsize=(size * aspect_ratio, size))
  • Dynamically scales the figure size based on the aspect ratio.
  • size (default 6) determines the base height, and width is adjusted accordingly.

5. Convert BGR to RGB and Display

plt.imshow(cv2.cvtColor(image, cv2.COLOR_BGR2RGB))
  • OpenCV loads images in BGR format, but Matplotlib expects RGB.
  • cv2.cvtColor(image, cv2.COLOR_BGR2RGB) converts the color format.

6. Add Title and Show

plt.title(title)
plt.show()
  • Sets the title of the displayed image.
  • plt.show() renders the image.

Function Call

imshow("Lena", image)
  • Calls imshow() with:
    • Title "Lena".
    • The image variable (which should be a loaded image).
gray_image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)

imshow("Converted to Grayscale", gray_image)

1. Converting a Color Image to Grayscale

gray_image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
  • cv2.cvtColor() is an OpenCV function used to change the color space of an image.
  • cv2.COLOR_BGR2GRAY converts an image from BGR (Blue, Green, Red) format (default in OpenCV) to grayscale.
  • The resulting gray_image contains only one channel (intensity values from 0 (black) to 255 (white)).

2. Displaying the Grayscale Image

imshow("Converted to Grayscale", gray_image)
  • Calls the previously defined imshow() function to display the grayscale image.
  • The image title is set as "Converted to Grayscale".
  • Since grayscale images contain only one channelcv2.cvtColor() is not needed before displaying.

Understanding Grayscale Conversion:

  • color image (BGR) has three channels (Blue, Green, Red).
  • grayscale image has only one channel where pixel intensity values range from 0 (black) to 255 (white).
  • The conversion formula used by OpenCV:
    Y = 0.299R + 0.587G + 0.114B
    • More weight is given to the green channel since the human eye is more sensitive to green.

Expected Output

  1. Before Conversion

    • image.shape - (1024, 1024, 3) (Height, Width, 3 color channels)
  2. After Conversion to Grayscale

    • gray_image.shape - (1024, 1024) (Only Height, Width)
print(image.shape)
print(gray_image.shape)
(1024, 1024, 3)
(1024, 1024)

Depth Analysis

This code prints the shape (dimensions) of two images: image and gray_image. The .shape attribute of a NumPy array (which OpenCV uses to store images) provides information about the image dimensions.

1. Understanding image.shape

  • image is typically a colored image loaded using OpenCV:
    image = cv2.imread('./image/fruits.webp')
    
  • OpenCV loads color images in BGR format, meaning the image array has three channels (Blue, Green, Red).
  • The output of image.shape for a color image:
    (height, width, 3)
    
    • Height → Number of rows (pixels).
    • Width → Number of columns (pixels).
    • 3 → Represents the three color channels (BGR).

Example:

image = cv2.imread('fruits.webp')
print(image.shape)  # Output: (1024, 1024, 3) for a Full HD image

2. Understanding gray_image.shape

  • If gray_image is a grayscale image, it has only one channel.
  • Grayscale images are typically obtained using:
    gray_image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    
  • The output of gray_image.shape for a grayscale image:
    (height, width)
    
    • No third value, because there’s only one channel (intensity of light per pixel).

Example:

gray_image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
print(gray_image.shape)  # Output: (1024, 1024)

Wrapping Up

In this blog we have learned -

  • What are grayscale images ?
  • How to convet image from RGB to Grayscale ?
  • How to do the depth analysis of an image ? I hope you have learn somthing new and you like this blog. If you have any feedback or thoughts you can write in the comment section. Thank you!

Comments