You Won’t Believe Your Eyes: Combining Cloudinary’s Generative AI Transformations

Written By Paul Thompson May-30-2024 3 Min Read

Cloudinary’s AI features are more than powerful, they’re also user-friendly. Combining robust APIs and SDKs with generative AI provides developers with the easiest way to start editing images. This blog post offers some example use cases that demonstrate how these features can be progressively combined to create stunning new variations of original images.

Use Case 1: Removing an Object and Expanding With Generative Fill

Sometimes, companies have an existing library of images they want to reuse for new experiences. This process may involve removing text or an object and changing the image dimensions. Simply removing an object may sound easy enough for a Photoshop expert, but expanding the image is where things get tricky.

Here’s an image of a house in the woods. We want to remove the car to highlight the property’s seclusion. We also need to reset the format from portrait to landscape for use in a banner:

1. Original

2. Remove the Car

Let’s emphasize the solitude of the house:

3. Expand the Image to 16:9 Aspect Ratio With Standard (White) Padding

4. Replace the White Padding With Generative Fill

Use Case 2: Normalizing and Correcting UGC Imagery

Our generative features’ most common use case is correcting and normalizing images supplied by vendors or subscribers as product or property listing images. Often, these businesses need to learn the condition of the photos they will receive.

In this example from a travel and tourism site, the agency managing a hotel listings for their client has uploaded a beautiful but overly compressed and underexposed image of a couple on a tropical beach at night:

1. Original

2. Correct the Expose With Our Enhance Feature (e_enhance)

3. Handle Blurriness and Artifacts With Generative Restore

4. Create a Banner With a 3:1 Aspect Ratio Using Auto Crop and Auto Gravity

Use Case 3: Removing Background, Adding a Drop Shadow, and BG Color

Many retailers need their product catalog to have a uniform look and feel for each product. Often, they receive products with different dimensions, quality, and backgrounds. In this example, we’ll adjust a product shot to have a common plain background and a drop shadow and crop it to a standard size.

1. Original

2. Ensure the Desired 4:3 Aspect Ratio Using Fill Pad and a White Background

3. Add Depth With an AI-Powered Drop Shadow

4. Fine-Tune the Drop Shadow With Advanced Settings Such as Azimuth and Elevation

Use Case 4: Editing an Image with Generative AI

Now, let’s look at how all our generative AI features work together to create a new image from your original. For this example, we’ll take an image of an “open” sign for a business and edit it to make an ad for Cloudinary AI:

1. Original

2. Clean Up the Image With Enhance and Generative Restore

3. Remove the Hand and Text Using Generative Remove

4. Make the Image Landscape With a 1:1 (Square) Aspect Ratio Using Generative Fill

5. Use Generative Replace to Change the Sign to a Cloud Design

6. Add Text Advertising Our AI Features Using the Text Overlay Tool

Adding Performance Optimization

Of course, all of these images can be delivered in the best possible format and quality by using Cloudinary’s f_auto and q_auto. This ensures the user experiences the fastest possible loading time without sacrificing quality. Let’s look at the last example with f_auto and q_auto added:

The resulting image on Chrome is a 78KB AVIF image, compared to the 250KB JPEG original. Quite a savings in size for two simple API commands.

And That’s a Wrap

When it comes to editing and creating new variations of your original images, the possibilities are limitless with Cloudinary’s AI transformations. These use cases span from the practical to the imaginative, demonstrating the versatility of our features. No matter what your image and video enhancements may be, you can be confident that Cloudinary transformations will integrate into your automated workflows, simplifying your processes, and accelerating your time to market, and freeing up teams to be more creative.

You Won’t Believe Your Eyes: Combining Cloudinary’s Generative AI Transformations

Use Case 1: Removing an Object and Expanding With Generative Fill

1. Original

2. Remove the Car

3. Expand the Image to 16:9 Aspect Ratio With Standard (White) Padding

4. Replace the White Padding With Generative Fill

Use Case 2: Normalizing and Correcting UGC Imagery

1. Original

2. Correct the Expose With Our Enhance Feature (e_enhance)

3. Handle Blurriness and Artifacts With Generative Restore

4. Create a Banner With a 3:1 Aspect Ratio Using Auto Crop and Auto Gravity

Use Case 3: Removing Background, Adding a Drop Shadow, and BG Color

1. Original

2. Ensure the Desired 4:3 Aspect Ratio Using Fill Pad and a White Background

3. Add Depth With an AI-Powered Drop Shadow

4. Fine-Tune the Drop Shadow With Advanced Settings Such as Azimuth and Elevation

Use Case 4: Editing an Image with Generative AI

1. Original

2. Clean Up the Image With Enhance and Generative Restore

3. Remove the Hand and Text Using Generative Remove

4. Make the Image Landscape With a 1:1 (Square) Aspect Ratio Using Generative Fill

5. Use Generative Replace to Change the Sign to a Cloud Design

6. Add Text Advertising Our AI Features Using the Text Overlay Tool

Adding Performance Optimization

And That’s a Wrap

Leveraging React’s Compiler With Cloudinary for Optimized Image Handling

Products

Solutions

Developers

Company

Contact Us

Featured Post