The WebP team performed a comparative study of three image compression methods over a random set of 1 million images. The goals of the study were to measure the compression achieved by those methods and to analyze the trade-offs between image sizes and compression.
Three compression methods, WebP, JPEG 2000 and Re-JPEG, were applied to the 900,000 JPEG images contained in the data set. JPEG images were re-compressed with Re-JPEG so that each was as close as possible to a target peak signal-to-noise ratio (PSNR) value. The team's analysis was done on the images after they had been compressed.
In addition to the large scale study, the team manually inspected 100 random images and found that, in most cases, WebP provides acceptable visual quality.
WebP achieves overall higher compression than either JPEG or JPEG 2000. Gains in file size minimization are particularly high for smaller images which are the most common ones found on the web.
The data set used in the study was a collection of 1 million images randomly sampled from a repository of images crawled from the web. The following table shows the distribution of different types of images in the data set.
The experiments were run in two stages:
The actual compression result was measured as a percentage using the following formula:
compression_percentage = 100 * (original_image.length - compressed_image.length) / original_image.length.
The tests yielded the following results for negative and non-negative compression gain:
Following are the average compression percentages that were achieved by each of the three methods.
|Type||Average PSNR obtained||Average Compression Percentage
(non-negative compression gain)
|Average Compression Percentage
(negative compression gain)
When negative compression gain was allowed, the average performance for JPEG 2000 was lower than that of JPEG. This may be due to high-frequency artifacts present in the JPEG images; these artifacts can contribute to a less-than-ideal compression situation.
The Kakadu implementation used in the experiment to convert to JPEG 2000 failed to convert approximately 240,000 images.
JPEG 2000 had lower compression percentages than Re-JPEG, primarily because the JPEG 2000 result was negative compression for many images. For that reason, JPEG images achieved a lower overall compression score.
The following figures provide distribution graphs for the three methods:
Figure 1: Image size population distribution
The graph shows that most of the images were small. The frequency of the images with a size more than 500K is generally less than 100.
Figure 2: Compression percentages for WebP, JPEG, and JPEG 2000 images
This graph demonstrates that the body of pictures compressed with WebP got better compression than re-jpeg and jpeg 2000. Also, the pictures compressed with WebP got more positive compression than the others.
Figure 3: Comparison of image sizes and compression percentage
This graph shows that WebP achieves better compression then the other formats, particularly for smaller images.