Complex Wavelet SSIM
The complex wavelet structural similarity (CW-SSIM) index is an extension of the structural similarity index that utilizes a complex continuous wavelet transform in order to determine the perceptual similarity between two digital images: a reference image and a distorted image. This model was developed in the Laboratory for Computational Vision (LCV) at New York University.
Motivation[edit]
The SSIM was devised with the intent of being a structural similarity measurement that should provide an adequate measurement of perceived image similarity. The SSIM does this by taking advantage of strong correlations between neighboring pixels in an image. The SSIM works very well for many types of image distortions that affect the image's structural information such as contrast stretching, luminance shifting, or the addition of Gaussian noise. However, the SSIM has a major drawback: it is sensitive to image rotations and image translations.:[1]
This becomes particularly troublesome in cases when trying to compare images where one of the image-taking devices is off-centered and leads to erroneous image quality measurements. The CW-SSIM was developed to make the SSIM insensitive to these aforementioned distortions.
The CW-SSIM takes advantage of the fact that phase contains more structural information than magnitude does in natural images and the fact that translations in images lead to a consistent phase shift in the complex wavelet domain[1]. This allows it to deal with image rotations since small image rotations can be locally approximated as translations in an image.
Algorithm[edit]
The CW-SSIM index is calculated on various subbands of an image in the complex wavelet domain. The resultant indices are then averaged over the whole image to get a resultant quality measure for the whole images.
The similarity measure between two images over a window of size 'N' 'x' 'N' is given as [2]
where:
- is the zero mean complex wavelet coefficient taken at spatial location for window
- is the zero mean complex wavelet coefficient taken at spatial location for window
- is the complex conjugate of
- is a small positive constant used to keep the CW-SSIM from becoming unbounded
The CW-SSIM index has the same properties as the SSIM index. Notably symmetry: and boundedness from -1 to 1.[2]
Formula components[edit]
The CW-SSIM formula can be understood more fully by breaking up the the formula into a maginitude component () and a phase component ().
The magnitude component is given by .
This term is only dependent on the magnitudes of the complex wavelet transform coefficients and reaches its upper bound of 1 if and only if .[2]
The phase component is given by .
This term depends only on the phase difference between the components and [2]. It reaches its upper bound of 1 if the difference in phase between and is uniform over all windows i.
Comparison of the CW-SSIM With Other Fidelity Measures[edit]
Like the SSIM, the CW-SSIM method returns values closer to 1 for distortions caused by luminance and contrast changes. In addition, it returns high values in response to small translations and subsequently small rotations, and small scalings. It works well with the latter two since small rotations and small scalings can be locally approximated as small translations in a given window.[2]. Furthermore, additions that degrade the structural information contained within the magnitudes of the pixel values but do not degrade the information in the phase by a lot are awarded higher similarity scores by the CW-SSIM then by the SSIM.[2]
An example of this comparison is shown in the figure to the right. This file shows seven different non-structural distortions on a picture of stuffed cow. As seen by the captions, the MSE (Mean Squared Error) gives very mixed results when comparing the visual similarity of the altered images to the original image. The SSIM on the other hand, performs quite well for the images (b) and (c), demonstrating its robustness to luminance and contrast changes. It shows very poor results the remaining altered images which demonstrates its sensitivity to scaling (d and e), shifts (f and g) and rotation (h). The CW-SSIM is shown to work better at recognizing the perceptual similarity of the images above in all cases when compared to the SSIM. work very well for nonstructural distortions such as contrast stretching. We can see though that the CW-SSIM correctly awards the translated images with a higher scores than the MSE and the SSIM due to its phase component which attributes the shift of the image to a phase shift in the complex wavelet domain. [1] The CW-SSIM also is shown to be effective when faced with scaling and rotations due to approximating both of those locally as small translations in the spatial domain. [1]
Despite being much better than the SSIM and MSE when utilized for image similarity comparisons, the CW-SSIM still suffers from a limitation that it only performs well if the shift, rotation, or scaling distortions are small enough. [2]
Applications[edit]
In the paper "Translation insensitive image similarity in complex wavelet domain" by Zhou Wang and Eero P. Simoncelli, the CW-SSIM was compared to the MSE and the SSIM through character recognition. In the results of the paper, the CW-SSIM was shown to have a much greater character recognition rate than methods based off of the MSE and SSIM. [3] In another paper titled "Complex Wavelet Structural Similarity: A New Image Similarity Index", the CW-SSIM has also been tested as a method for facial recognition and it was shown to perform this task very well compared to existing methods based off of the mean squared error and the Partial Hausdorff Distance Metric (PHD).[2] The CW-SSIM has also been utilized as in a method of palm print verification. The method based off of the CW-SSIM was shown to perform better than one of the most effective palmprint verification algorithms based off of the competitive coding scheme. [4]
Variations[edit]
Weighted CW-SSIM[edit]
One weakness of the CW-SSIM is its poor performance in dealing with low-frequency errors that cause spatial distortions in images. The Weighted Subband Extension of CWSSIM (WCWSSIM) is a proposed extension of the CW-SSIM that works to combine the results of multiple wavelet scales in a manner reminiscent of the technique used in the MS-SSIM. This technique is shown to work better than the CW-SSIM when dealing with images affected by local mean distortions. [5]
References[edit]
- ↑ 1.0 1.1 1.2 1.3 Wang, Zhou; Bovik, A.C. (2004-04-01). "Mean squared error: Love it or leave it? A new look at Signal Fidelity Measures". IEEE Transactions on Image Processing. 26 (1): 98–117. doi:10.1109/MSP.2008.930649. ISSN 1053-5888.
- ↑ 2.0 2.1 2.2 2.3 2.4 2.5 2.6 2.7 Wang, Zhou; Bovik, A.C.; Sampat, Mehul; Gupta, Shalini; Markey, Mia (2009-06-23). "Complex Wavelet Structural Similarity: A New Image Similarity Index". IEEE Transactions on Image Processing. 18 (11): 2385–2401. Bibcode:2009ITIP...18.2385S. doi:10.1109/TIP.2009.2025923. ISSN 1057-7149. PMID 19556195.
- ↑ Wang, Zhou; Bovik, A.C.; Markey, Mia (2005-03-23). "Translation insensitive image similarity in complex wavelet domain". IEEE Transactions on Image Processing. 18 (11): 2385–2401. doi:10.1109/TIP.2009.2025923. ISSN 1520-6149. PMID 19556195.
- ↑ Kong, A.W.-K.; Zhang, D. (2004-08-24). "Competitive coding scheme for palmprint verification". IEEE Transactions on Image Processing. 1: 520–523. doi:10.1109/ICPR.2004.1334184. ISBN 0-7695-2128-2. ISSN 1051-4651.
- ↑ Brooks, Alan; Zhao, Xiaonan; Pappas, Thrasyvoulos (2008-07-28). "Structural similarity quality metrics in a coding context: exploring the space of realistic distortions". IEEE Transactions on Image Processing. 17 (8): 1261–1273. Bibcode:2008ITIP...17.1261B. doi:10.1109/TIP.2008.926161. ISSN 1057-7149. PMID 18632337.
This article "Complex Wavelet SSIM" is from Wikipedia. The list of its authors can be seen in its historical and/or the page Edithistory:Complex Wavelet SSIM. Articles copied from Draft Namespace on Wikipedia could be seen on the Draft Namespace of Wikipedia and not main one.