Plaque It!
|
[0001] 1. Field of the Invention
[0002] The invention relates to image noise reduction techniques primarily operable in real-time by apparatus and methods for reducing the correlated noise in an image or a sequence of images. More particularly, the invention relates mainly to spatial adaptive techniques for mosquito noise reduction in Discrete Cosine Transform (DCT) based decoded image applications.
[0003] 2. Description of the Prior Art
[0004] Recently, many international standards for still image and video compression such as the ITU-T H261, H263, and the ISO JPEG, MPEG-1, MPEG-2 standards have mainly proposed the block based Discrete Cosine Transform (DCT) as a possible compression technique.
[0005] At low and moderate bit rates, block-based DCT coding artifacts become perceptible. Such artifacts are known as mosquito noise or ringing noise occurring around edges within an image or near a smooth zone as well as the blocking effect. For still pictures or still parts of image, the blocking effect is dominant and visible in smooth regions. For dynamic video sequences, mosquito noise becomes more evident for the human vision system (HVS) than the blocking effect.
[0006] There are many existing techniques for blocking effect reduction. In H. Reeve and J. Lim, “Reduction of blocking effects in image coding”, Optical Engineering, vol. 23, January/February 1984, pp. 34-37, the authors teach the systematical use of low-pass filters applied at block boundary. Low pass filtering is utilized also in U.S. Pat. No. 5,850,294 to Apostolopoulos et al. for blocking artifact reduction purposes. However, the blocks that potentially exhibit block artifacts are detected in the DCT domain and low-pass filtering is applied only for the distorted blocks. In B. Ramamurthi and A. Gersho, “Nonlinear Space-variant post processing of block coded images”, IEEE Transactions on Acoustics, Speech and Signal Processing, vol. ASSP-34, October 1986, pp. 1258-1268, the proposed adaptive filtering is based on the detection of edge orientation at each block boundary pixel. Many authors, as in, for instance, A. Zakhor, “Iterative Procedure for Reduction of Blocking Effects in Transform Image Coding”, IEEE Transactions on Circuits and Systems for Video Technology, vol. 2, No.1, March 1992, pp. 91-95, have proposed various multi-pass procedure techniques for this purpose. The iterative techniques can provide potentially a higher performance than the non-iterative ones, but are less attractive for real time processing.
[0007] For mosquito noise artifact reduction (MNR), in U.S. Pat. No. 5,610,729, Nakajima teaches an estimation of block mean noise using the quantization step and the I, P, B coding mode when these data are available from the compressed bit stream. Nakajima teaches also the use of the well-known Minimum Mean Square Error (MMSE) filter proposed originally by J. S. Lee in “Digital image enhancement and noise filtering by use of local statistics”, IEEE Transactions on PAMI-2, March 1980, pp. 165-168, for artifact reduction. However, in many applications, the quantization step or the coding mode is not necessary known or accessible. Moreover, while the MMSE filter is efficient for edge reservation, it is not necessary for noise reduction near an edge.
[0008] In U.S. Pat. No. 5,754,699, Sugahara proposes a similar approach by using block quantization step size information for noise power estimation and an empiric coring technique for artifact filtering.
[0009] Also for MNR, in U.S. Pat. No. 5,850,294, Apostolopoulos et al. propose a filtering on the true non-edge pixels within blocks containing edge pixels rather than smoothing the edge pixels, to avoid eventual blur and picture sharpness loss due to true edge filtering. However, the filtering technique for non-edge pixels is not clearly specified.
[0010] In a same manner, in U.S. Pat. No. 5,852,475, Gupta et al. apply separable low pass filters only on portions of an image that are not part of an edge and are not part of areas of texture or fine detail. The proposed post processor contains also a look up table based temporal digital noise reduction unit for reliable edge detection. For the chrominance signals Gupta et al. teach the use of simple low pass filtering. U.S. Pat. No. 5,920,356 to Smita et al. is an ameliorated version of U.S. Pat. No. 5,852,475 in which the filtering is controlled by a coding parameter of the replenished macro-blocks.
[0011] In U.S. Pat. No. 6,064,776 to Kikuchi et al., in a similar manner, a given block is classified according to whether it is considered part of a flat domain or not. If a block is considered as part of a flat domain, block pixel correction is then given by an AC component prediction technique.
[0012] In U.S. Pat. No. 6,188,799, Tan et al. teach the use of separable low-pass filtering, when block boundaries are located, for a serial reduction of blocking effect and then, mosquito noise. For detected blocking effect, the pixels are firstly corrected by a proposed modified version of bilinear interpolation and secondly, by a mean value of homogenous neighboring pixels within the quantization step size.
[0013] The present invention provides an apparatus and method for efficiently reducing noise in a block-based decoded image signal.
[0014] According to an aspect of the present invention, there is provided an apparatus for reducing noise in a block-based decoded image signal including a luminance component. The apparatus comprises an image region classifier responsive to said luminance component for analyzing each luminance pixel value of the luminance component according to a corresponding luminance pixel spatial context in a same frame of said image signal to classify the luminance pixel in a selected one of a plurality of predetermined image region classes associated with distinct image region spatial characteristics and to generate a corresponding selected region class indicative signal. The apparatus further comprises a shape-adaptive luminance noise power estimator responsive to said luminance component and said selected region class indicative signal for estimating statistical characteristics of said luminance pixel by using local window segmentation data associated with the luminance pixel, to generate a corresponding luminance noise power statistical characteristics indicative signal; and a shape-adaptive luminance noise reducer for filtering said luminance component according to said luminance noise power statistical characteristics indicative signal. Conveniently, the distinct image region spatial characteristics include edge, near edge, flat, near flat and texture spatial characteristics. Preferably, the block-based decoded image signal further includes first and second chrominance components, and the apparatus further comprises a shape-adaptive chrominance noise power estimator responsive to said chrominance components and said selected region class indicative signal for estimating statistical characteristics of first and second chrominance pixels associated with said luminance pixel by using local window segmentation data associated with each said chrominance pixel to generate a corresponding chrominance noise power statistical characteristics indicative signal; and a shape-adaptive chrominance noise reducer for filtering each said chrominance component according to said corresponding chrominance noise power statistical characteristics indicative signal.
[0015] According to a further aspect of the present invention, there is provided a method for reducing noise in a block-based decoded image signal including a luminance component. The method comprises the steps of: i) analyzing each luminance pixel value of said luminance component according to a corresponding luminance pixel spatial context in a same frame of said image signal to classify the luminance pixel in a selected one of a plurality of predetermined image region classes associated with distinct image region spatial characteristics and to generate a corresponding selected region class indicative signal; ii) estimating, from said luminance component and said selected region class indicative signal, statistical characteristics of said luminance pixel by using shape-adaptive local window segmentation data associated with the luminance pixel, to generate a corresponding luminance noise power statistical characteristics indicative signal; and iii) filtering said luminance component according to said luminance noise power statistical characteristics indicative signal. Conveniently, the distinct image region spatial characteristics include edge, near edge, flat, near flat and texture spatial characteristics. Preferably, the block-based decoded image signal further includes first and second chrominance components and, method further comprises the steps of: iv) estimating, from said chrominance components and said selected region class indicative signal, statistical characteristics of first and second chrominance pixels associated with said luminance pixel by using shape-adaptive local window segmentation data associated with each said chrominance pixel to generate a corresponding chrominance noise power statistical characteristics indicative signal; and v) filtering each said chrominance components according to said corresponding chrominance noise power statistical characteristics indicative signal.
[0016] According to a further aspect of the present invention, there is provided an apparatus and method for post-processing a decompressed image signal to reduce spatial mosquito noise therein. In particular, the post processor calls for an image multiple region segmentation, region noise power estimations for respectively luminance and chrominance signal components, and their associated adaptive noise corrections.
[0017] In segmenting an image into regions, the inventive apparatus and method employ edge/no-edge detectors and simple binary consolidation operators to classify and reinforce detected Edge (E), Near-Edge regions (NE), Flat regions (F), Near-flat regions (NF) and finally Texture (T) regions. The preferred segmentation is based essentially on the following observations: First, almost strong mosquito noise is found not only in NE regions but also in NF regions; second, some important noise is also noticeable in picture edges; third, texture masks mosquito noise; and fourth, any excessive filtering in texture or flat regions will degrade eventually fine signal details.
[0018] In estimating local noise power of the luminance component of the image signal, the inventive apparatus and method consider the diagonal high frequency component of the decoded image. The local noise power estimator comprises a local variance calculator that considers only local similar pixels to the current one, a look up table (LUT) for a conversion from observed diagonal high frequency component power to equivalent additive noise power. The noise power estimator also comprises a noise power weighting for each classified region and finally a low-pass filter for smoothing the variation of estimated local noise power between regions. Thus, the proposed method permits different smoothing degree for each segmented region and region transition to ensure resulting image quality.
[0019] For noise correcting, the proposed apparatus and method are based on a shape adaptive local segmented window that considers only the similar intensity pixels to the current one for the local mean and local standard deviation estimations. For reliable window segmentation, a diamond shape two-dimensional (2D) low pass filter is preferably required for the local adaptive windowing. The noise corrector further comprises a gain calculator in order to minimize the Mean Square Error (MMSE) for given local signal mean, local signal power and local additive noise power. The combination of local shape adaptive windowing and MMSE constitutes a noise corrector working on all of the above-cited classified regions.
[0020] It is worthwhile to mention that the proposed mosquito noise filtering also partly reduces the blocking effect.
[0021] From another broad aspect of the present invention, there is also provided an adaptive apparatus and method for noise power estimation and noise correction for the chrominance components which are severely damaged at low bit rate in a decoded video signal. In estimating local noise power in each chrominance component, the proposed method is similar to luminance component processing. However, in the chrominance case, the region classification is not required. In other words, there is only a single region for the whole image. For noise correcting of the chrominance component, the above luminance-based shape adaptive windowing and the MMSE technique are both utilized in a similar manner to the luminance case. Of course, considering the chrominance-sampling rate requires the use of suitable interpolation and decimation techniques for the chrominance signals.
[0022] Embodiments of the present invention will be now described with reference to the accompanying drawings, in which:
[0023]
[0024]
[0025]
[0026]
[0027]
[0028]
[0029]
[0030]
[0031] Referring now to the drawings,
[0032] MNR apparatus
[0033] MNR apparatus
[0034] Image Region Classifier (RC)
[0035] Region map signal
[0036] LU-REBNE
[0037] Noise local standard deviation signal
[0038] Chrominance Cu/Cv signals
[0039] Finally, chrominance noise local standard deviation signal
[0040] As is understood by persons of ordinary skill in the art, appropriate delays for signal synchronization required by the various operations of MNR apparatus
[0041] Referring now to
[0042] A decoded noisy luminance signal Y
[0043] in which the couple (i, j) represents the current coordinates (line, column) of the central and considered pixel. The filter output
[0044] Each of the Sobel masks
[0045] For edge detection, the four (4) absolute value detector outputs
[0046] Let in(i, j) and out(i, j) denote respectively the input and the output of the operator at the coordinates (i, j) of the current pixel. Let W is the local window domain. The operator output is given by:
[0047] The second operator
[0048] In the above equation, in(i, j) and out(i, j) are respectively again the input and the output of the considered operator and W is the local window domain. In other words, if the count of “1” numbers in the window is smaller than or equal to a threshold, then the operator output is “0”; otherwise, the output remains unchanged. In the preferred embodiment, the window dimension is 3×3 and the ROC threshold at
[0049] In order to determine a Near Edge (NE) region, the detected edge map signal
[0050] BAOC output
[0051] For a flat region detection, the four (4) absolute value detector outputs
[0052] In this equation, signal YF
[0053] It is interesting to note that omitting the chrominance components in Equation (6) yields a possible simplified, but less efficient version for Flat region consolidation.
[0054] The Flat region map signal
[0055] The Texture (T) region in the present embodiment is computed as NOT all of the four (4)-detected regions: E, NE, F and NF. The Texture region map signal
[0056] Finally, combining together the five above region maps by the classification block
[0057] Referring now to
[0058] First of all, it can be frequently observed that there is no important signal component in a diagonal high frequency spatial domain. It is thus reasonable to use a diamond shape filter for noise power estimation. Let the noisy decoded luminance signal Y
[0059] The high pass filter output
[0060] A standard deviation estimator, such as
[0061] In Equation (10), g(i, j), μ(i, j) and σ(i, j) are respectively the estimator input, the internal local mean and the estimated local SD output. Moreover, depending on the anticipated noise distribution the constant C can be chosen in accordance with equal to 1.25 appropriately for additive Gaussian noise, to 1.15 for additive uniform noise or, simply omitted. In the preferred embodiment, the window dimension is chosen as 5 lines×11 columns. For the high frequency signal, the local mean μ(i,j) can be set to zero in Equation (10).
[0062] The SD estimator
[0063] wherein the unknown additive noise variance σ
[0064] Referring now to
[0065] or
[0066] The estimation (12) or (13) can be done then on an off-line basis by a data storage and estimation device
[0067] It is worthwhile to note that, in the present embodiment, the noise contribution on Edge pixel is considered as important as the noise contribution on Near-Edge or Near-Flat regions. Such noise will be heavily filtered in these three regions. Inversely, the filtering in Texture region should be sufficiently light enough, since texture already masks noise. Finally, in Flat regions, noise is relatively small and nearly random; excessive filtering will degrade eventually fine but visible signal details.
[0068] In order to smooth the region transitions, the weighting function output signal
[0069] The filter output signal σ
[0070] Referring now to
[0071] Thus, the binary signals w designate a homogenous region, within a threshold tolerance, to the current pixel located at (i,j). The local window therefore becomes shape-adaptive. The threshold value is applied at
[0072] In order to provide efficient estimation of the two first order signal statistics, the window binary signals
[0073] Finally, in order to provide efficient noise reduction in a varying environment of picture signal, such as edge regions, a MMSE coring technique is given by a gain calculator
[0074] A possible simplified version of Equation (17), at the expense of heavier signal reduction, is an MMSE-like coring defined as:
[0075] Finally, the filtered output luminance signal Y*(i,j) at
[0076] using a first adder
[0077] Referring now to