ISSN ONLINE(2278-8875) PRINT (2320-3765)

All submissions of the EM system will be redirected to Online Manuscript Submission System. Authors are requested to submit articles directly to Online Manuscript Submission System of respective journal.

Integrated Approach for Speech Compression Using DWT & DCT

Hemant Amhia1, Ratish Kumar2
  1. Prof. at Jabalpur Engineering College, Jabalpur engineering college, Jabalpur, M.P, India
  2. PG stud [control system], Jabalpur Engineering College, Jabalpur, M.P, India
Related article at Pubmed, Scholar Google

Visit for more related articles at International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering

Abstract

With the growth of multimedia technology over past decades, the demand for digital information increase dramatically. The enormous demand poses difficulties in handling speech compression. Speech compression is a mature technology with many applications. To overcome this problem is to compress the information by removing redundancies present in it. Lossy compression scheme that is often used to compress information such as speech signals. This paper presents a method of transformation for the compression of speech signal. In this paper a new lossy algorithm to compress speech signal using discrete wavelet transform (DWT) and then again compressed by discrete cosine transform (DCT) then decompressed it by discrete cosine transform afterward decompressed by discrete wavelet transform to retrieve the original signal in compressed form. To measure the performance of speech signal on the basis of signal to noise ratio (SNR) and mean square error (MSE) by using different filter of wavelet families.



 

Keywords

DWT, DCT, SPEECH COMPRESSION & DECOMPRESSION, PSNR, MSE.

INTRODUCTION

Speech is a very basic way for humans to convey information to one another. With a bandwidth of 400Hz to 4kHz, speech can carry information with different colour of emotion of a human voice. Now a day’s people are able to hear someone’s voice from anywhere in the world-as if the person would be in the same location and they are talking in face to face. Speech can be defined such as the response of the vocal tract to one or more excitation signals.Compression of signals is based on removing the redundancy between neighbouring samples and/or between the adjacent cycles. In data compression, it is desired to represent data by as small as possible number of coefficient within an acceptable loss of visual quality. Compression techniques can be classified into one of two main categories: lossy and lossless.
Compression methods can be classified into three functional categories:
Direct Methods: The samples of the signal are directly handled to provide compression. In transformation method the signal is transformed into frequency domain. In transform method we use Discrete wavelet transform and Discrete cosine transform. Transformation techniques don’t compress the signal, they provide information they provide information about the signal and using various encoding technique compression of signal.
Transformation Methods: Such as Fourier Transform (FT). Wavelet Transform (WT), and Discrete Cosine Transform (DCT).Parameter Extraction Methods: A pre-processor is employed to extract some features that are later used to reconstruct the signal.
A discrete wavelet transform is defined as a “small wave” that has its energy concentrated in time to provide a tool for the analysis of transient, non-stationary or time-varying phenomena. It has the oscillating wave like properties but also has the ability to allow simultaneous time and frequency analysis. Wavelet Transform has emerges as a powerful mathematical tool in many areas of science and engineering more so in the field of audio compression. Wavelet transform decomposes a signal into a set of basic functions these basis functions are called wavelets. Wavelet are obtained from a single prototype wavelet y (t) called mother wavelet by dilation and shifting.
image
Where “a” is the scaling parameter and “b” is the shifting parameter.
The wavelet transform is computed separately for different segment of the time-domain signal at different frequencies giving different frequencies. Multi-resolution analyzes (MRA): analysis of signal at different resolution. MRA is designed to give good time’s resolution and poor frequency resolution at high frequencies and good frequency resolution and poor time resolution at low frequencies. Good for signal having high frequency components for short duration and low frequency components for long duration e.g. images, video frames, speech signal.
Discrete Cosine Transform (DCT) is very common when encoding video and audio tracks on computers. The DCT can also be used to analysis the spectral components of images as well. To DCT is very similar to the DFT, except the output values are all real numbers and the output vector is approximately twice as long DFT output. It expresses a sequence of finite data points in terms of sum of cosine functions.
The 1-D Discrete Cosine Transforms
image
Where N is the length of x, and x and y are of same size. If x is a matrix, DCT transform its columns. The series is indexed from n = 1 and k = 1 instead of the usual n=0 and k=0 because MATLAB vectors run from 1 to N instead of 0 to N-1.
It often reconstructs a sequence very accurately from only few DCT coefficients, a useful property for application requiring data reduction. But our effort is to store the compressed signal to smaller bit size for better compression as well as implementation.

IMPLEMENTATION

In this paper hybrid model of compression and decompression by using DWT and DCT technique is used for the recorded signal. A different approach is being used for better compressed signal as well as less degraded signal is being recovered.
First set the compression and decompression at zero level flag. Then set the compression ratio i.e. up to what level of compression is required. By using the frame size of the signal we can estimate the size (length) of the signal to be compressed. After the transformation of the signal, we arrange the coefficient so that higher value of frequency of the signal is not eliminated. Whereas we are move the unnecessary zeros to other location. This provides a great help in compression with better result of MSE and PSNR.
In the above figure the signal which is used to compress using DWT then by DCT with different compression ratio generating the compressed signal. After the compression by removing the distortion or unnecessary coefficient the signal is reconstructed to the original form by taking inverse transform after adding the zeros which were generated in DWT & DCT. In method of compression the signal doesn’t lose any of its information.
In this paper DB4 filter of wavelet transform is used to compress the signal. Now take the frame size of the signal to be 10 sec to analysis the different compression ratio in order to attain compressed signal with better result.

CONCLUSION

Speech compression is used for transmission and storage. The speech compression is achieved by representing each sample of digitized data by lesser number of bits this paper shows the key advantageous features of the Wavelet filters in the field of speech Signal processing. It is found that the wavelet filters significantly improves the reconstruction or fidelity assessments of the compressed speech signal. The SNR and MSE of signal show the high efficiency of the compression it is concluded that it can be very effectively used for the speech signal compression.

RESULT

Speech compression is achieved by using DB4 filter of DWT and DCT transform. In table 1 shows the MSE, SNR, compression time, decompression time of the given speech signal of frame size as10sec. Figure 4 to 7 shows the compressed signal with compression ratio varying from 2 to 10 gives us the better result in terms of reconstructed speech signal.

Tables at a glance

Table icon
Table 1

Figures at a glance

Figure 1 Figure 2 Figure 3 Figure 4
Figure 1 Figure 2 Figure 3 Figure 4
Figure 1 Figure 2 Figure 3 Figure 4
Figure 5 Figure 6 Figure 7 Figure 8

References