Acquiring Audio Data in Data Structure,s.

ACQUIRING AUDIO DATA UNIT II

ACQUIRING AUDIO DATA Obtaining or collecting of sounds. And there are many formats in acquiring of audio.

METHOD OF ACQUIRING. COLLECT DATA SIGNALING PROCESS PROCESSED BY CS USERS CAN BE USED

FORMATS OF AUDIO.

TYPES OF AUDIO. 1. LOSSY AUDIO : - loss of data and quality. 2. LOSSLESS AUDIO : - original audio copy.

TYPES WITH EXAMPLES.

LOSSY AUDIO : EXAMPLES : 1. MP3 –Developed by MPEG(Moving pictures experts group). - Compressed lossy audio format . - Max 320 kpbs. Smaller file size , faster file transfer , less space.

LOSSY AUDIO : EXAMPLES : 2.WMA – Designed by MS. - Compressed lossy audio format . - consumes less space.

LOSSLESS AUDIO : EXAMPLES : 1. WAV – Original , uncompressed , over space. - designed by MS. 2. FLAC – Designed by Xiph.org. - Royality free music license , open audio format.

OTHER FORMATS :

Data Storage Hardware

Bits and bytes A computer only understands the numbers or 1 , or whether a switch is on or off . We call those 1’s and 0’s ‘ bits ’ – binary digits . A byte (made up of 8 bits) is enough computer memory to store a single character of data (e.g. the letter F). The computer uses a code to understand what each bit pattern means. Using the ASCII code, for instance, the letter F is 70 and has a bit pattern of 01000110.

A merican S tandard C ode for I nformation I nterchange (askee) is a code which represents English characters as numbers. Each letter is assigned a number. For example, A = 65. ASCII Most computers use ASCII codes. This makes it possible to transfer data from one computer to another by changing the ASCII code into a binary pattern. ASCII for Capital Letters 65 A 78 N 66 B 79 O 67 C 80 P 68 D 81 Q 69 E 82 R 70 F 83 S 71 G 84 T 72 H 85 U 73 I 86 V 74 J 87 W 75 K 88 X 76 L 89 Y 77 M 90 Z

ASCII for Capital Letters 65 A 78 N 66 B 79 O 67 C 80 P 68 D 81 Q 69 E 82 R 70 F 83 S 71 G 84 T 72 H 85 U 73 I 86 V 74 J 87 W 75 K 88 X 76 L 89 Y 77 M 90 Z

Decimal and binary When we write numbers in the decimal system, we write them in columns. Each column is 10 times bigger than the one before (right to left). So 1010 is 1000 plus 10 = 1010. Tens of millions millions 100000s 10000s 1000s 100s 10s 1s 1 1 11010 would be 10000 plus 1000 plus 10 = 11010. Tens of millions millions 100000s 10000s 1000s 100s 10s 1s 1 1 1

In the binary system, everything is based on 2s, not 10s, so each column is twice as big as the one before. So 1010 in binary is 8 plus 2 = 10. 128s 64s 32s 16s 8s 4s 2s 1s 1 1 11010 would be 16 plus 8 plus 2 = 26. 128s 64s 32s 16s 8s 4s 2s 1s 1 1 1

Binary patterns Using the binary system, convert the ASCII code into the binary pattern. The first row has been completed for you.

All computer data is stored in binary form. This does not only include text but images, sounds and movies as well. The more complex the data the more memory is used to store it. I take up more space than you.

The amount of data stored is measured in kilobytes ( KB ). 1 megabyte ( MB ) is 1,000 KB ( 2 20 ) 1 gigabyte ( GB ) is 1,000 MB ( 2 30 ) 1 terabyte ( TB ) is 1,000 GB ( 2 40 ). Confusingly, 1KB is actually 1,024 bytes ( 210 ), not 1,000 as you might expect, but most people think in multiples of 1,000.

When a computer is first switched on, it needs to load up the BIOS (Basic Input/Output System) and basic instructions for the hardware. These are stored in ROM ( R ead O nly M emory). This type of memory is called non-volatile because it retains the data . Data stored in ROM remains there even when the computer is switched off. ROM can be found on the motherboard. Read Only Memory (ROM)

When a computer is started operating instructions, any computer programs that are opened and data are stored in the RAM ( R andom A ccess M emory) temporarily . When the computer is switched off all the data is cleared from the RAM. This type of memory is called volatile because it only stores the data while the computer is switched on . RAM sticks are found on the motherboard. The contents of RAM are constantly rewritten as the data is processed. Random Access Memory (RAM)

There are three types of storage device : Storage devices and media The medium is what the data is actually stored on. Examples of media include floppy disks , CD-ROM and zip disks . those that store data by magnetizing a special material that coats the surface of a disk, and others that store data using optical technology to etch the data onto a plastic-coated metal disk. Laser beams are then passed over the surface to read the data. Finally, there are solid state devices, such as memory sticks. We will look at these in more detail later on.

Fixed storage

Removable Storage

Summary Data is stored using binary code (0 and 1). Computer memory is measured in kilobytes . Read Only Memory ( ROM ) is non-volatile because it cannot be changed . Random Access Memory ( RAM ) is volatile because it only works when the computer is switched on . There are two types of storage devices; those that use magnetic media and others that use optical media . Different types of media have different storage capacities . Storage devices can also be divided into those that are fixed and those that are removable .

Image Characteristics Image Digitization Spatial domain Intensity domain 27 Image Acquisition

28 What is an Image ? An image is a projection of a 3D scene into a 2D projection plane . An image can be defined as a 2 variable function f(x,y): R 2 →R , where for each position (x,y) in the projection plane, f(x,y) defines the light intensity at this point.

Image as a function 29

30 i(x,y) r(x,y) f(x,y)=i(x,y)⋅r(x,y) g(i,j) Image Acquisition pixel=picture element

Acquisition System 31 World Camera Digitizer Digital Image CMOS sensor

Image Types Three types of images: Binary images g(x,y) ∈ {0 , 1} Gray-scale images g(x,y) ∈ C typically c={0,…,255} Color Images three channels: g R (x,y)∈C g G (x,y)∈C g B (x,y)∈C 32

Gray Scale Image 33 210 209 204 202 197 247 143 71 64 80 84 54 54 57 58 206 196 203 197 195 210 207 56 63 58 53 53 61 62 51 201 207 192 201 198 213 156 69 65 57 55 52 53 60 50 216 206 211 193 202 207 208 57 69 60 55 77 49 62 61 221 206 211 194 196 197 220 56 63 60 55 46 97 58 106 209 214 224 199 194 193 204 173 64 60 59 51 62 56 48 204 212 213 208 191 190 191 214 60 62 66 76 51 49 55 214 215 215 207 208 180 172 188 69 72 55 49 56 52 56 209 205 214 205 204 196 187 196 86 62 66 87 57 60 48 208 209 205 203 202 186 174 185 149 71 63 55 55 45 56 207 210 211 199 217 194 183 177 209 90 62 64 52 93 52 208 205 209 209 197 194 183 187 187 239 58 68 61 51 56 204 206 203 209 195 203 188 185 183 221 75 61 58 60 60 200 203 199 236 188 197 183 190 183 196 122 63 58 64 66 205 210 202 203 199 197 196 181 173 186 105 62 57 64 63 x = 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 y = 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55

Color Image 34

Notations Image Intensity - Light energy emitted from a unit area in the image Device dependence Image Brightness - The subjective appearance of a unit area in the image Context dependence Subjective Image Gray-Level - The relative intensity at each unit area Between the lowest intensity (Black value) and the highest intensity (White value) Device independent 35

Intensity vs. Brightness 36

37 Intensity Δ f 1 Δ f 2 f2 f1 f 1 < f 2 , Δ f 1 = Δ f 2 Equal intensity steps: Equal brightness steps: Intensity vs. Brightness

Weber Law Describe the relationship between the physical magnitudes of stimuli and the perceived intensity of the stimuli. In general, Δf needed for just noticeable difference (JND) over background f was found to satisfy: 38 Brightness ∝ log( f )

What about Color Space? JND in XYZ color space was measured by Wright and Pitt, and MacAdam in the thirties MacAdam ellipses: JND plotted at the CIE-xy diagram Conclusion: measuring perceptual distances in the cie-XYZ space is not a good idea

Perceptually Uniform Color Space Most common: CIE-L*a*b* (CIELAB) color space. L* represents luminance. a* represents the difference between green and red, and b* represents the difference between yellow and blue.

Perceptually Uniform Color Space XYZ to CIELAB conversion: where (X ,Y ,Z ) are the XYZ values of a reference white point

100 100 100 100 100 100 0 0 0 100 100 0 0 0 100 100 0 0 0 100 100 100 100 100 100 Digitization Two stages in the digitization process: Spatial sampling : Spatial domain Quantization : Gray level 42 f x y 1 2 3 4 5 1 2 3 4 5 f(x,y) Continuous Image Digital Image g(i,j) ∈ C j i

Spatial Sampling When a continuous scene is imaged on the sensor, the continuous image is divided into discrete elements - picture elements (pixels)

Spatial Sampling

x Sampling The density of the sampling denotes the separation capability of the resulting image Image resolution defines the finest details that are still visible by the image We use a cyclic pattern to test the separation capability of an image

Sampling Rate

1D Example: Nyquist Frequency Nyquist Rule : To observe details at frequency f (wavelength d) one must sample at frequency > 2f (sampling intervals < d/2) The Frequency 2f is the Nyquist Frequency . Aliasing : If the pattern wavelength is less than 2d erroneous patterns may be produced.

Aliasing - Moiré Patterns

Temporal Aliasing

Temporal Aliasing Example

Image De-mosaicing Can we do better than Nyquist?

Image De-mosaicing Basic idea: use correlations between color bands

Quantization Choose number of gray levels (according to number of assigned bits) Divide continuous range of intensity values

Quantization

Quantization 8 bits image 4 bits image Low freq. areas are more sensitive to quantization

10 20 30 40 50 60 70 80 90 100 2 4 6 8 10 How should we quantize an image? Simplest approach: uniform quantization Gray-Level Sensor Voltage Z Z 1 Z 2 Z 3 Z 4 Z k-1 Z k . . . . q q 1 q 2 q 3 q k-1 . . . . . . . . sensor voltage quantization level

Non-uniform Quantization Quantize according to visual sensitivity (Weber’s Law) Non uniform sensor voltage distribution Z 7 Z 6 Z 5 Z 4 Z 3 Z 1 Z Z 2 q 6 q 5 q 3 q 2 q q 4 q 1 Low Visual Sensitivity High Visual Sensitivity

Optimal Quantization (Lloyd-Max) Content dependant Minimize quantization error q q 1 q 2 q 3 sensor voltage quantization level Z Z 1 Z 2 Z 3 Z 4

Optimal Quantization (Lloyd-Max) Also known as Loyd-Max quantizer Denote P(z) the probability of sensor voltage The quantization error is : Solution: Iterate until convergence (but optimal minimum is not guaranteed).

Example 8 bits image 4 bits image Uniform quantization 4 bits image Optimal quantization

Common color resolution for high quality images is 256 levels for each Red , Greed , Blue channels, or 256 3 = 16777216 colors. How can an image be displayed with fewer colors than it contains? Select a subset of colors (the colormap or pallet) and map the rest of the colors to them. from: Daniel Cohen-Or Color Quantization

With 8 bits per pixel and color look up table we can display at most 256 distinct colors at a time. To do that we need to choose an appropriate set of representative colors and map the image into these colors from: Daniel Cohen-Or 126 14 111 36 36 111 36 111 5 12 12 17 17 111 14 126 17 36 12 111 36 36 200 12 14 126 17 36 36 111 12 14 36 36 200 12 Color Quantization

from: Daniel Cohen-Or 2 colors 256 colors 16 colors 4 colors Color Quantization

Color Quantization Naïve (uniform) Color Quantization 24 bit to 8 bit: Retaining 3-3-2 most significant bits of the R,G and B components. false contours from: Daniel Cohen-Or

Median Cut R G B

Median Cut from: Daniel Cohen-Or

Color_MedCut (Image, n){ For each pixel in Image with color C, map C in RGB space; B = {RGB space}; While (n-- > 0) { L = Heaviest (B); Split L into L1 and L2; Remove L from B, and add L1 and L2 instead; } For all boxes in B do assign a representative (color centroid); For each pixel in Image do map to one of the representatives; } The median cut algorithm from: Daniel Cohen-Or

Better Solution from: Daniel Cohen-Or

Generalized Lloyed Algorithm (GLA) p i from: Daniel Cohen-Or

Color_GLloyd(Image, K) { - Guess K cluster centre locations - Repeat until convergence { - For each data point finds out which centre it’s closest to - For each centre finds the centroid of the points it owns - Set a new set of cluster centre locations - optional: split clusters with high variance } } The GLA algorithms aims at minimizing the quantization error:

8 bit 4 bit 24 bit from: Daniel Cohen-Or

More on Color Quantization Observation 1 : Distances and quantization errors measured in RGB space, do not relate to human perception. Solution : Apply quantization in perceptually uniform color space (such as CIELAB).

More on Color Quantization Original RGB Quantization Lab Quantization

More on Color Quantization Observation 2 : Quantization errors are spatially dependent: we are more sensitive to errors at lower spatial frequencies. 1 3 10 30 100 Sensitivity Spatial Frequency

More on Color Quantization Solution : Assign weight for each pixel color Using this scheme we minimize: 50 100 150 200 250 100 200 300 50 100 150 200 250 W w W w W W

Original Standard quantization Weighted quantization

Acquiring Audio Data in Data Structure,s.

About This Presentation

Slide Content

Tags

Categories

Download

Quick Actions

Statistics

Related Slideshows

Acquiring Audio Data in Data Structure,s.

About This Presentation

Slide Content

Slide 1

Slide 2

Slide 3

Slide 4

Slide 5

Slide 6

Slide 7

Slide 8

Slide 9

Slide 10

Slide 11

Slide 12

Slide 13

Slide 14

Slide 15

Slide 16

Slide 17

Slide 18

Slide 19

Slide 20

Slide 21

Slide 22

Slide 23

Slide 24

Slide 25

Slide 26

Slide 27

Slide 28

Slide 29

Slide 30

Slide 31

Slide 32

Slide 33

Slide 34

Slide 35

Slide 36

Slide 37

Slide 38

Slide 39

Slide 40

Slide 41

Slide 42

Slide 43

Slide 44

Slide 45

Slide 46

Slide 47

Slide 48

Slide 49

Slide 50

Slide 51

Slide 52

Slide 53

Slide 54

Slide 55

Slide 56

Slide 57

Slide 58

Slide 59

Slide 60

Slide 61

Slide 62

Slide 63

Slide 64

Slide 65

Slide 66

Slide 67

Slide 68

Slide 69

Slide 70

Slide 71

Slide 72

Slide 73

Slide 74

Slide 75

Slide 76

Slide 77