
2011

100032

TTSA



(Abstract)
A Study on Voice Source Localization and its Application
Our research was inspired by the seemingly magical drum-beating skill of voice
source localization shown in the movie House of Flying Daggers. To explore the
theory of voice source localization, we recorded a sound with a pair of
microphones and measured the difference in its arrival time at the two
microphones. This procedure confirmed that voice source localization is
feasible. Applying the theory, we combined an ordinary computer sound card with
inexpensive sound-wave sensors (piezo buzzers) to build a system that displays
the point of impact on a target immediately. Practice and testing show that our
method can read the time difference between the left and right channels directly
from the sound data and immediately display the position of the point of impact.
On a 48 cm x 48 cm target, the average error is 1.18 cm, about 2.5% of the
target's side length.



BB

GoldWave 5.06, Visual Basic 6

3.5mm Jack
2mm 3mm
1

1-1

1-1

1-2

1-2

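Points whose distances to the two microphones $m_L$ and $m_R$ differ by a constant lie on a hyperbola with the microphones at its foci:

$$\frac{x^2}{a^2} - \frac{y^2}{b^2} = \frac{x^2}{a^2} - \frac{y^2}{c^2 - a^2} = 1$$

$$a = \tfrac{1}{2}\, v\, \Delta t, \qquad \Delta t = t_L - t_R, \qquad c = \frac{\overline{m_L m_R}}{2}$$

The direction of the source follows from the asymptote of the hyperbola:

$$\cos\theta = \frac{a}{c} = \frac{2a}{2c} = \frac{v\,(t_L - t_R)}{\overline{m_L m_R}} = \frac{v\,\Delta t}{\overline{m_L m_R}}, \qquad \theta = \cos^{-1}\frac{v\,\Delta t}{\overline{m_L m_R}} \qquad [1]$$

$\Delta t > 0$: $0^\circ < \theta < 90^\circ$; $\Delta t < 0$: $90^\circ < \theta < 180^\circ$; $\Delta t = 0$: $\theta = 90^\circ$.

The source position $(x, y)$ satisfies

$$\frac{x^2}{\left(\tfrac{1}{2} v \Delta t\right)^2} - \frac{y^2}{\left(\overline{m_L m_R}/2\right)^2 - \left(\tfrac{1}{2} v \Delta t\right)^2} = 1 \qquad [2]$$

$\Delta t > 0$: $x > 0$; $\Delta t < 0$: $x < 0$; $\Delta t = 0$: the source lies on the y-axis.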

[ 1]
mL mR
3

(1)

Goldwave
192,000 Hz, 96,000 Hz, 44,100 Hz, 22,050 Hz, 11,025 Hz

192,000 Hz, GoldWave
0.107% at 192,000 Hz

2-1

2-2

2-3 GoldWave

Source frequency    Measured in GoldWave    Error
760 Hz              758.21 Hz               0.235%
350 Hz              349.34 Hz               0.187%
5517 Hz             5517.24 Hz              0.004%
512 Hz              512.82 Hz               0.160%
66080 Hz            66115.70 Hz             0.054%
1000 Hz             1000.00 Hz              0.000%

192,000Hz
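The 0.107% figure can be checked against the six frequency readings listed above. The following is a minimal sketch, assuming the first value of each pair is the source frequency and the second is the frequency read in GoldWave (the column headings did not survive extraction); the names `readings` and `percent_error` are illustrative only.

```python
# (source frequency in Hz, frequency read in GoldWave in Hz), as listed above.
# Which value is "source" and which is "measured" is an assumption.
readings = [
    (760.0, 758.21),
    (350.0, 349.34),
    (5517.0, 5517.24),
    (512.0, 512.82),
    (66080.0, 66115.70),
    (1000.0, 1000.00),
]


def percent_error(expected_hz: float, measured_hz: float) -> float:
    """Absolute error of the measured frequency, as a percentage of the expected one."""
    return abs(measured_hz - expected_hz) / expected_hz * 100.0


errors = [percent_error(expected, measured) for expected, measured in readings]
average_error = sum(errors) / len(errors)
print(f"average error: {average_error:.3f}%")
```

Averaging the six per-reading errors gives a value that rounds to the reported 0.107%.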

(2)

4 GoldWave

3 45 GoldWave
6

4 GoldWave
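Values read in GoldWave: 0.2588, 0.2540; $\Delta t = 0.0018$ sec
$\overline{m_L m_R} = 0.9$ m; temperature 27 °C; $v = 347.2$ m/sec

From [1]:

$$\theta = \cos^{-1}\frac{v\,\Delta t}{\overline{m_L m_R}} = \cos^{-1}\frac{347.2 \times 0.0018}{0.9} = 46.02^\circ$$

From [2]:

$$\frac{x^2}{\left(\tfrac{1}{2} \times 347.2 \times 0.0018\right)^2} - \frac{y^2}{\left(0.9/2\right)^2 - \left(\tfrac{1}{2} \times 347.2 \times 0.0018\right)^2} = 1$$

$$\frac{x^2}{0.31248^2} - \frac{y^2}{0.45^2 - 0.31248^2} = 1$$

Since $\Delta t > 0$, the source lies on the branch with $x > 0$.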
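The worked example can be reproduced in a few lines. This is a sketch under the definitions of Eq. [1] and Eq. [2] ($a = \tfrac{1}{2} v \Delta t$, $c = \overline{m_L m_R}/2$, $b^2 = c^2 - a^2$); the function name `bearing_and_hyperbola` and its parameter names are hypothetical, not taken from the original program.

```python
import math


def bearing_and_hyperbola(delta_t_s: float, mic_distance_m: float, speed_m_s: float):
    """Return (theta in degrees, a in metres, b^2 in square metres) for one microphone pair."""
    path_difference = speed_m_s * delta_t_s          # v * delta_t
    if abs(path_difference) > mic_distance_m:
        raise ValueError("v * delta_t cannot exceed the microphone spacing")
    theta_deg = math.degrees(math.acos(path_difference / mic_distance_m))  # Eq. [1]
    a = path_difference / 2.0                        # semi-axis of the hyperbola
    c = mic_distance_m / 2.0                         # half the microphone spacing
    b_squared = c * c - a * a                        # denominator of the y^2 term in Eq. [2]
    return theta_deg, a, b_squared


# Values from the worked example: delta_t = 0.0018 s, spacing 0.9 m, v = 347.2 m/s.
theta, a, b2 = bearing_and_hyperbola(0.0018, 0.9, 347.2)
print(f"theta = {theta:.2f} deg, a = {a:.5f} m, b^2 = {b2:.6f} m^2")
```

With these inputs the angle agrees with the 46.02° obtained above, against the 45° at which the source was placed.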

5
Mic In BB
1 2

[ 2]

[5][6][7][8]

6-1

6-2

6-3

6-4

GoldWave

7
102
801

8-1

8-5

8-2

8-3

8-4

8-6

8-7 102

8-8 801

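Coordinates: origin (0,0), unit 1 cm.
Sensor pair 1: (-24,28) and (12,28). Sensor pair 2: (28,12) and (28,-24).

Mic In
The sensors are 4 cm outside the 48 cm x 48 cm target.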


(1)
5 10

GoldWave 192,000Hz
BB 3
3
GoldWave
$\Delta t$
$v = d / \Delta t$ ($d$: distance)

(2)


GoldWave

10-5

BB

10

(3)
BB
BB


[1][2]

0.5%
BB

11 Visual Basic 6
BB

11 BB

BB

Sound ' (i)

Sound (i)



2 bytes: $2^{15} = 32768$; 1 byte: $2^{7} = 128$
12

12


2.5%, 5%, 10%, 15%, 20%, 25%, 30%

25%
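The thresholds above are percentages, and a 16-bit sample has a maximum amplitude of $2^{15} = 32768$. The sketch below shows one way a threshold could turn a stereo recording into a time difference. It assumes, since the surviving text does not say so explicitly, that a channel's arrival is the first sample whose absolute value reaches the chosen percentage of full scale, and that $\Delta t = t_L - t_R$ as in Eq. [1]. The names `first_crossing` and `time_difference` are hypothetical.

```python
from typing import Optional, Sequence

FULL_SCALE_16BIT = 2 ** 15  # 32768, maximum amplitude of a 2-byte sample


def first_crossing(samples: Sequence[int], threshold_fraction: float,
                   full_scale: int = FULL_SCALE_16BIT) -> Optional[int]:
    """Index of the first sample whose magnitude reaches the threshold, or None."""
    limit = threshold_fraction * full_scale
    for index, value in enumerate(samples):
        if abs(value) >= limit:
            return index
    return None


def time_difference(left: Sequence[int], right: Sequence[int],
                    sample_rate_hz: int = 192_000,
                    threshold_fraction: float = 0.25) -> Optional[float]:
    """delta_t = t_L - t_R in seconds, or None if either channel never crosses."""
    i_left = first_crossing(left, threshold_fraction)
    i_right = first_crossing(right, threshold_fraction)
    if i_left is None or i_right is None:
        return None
    return (i_left - i_right) / sample_rate_hz


# Synthetic example: the right channel crosses 6 samples before the left one.
left_channel = [0] * 10 + [10000, 9000]
right_channel = [0] * 4 + [10000, 9000] + [0] * 6
print(time_difference(left_channel, right_channel))
```

A low threshold makes the detector sensitive to noise ahead of the real wavefront, while a high one delays detection on the weaker channel; that trade-off is what the threshold comparison in this section measures.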

13

Test point         2.5%       5%      10%      15%      20%      25%      30%
1               5476.66  1104.99   972.09   663.29   660.67   656.34   626.39
2               2103.14   553.73   553.73   587.05   590.83   587.05   590.83
3               1107.29   646.66   704.93   651.00   603.97   611.59   603.97
4                682.75   618.10   575.19   604.23   613.70   604.23   587.74
5                599.67   710.08   759.51   665.70   640.90   604.53   595.67
6                847.26   717.59   683.39   685.80   663.85   661.60   659.37
7                694.86   641.09   612.82   631.69   631.69   631.69   631.69
8               1865.35   816.48   733.72   732.62   693.84   625.80   598.02
9               1189.76   710.08   773.96   692.49   693.57   605.35   605.35
10              1543.72   855.38   579.63   571.68   575.99   604.06   647.58
Mean (m/s)      1611.05   737.42   694.90   648.55   636.90   619.22   614.66
Std. deviation  1452.70   157.29   126.18    50.38    41.15    24.33    25.04
Std. dev./mean   90.17%   21.33%   18.16%    7.77%    6.46%    3.93%    4.07%
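The summary figures for the 25% threshold can be recomputed from its ten speed values. This sketch assumes the spread is the sample standard deviation (n - 1 in the denominator), which is the definition that reproduces the reported 24.33; the variable names are illustrative.

```python
import statistics

# Speeds in m/s measured at the 25% threshold for the ten test points.
speeds_25_percent = [
    656.34, 587.05, 611.59, 604.23, 604.53,
    661.60, 631.69, 625.80, 605.35, 604.06,
]

mean_speed = statistics.mean(speeds_25_percent)
std_dev = statistics.stdev(speeds_25_percent)      # sample standard deviation
relative_spread = std_dev / mean_speed * 100.0     # standard deviation as % of the mean

print(f"mean = {mean_speed:.2f} m/s, std dev = {std_dev:.2f}, ratio = {relative_spread:.2f}%")
```

The 25% threshold has the smallest relative spread of the seven thresholds, which is consistent with its mean of 619.22 m/s being the speed used in the rest of the paper.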


(4)
25%
619.22 m/s

No.   Point (x, y)     d (cm)    t (sec)     v (m/sec)
1     (20, 20)         33.41     0.000509    656.34
2     (16, -16)        15.28     0.000206    587.05
3     (12, -4)         16.17     0.000264    611.59
4     (-16, 12)        14.36     0.000238    604.23
5     (-24, -8)        14.91     0.000247    604.53
6     (16, 24)         32.51     0.000491    661.60
7     (12, -16)        14.36     0.000227    631.69
8     (-4, 12)         16.17     0.000258    625.80
9     (-8, -24)        14.91     0.000246    605.35
10    (-16, 16)        15.28     0.000253    604.06

(1)
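Let the BB's point of impact be $(x, y)$, and let $\Delta t_1$ and $\Delta t_2$ be the time differences measured by sensor pairs 1 and 2.

If $\Delta t_1 \neq 0$ and $\Delta t_2 \neq 0$:

$$\frac{(x+6)^2}{\left(\tfrac{1}{2} v \Delta t_1\right)^2} - \frac{(y-28)^2}{\left(\tfrac{24+12}{2}\right)^2 - \left(\tfrac{1}{2} v \Delta t_1\right)^2} = 1$$

$$\frac{(y+6)^2}{\left(\tfrac{1}{2} v \Delta t_2\right)^2} - \frac{(x-28)^2}{\left(\tfrac{24+12}{2}\right)^2 - \left(\tfrac{1}{2} v \Delta t_2\right)^2} = 1 \qquad [3]$$

If $\Delta t_1 = 0$ and $\Delta t_2 \neq 0$:

$$x = -6, \qquad \frac{(y+6)^2}{\left(\tfrac{1}{2} v \Delta t_2\right)^2} - \frac{(x-28)^2}{\left(\tfrac{24+12}{2}\right)^2 - \left(\tfrac{1}{2} v \Delta t_2\right)^2} = 1$$

If $\Delta t_1 \neq 0$ and $\Delta t_2 = 0$:

$$\frac{(x+6)^2}{\left(\tfrac{1}{2} v \Delta t_1\right)^2} - \frac{(y-28)^2}{\left(\tfrac{24+12}{2}\right)^2 - \left(\tfrac{1}{2} v \Delta t_1\right)^2} = 1, \qquad y = -6$$

If $\Delta t_1 = 0$ and $\Delta t_2 = 0$:

$$x = -6, \qquad y = -6$$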

(2)
[ 3]

t1 t 2 14


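The lines $x = -6$ (where $\Delta t_1 = 0$) and $y = -6$ (where $\Delta t_2 = 0$) divide the target into four regions, Part I to Part IV, according to the signs of $\Delta t_1$ and $\Delta t_2$: $\Delta t_1 > 0$ places the point of impact at $x > -6$, and $\Delta t_2 > 0$ places it at $y > -6$.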

14 t 1 t 2

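Equation [3] is solved numerically.

Step 1: let $y$ run over $[-24, 24]$ in steps of 0.1 (481 values) and use $\Delta t_1$ to compute $x$:

$\Delta t_1 > 0$:

$$x = -6 + \sqrt{\frac{\left(\tfrac{1}{2} v \Delta t_1\right)^2 \left[(y-28)^2 + 18^2 - \left(\tfrac{1}{2} v \Delta t_1\right)^2\right]}{18^2 - \left(\tfrac{1}{2} v \Delta t_1\right)^2}}$$

$\Delta t_1 = 0$:

$$x = -6$$

$\Delta t_1 < 0$:

$$x = -6 - \sqrt{\frac{\left(\tfrac{1}{2} v \Delta t_1\right)^2 \left[(y-28)^2 + 18^2 - \left(\tfrac{1}{2} v \Delta t_1\right)^2\right]}{18^2 - \left(\tfrac{1}{2} v \Delta t_1\right)^2}}$$

Step 2: let $x'$ run over $[-24, 24]$ in steps of 0.1 (481 values) and use $\Delta t_2$ to compute $y'$:

$\Delta t_2 > 0$:

$$y' = -6 + \sqrt{\frac{\left(\tfrac{1}{2} v \Delta t_2\right)^2 \left[(x'-28)^2 + 18^2 - \left(\tfrac{1}{2} v \Delta t_2\right)^2\right]}{18^2 - \left(\tfrac{1}{2} v \Delta t_2\right)^2}}$$

$\Delta t_2 = 0$:

$$y' = -6$$

$\Delta t_2 < 0$:

$$y' = -6 - \sqrt{\frac{\left(\tfrac{1}{2} v \Delta t_2\right)^2 \left[(x'-28)^2 + 18^2 - \left(\tfrac{1}{2} v \Delta t_2\right)^2\right]}{18^2 - \left(\tfrac{1}{2} v \Delta t_2\right)^2}}$$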
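The two scanning steps can be combined into a small position solver. The sketch below is not the original Visual Basic 6 program. It assumes the board speed of 619.22 m/s, hyperbola centres at (-6, 28) and (28, -6) with half-spacing 18 cm, and a positive time difference selecting the "+" branch. The way the two curves are intersected (pick the grid value of y where the second curve, evaluated at the x from the first, comes closest to y) is also an assumption, because the corresponding step did not survive extraction. `offset_from_axis` and `locate` are hypothetical names.

```python
import math

SPEED_CM_S = 61922.0       # 619.22 m/s expressed in cm/s
HALF_SPACING_CM = 18.0     # c = (24 + 12) / 2
GRID = [-24.0 + 0.1 * i for i in range(481)]   # 481 values from -24 cm to 24 cm


def offset_from_axis(delta_t_s: float, other_coord: float) -> float:
    """Signed offset of the hyperbola branch from the line x = -6 (or y = -6)."""
    a = 0.5 * SPEED_CM_S * delta_t_s
    if a == 0.0:
        return 0.0                              # degenerate case: the centre line itself
    b_squared = HALF_SPACING_CM ** 2 - a ** 2
    if b_squared <= 0.0:
        raise ValueError("time difference too large for this sensor spacing")
    root = math.sqrt(a * a * ((other_coord - 28.0) ** 2 + b_squared) / b_squared)
    return math.copysign(root, delta_t_s)       # sign of delta_t selects the branch


def locate(delta_t1_s: float, delta_t2_s: float):
    """Approximate (x, y) in cm where the two hyperbola branches meet."""
    best = None
    for y in GRID:
        x = -6.0 + offset_from_axis(delta_t1_s, y)              # Step 1: x from pair 1
        y_from_pair2 = -6.0 + offset_from_axis(delta_t2_s, x)   # Step 2 at that x
        mismatch = abs(y - y_from_pair2)
        if best is None or mismatch < best[0]:
            best = (mismatch, x, y)
    return best[1], best[2]


x, y = locate(-0.000026, -0.000026)
print(f"({x:.1f}, {y:.1f})")
```

For the time differences of the shot aimed at (-8, -8), the sketch lands within about 0.1 cm of the (-7.8, -7.8) reported in the results. The scan only resolves y to the 0.1 cm grid, so finer accuracy would need a refinement step.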


Client / Server 15
Client
BB
Server Server Client

15

(1) Client

BB 3

1 Buffer1
2 Buffer2


3 Buffer3
1 BB
Server 1
3 1 2

[3][4] Windows API


GlobalAlloc, GlobalLock, GlobalFree,
waveInOpen, waveInStart, waveInAddBuffer, waveInPrepareHeader,
waveInStop, waveInClose, waveInUnprepareHeader
Winsock: Connect, ConnectionRequest,
SendData, DataArrival, GetData; Client
16 17 Visual Basic 6 Client

16 Client


17 Client BB

(2) Server
Server
Client
Client

18 Server

18 Server Visual Basic 6 Server


19 3 Winsock 1
PictureBox3 TextBox Winsock
Winsock Client

Client


19 Server

Ten BB test shots: the average error is 1.18 cm on the 48 cm x 48 cm target.

No.   Actual point     Pair 1 t1 (sec)   Pair 2 t2 (sec)   Computed point      Error (cm)
1     (16, -12)         0.000274         -0.000184         (17.7, -12.7)       1.8
2     (12, 0)           0.000274          0.000144         (11.3, 0.2)         0.7
3     (8, 8)            0.000274          0.000250         (8.6, 6.2)          1.9
4     (8, -12)          0.000187         -0.000090         (8.4, -10.2)        1.8
5     (4, -8)           0.000146         -0.000021         (4.3, -7.1)         0.9
6     (0, 0)            0.000099          0.000092         (-0.1, -0.6)        0.6
7     (-4, 4)           0.000034          0.000127         (-4.1, 2.3)         1.7
8     (-8, 0)          -0.000031          0.000081         (-7.8, -0.3)        0.4
9     (-8, -8)         -0.000026         -0.000026         (-7.8, -7.8)        0.3
10    (-16, -16)       -0.000130         -0.000128         (-17.2, -17.1)      1.6
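The reported average error can be recomputed from the ten shots above. This is a minimal sketch, assuming each error is the straight-line distance between the actual point and the computed point; the names `shots` and `mean_error` are illustrative.

```python
import math

# (actual point, computed point) in cm for the ten test shots.
shots = [
    ((16, -12), (17.7, -12.7)),
    ((12, 0), (11.3, 0.2)),
    ((8, 8), (8.6, 6.2)),
    ((8, -12), (8.4, -10.2)),
    ((4, -8), (4.3, -7.1)),
    ((0, 0), (-0.1, -0.6)),
    ((-4, 4), (-4.1, 2.3)),
    ((-8, 0), (-7.8, -0.3)),
    ((-8, -8), (-7.8, -7.8)),
    ((-16, -16), (-17.2, -17.1)),
]

# Euclidean distance between where the shot landed and where it was displayed.
errors = [
    math.hypot(computed[0] - actual[0], computed[1] - actual[1])
    for actual, computed in shots
]
mean_error = sum(errors) / len(errors)
print(f"mean error = {mean_error:.2f} cm")
```

Averaging the unrounded distances gives 1.18 cm, whereas averaging the one-decimal errors in the list gives 1.17 cm, so the reported figure appears to come from unrounded values.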


20 ()()

At 619.22 m/s, the wave crosses the 48 cm target in 0.000775 sec.

At a sampling rate of 192,000 Hz this corresponds to 148 to 149 samples,

or 0.322516 cm/sample.
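The resolution figures follow from the measured speed and the sampling rate. The sketch below only redoes that arithmetic; the variable names are illustrative.

```python
SPEED_CM_PER_S = 61922.0     # 619.22 m/s measured on the target board
SAMPLE_RATE_HZ = 192_000     # sampling rate used for recording
TARGET_WIDTH_CM = 48.0

crossing_time_s = TARGET_WIDTH_CM / SPEED_CM_PER_S        # time for the wave to cross the target
samples_across = crossing_time_s * SAMPLE_RATE_HZ         # samples recorded during that time
cm_per_sample = SPEED_CM_PER_S / SAMPLE_RATE_HZ           # distance travelled per sample

print(f"{crossing_time_s:.6f} s, {samples_across:.2f} samples, {cm_per_sample:.4f} cm/sample")
```

One sample of timing error therefore shifts the path difference by roughly 0.32 cm, which gives a sense of scale for the 1.18 cm average error.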


DAQ, Data Acquisition

[5][6][7][8]

21-1

21-2

21-1

21-2


619.22 m/s
619.22 m/s
22-1 BB 22-2
P
S
2.5%, 5%, 10%, 15%, 20%, 25%, 30%

22-1

22-2

48
1.18


MAYA44
V3

http://neural.cs.nthu.edu.tw/jang/books/audioSignalProcessing/

, , , 2004
, Visual Basic 6.0 , , 1999
http://web.ncyu.edu.tw/~andrew/

http://zh.wikipedia.org/zh-tw/
http://en.wikipedia.org/wiki/
, ,
32 , 1995
,
, 44 , 2004
