Skip to content Skip to navigation

Connexions

You are here: Home » Content » Results (Analysis of Speech Signal Spectrums Using the L2 Norm)

Navigation

Recently Viewed

This feature requires Javascript to be enabled.
 

Results (Analysis of Speech Signal Spectrums Using the L2 Norm)

Module by: Nicholas. E-mail the author

Summary: Results

VIII – Results

Table 8.1 shows the values of the L2 norms against a database of sounds generated by Nicholas. The first row represents a comparison of phrases generated by Nicholas to the database. The second row represents the same phrases generated by Andrew to the database.

Table 1
  “one” “two” “three” “four” “five” “six” “seven” “eight” “nine” “ten”
Nicholas 0.37 0.27 0.18 0.28 0.24 0.21 0.31 0.29 0.46 0.29
Andrew 0.78 0.67 0.66 0.74 0.39 0.50 0.64 0.62 0.64 0.66

Table 8.1: L2 Norm values for comparison to a database of Nicholas signals.

It is highly likely that the two distributions represent different quantities. The mean and standard deviation for the L2 norm values generated by Nicholas (Andrew) are 0.3 and 0.08 (0.63 and 0.11). This shows that by placing a threshold at around 0.45, we can separate the database into matches and impostors.

We can make a similar comparison to the database of sounds generated by Andrew; table 8.2 shows the values of L2 norms against this database.

Table 2
  “one” “two” “three” “four” “five” “six” “seven” “eight” “nine” “ten”
Andrew 0.23 0.45 0.31 0.32 0.41 0.28 0.24 0.35 0.42 0.35
Nicholas 0.76 0.63 0.58 0.51 0.48 0.41 0.51 0.62 0.59 0.64

Table 8.2: L2 Norm values for comparison to a database of Andrew signals.

Note that the populations L2 norm values from the true match (Andrew) and the impostor (Nicholas) are not as disparate as the analgous values when comparing to the database generated by Nicholas. In fact, the L2 norm for the true “nine” match is actually higher than the impostor “six” match.

This is not completely unexpected. Indeed, the same words issued by different people have great similarities – thus they are understood to have the same meaning. We can reduce the probability of false alarm (the probability that we will inappropriately mark a true match as an impostor) by combining the results from the matches of all four components of the Personal Identification Number.

Specifically, we are able to set the threshold for the L2 norm to a high value (e.g. 0.48). Then, if the comparison value exceeds this amount for any one phrase, the candidate is flagged as an impostor.

Figure 8.1 shows the Receiver Operating Characterisic (ROC) curve for the algorithm. This curve was generated by varying the L2 norm threshold used distinguish matches from non-matches. For this plot, a detection is an accurate classification of an impostor as an impostor. A false alarm is a classification of a valid user as an impostor.

We generated this curve using databases of eight recordings per signal. Additionally, for this ROC curve, impostors were always implemented with the correct PIN numbers.

Figure 1
Figure 1 (graphics1.png)

Figure 8.1: Receiver Operating Characteristic of complete algorithm

For a probability of 50% to accurately identify a valid user as a match, the probability of detecting an impostor as an impostor is approximately 88%. This corresponds to a L2 norm threshold value of 0.70. At this operating level, on average, a valid user would have to repeat his/her pin twice before being correctly identified as a match. Increasing the probability of detecting an impostor to 95% only increases the probability of false alarm to approximately 55%. This corresponds to a threshold value of 0.65.

Content actions

Download module as:

PDF | EPUB (?)

What is an EPUB file?

EPUB is an electronic book format that can be read on a variety of mobile devices.

Downloading to a reading device

For detailed instructions on how to download this content's EPUB to your specific device, click the "(?)" link.

| More downloads ...

Add module to:

My Favorites (?)

'My Favorites' is a special kind of lens which you can use to bookmark modules and collections. 'My Favorites' can only be seen by you, and collections saved in 'My Favorites' can remember the last module you were on. You need an account to use 'My Favorites'.

| A lens I own (?)

Definition of a lens

Lenses

A lens is a custom view of the content in the repository. You can think of it as a fancy kind of list that will let you see content through the eyes of organizations and people you trust.

What is in a lens?

Lens makers point to materials (modules and collections), creating a guide that includes their own comments and descriptive tags about the content.

Who can create a lens?

Any individual member, a community, or a respected organization.

What are tags? tag icon

Tags are descriptors added by lens makers to help label content, attaching a vocabulary that is meaningful in the context of the lens.

| External bookmarks