Similarity visualization for the grouping of forensic speech recordings
| Authors | |
|---|---|
| Publication date | 2008 |
| Host editors | |
| Book title | Computational Forensics |
| Book subtitle | Second International Workshop, IWCF 2008, Washington, DC, USA, August 7-8, 2008 : proceedings |
| ISBN | |
| ISBN (electronic) | |
| Series | Lecture Notes in Computer Science |
| Event | Second International Workshop on Computational Forensics (IWCF 2008), Washington, DC, USA |
| Pages (from-to) | 169-180 |
| Publisher | Berlin: Springer |
| Organisations | |
| Abstract | In a forensic phone wiretapping investigation, a major problem is to get the full picture of the speakers involved. Typically, the wiretapped speech recordings are grouped using a clustering tool. The main disadvantage of such an approach is that in a bootstrapped scenario grouping errors accumulate. In this paper, we propose a visual approach to find similar speech recordings that probably stem from the same speaker. We first model the speech recordings and define suitable similarity measures between recordings. Then, through an approximate 2-D visualization of the inter-speech similarities, the investigator can identify clear groups of recordings and recordings that are harder to differentiate. We did extensive experiments on phone data of 50 speakers with 2 recordings per speaker. We tested the quality of the 2-D visualization in relation to the original high-dimensional similarities. It turned out that for the original high-dimensional similarity measure the nearest recording is almost always the one from the same speaker. In the 2-D visualization, we achieved that, on average over all speech recordings, a recording of the same speaker is among the 10 nearest recordings. |
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.1007/978-3-540-85303-9_16 |
| Downloads | 295465.pdf (Submitted manuscript) |
| Permalink to this page | |
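
The evaluation mentioned in the abstract (how near the closest recording of the same speaker is, in the original similarity space versus the 2-D layout) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `same_speaker_ranks` and `euclidean_matrix`, the use of Euclidean distance, and the toy coordinates are all assumptions made for illustration only.

```python
import math


def euclidean_matrix(points):
    """Full pairwise Euclidean distance matrix for a list of coordinate tuples."""
    return [[math.dist(p, q) for q in points] for p in points]


def same_speaker_ranks(dist, speakers):
    """For each recording, return the rank (1 = nearest neighbour) of the
    closest other recording with the same speaker label, or None if the
    speaker has no other recording."""
    n = len(speakers)
    ranks = []
    for i in range(n):
        # Order all other recordings by increasing distance to recording i.
        others = sorted((j for j in range(n) if j != i), key=lambda j: dist[i][j])
        # Position of the first neighbour that shares the speaker label.
        rank = next(
            (pos for pos, j in enumerate(others, start=1)
             if speakers[j] == speakers[i]),
            None,
        )
        ranks.append(rank)
    return ranks


# Hypothetical toy data: two speakers with two recordings each,
# placed at made-up 2-D positions (not taken from the paper).
speakers = ["A", "A", "B", "B"]
points_2d = [(0.0, 0.0), (0.0, 3.0), (1.0, 0.0), (5.0, 5.0)]

ranks = same_speaker_ranks(euclidean_matrix(points_2d), speakers)
mean_rank = sum(ranks) / len(ranks)
print(ranks, mean_rank)
```

Running the same function on the distance matrix of the original high-dimensional measure and on the distances in the 2-D layout would give two mean ranks that can be compared directly, which is the kind of comparison the abstract reports.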