Audio Analytics Aggression Tested

Author: Ethan Ace, Published on Nov 20, 2015

What if you could use your IP cameras to detect fights before they start? 

That is the goal of Louroe / Sound Intelligence with their recently released Aggression Detector audio analytics. Claiming that "90% of physical aggression are preceded by verbal aggression", these analytics are designed to alert guards before verbal altercations turn physical.

We tested the Aggression Detector using both live subjects and recorded clips of multiple scenes, to see just how these analytics perform.

Here is the app main view that we used to monitor and optimize the audio analytics:

**** ** *** ***** *** **** ** ******* ** ****** fights ****** **** *****? 

**** ** *** **** ******** / ***** **************** ***** ******** ****************** ******** ***** *********. ******** **** "**% ** ******** ********** *** ******** ** verbal **********", ***** ********* *** ******** ** ***** ****** ****** verbal ************ **** ********.

** ****** *** ********** ******** ***** **** **** ******** *** ******** clips ** ******** ******, ** *** **** *** ***** ********* perform.

**** ** *** *** **** **** **** ** **** ** ******* and ******** *** ***** *********:

[***************]

*******

******/***** ************ ********** ******** ********* *** * *********** ****** ******** to ************ *******, ********* ***** ** ****** ***** *** ********* go *********, ******** ****** ******/************ ********. 

*******, **** *** *** ******** *** *** *********, *** *** best **** ** ******** ***** *****, ** ****** ** ******** ******** will ******* ******** ***** ******, ***** **** ********** ***** ****** reduce analytics' ******* ** *********** ******. 

Key ********

** *** *****, ******'* ********** ********* *********** ** ***** *********** over *** ****** ** ***** **** ** ******* **** ****** area (**** ****** ***** *************, ****** ******* **** *** ***** ** other *******, ** ***** ***********).

*******, **** ***-********** ****** ******** ****** ***** ****** ** ******* degrees:

  • *****/***** *****:***** ****** *** ****** ********* **** ******** ***** ******, ********** of ***********. ****** ********** *** ***** ********** ********* ** ***** applications.
  • ****, ***-********** ******: ** **** *****, ****** ***-********** ******, ******, etc., ********* ******. **** *** ******** *** ******** * *** times (

********* ******** ********** ******** ****** ** ******* *** *******. ***** shouts ** **** ****** ***** ********* ********* ****** *** *****.

****** ******** ***** *** ** **** ** **** *********** *** best ***********. ******** **** *** *** *********** *** ********* ****** in *** *****, **** **** ********** ***** ****** *********** ******** detection ***********. 

** ****** *** ***** ***********. *** ****** ******* ** ******* I/O *** ** ****.

*******

**** *** ******'* ********** ********* ** $*** *** *******.

Potential ************

*** **** *** *** ***** ********** ********* ** ** ******** quiet ***** ***** ********** ****** **** ****** ****** *** ******* background *****, ********* *******, *********, ****** ********, **********, ***.

***** ********* ***** ** ****** ** *******, *** *** ********* volume *** ***** ***** ** ****** ***** ******/***** ** ** between ******* **** ****** ******* ******** ***** ******. ***** ***** *** be *** ******* ** ***** **** *******, *** **** ***** largely ****** *** ******** ** *** ********.

** **** ***** ***** ****** ****** *** ******, **** ** stadiums, ***** ****** *** ****** ** ** **** ******** *** to ***** *****.

*******, ** **** **** ***** **** ** ******* ** *****, **** are ****** ******** ** ******** ****** ****** *** ********** *** to *** ********* ***** *** ********** ******, ****** *** ********* unsuitable.

************/*************

******'* ********** ******** *** * ****** ************* ******, **** **** one ******** *******: ***********, *********** ** *** *****. **** ****** also ***** **** **** ***** ******** ** ***** ******:

  1. ***** ***** ********:*** *** ***** ***** ***** ********, **** *** ****** ***** displayed ** *****, ******* ** ****** **** ***** *** ********* threshold, ** ****** **** ********, *** ******* *** **** ********** is ********. ***** ********** ** ******, *** ********** **** ******* from ***** ** ***** ***.
  2. ***** ***********: *** ****** ***** ******** ********* ********, **** ****** ******* frequencies ********* ** *** *** **** ******, ****** *** ***, and **** *** ***.
  3. **************: *******, ** *** ******, ******** ************** ** *****. ****** ***** from ***** (********** ***** ****) ** **** (************), **** ** green *** ***-******** ***** ******, *** ******/******/*** *** ********** ****** of ******/**********.

** ******** ********* ********** *********** *** ********* *********** **** *** course ** ******** **** ** ***** ** ******** ***** ***** ******/****** activations. *********** ******** **** **** **** ********* ** *** ********** used, **** ******** *******, *** ********** ***** ******* ** *** time ** ***********.

***** **** ***** ********, ***** ****** ****** **** *** ***** settings. ** ****** ***** ******* ** **** ******, ********* *** specific ***********, ** *** ***** ************ *****.

Video ***********

*** ***** ***** ******* ************ *** ************* ** **** ******, including *********** ********, *** *****, *** ****:


*** ***********

******'* ********* *** *** ********* **** *** ** *** ***** tested: ********, *******, *****, *** ********. ******* ** ***** **** applications, **** *** ** **** ** ******* *** ************* ** relay ******* ** *** ******, ***** *** ** ******** ********** to *** ***.

** ***** ******* *** *** *********, ***** *** **** ******* email/SMS ******** *** ******.

Audio ******** ********

*** ******** ***** **** ******** ******** ** ******* ************, **** live *** **** ******** *****.

**** **** ****** ********** ***** *** ** **** *** ************* ********, as *** ********* ******** ** *** ********* ********** *** ******** playing *** **** **** *** ******* *** *** ****** ********* performance. However, ** **** ***** *** ***** ******** ****** ** * way ** ********** **** *** ******* ** ******** *******.

**** ***** *** *****

*****, **** ******* ***** ******** ** * ***** **** * crowd ** **********. ****** ****** *** ****, ***** ** *** large ******* ** ****** ** *** ******** ****** *** *** frequencies ***** ** *** ***********. ********** ****** **** ********, *** not ******** ** **** *******:

****, ***-********** ******

****, **** ******* ***** ****, *** *** ********** ******. **** that ******** ****** **** **********, ** *** ******** ****** ***** all ******. *******, ******* ********* ******, *** ****** ** *** analyzed ** ********.

******* ***** *****

*******, **** ******* ***** ******* ***** ***** ** *** ***********. Even **** ******* ****** *** ******* ***********, ****** *** ****** constant:

Test **********

**** *** ********* ***** ** **** ***** **** ****** ******** A *** *** **-* ***** *******.

*** ********* ********/******** ******** **** **** ** **** ****:

  • ******/***** ************ ********** ********: *.*.*-******
  • **** *****: *.**.*.*
  • ***********: *.*.*.*****
  • ******* ******** ******: *.*.****.**
  • ********* ********: **** (*.**)

Comments (8)

Nice thorough review!

Couple questions,

Are those timelines stored somewhere, or once they move off the screen are they gone?

Did you play back the clips thru speakers into the room and bring it thru the back thru the Mic, or just bring the clips straight in thru the line level input?

Considering that broadcast audio normally has its noise floor ducked and frequency response tuned for speech, if you brought it in direct it would, IMHO, perform far better than one could expect in the real world.

I remember a while back reading that the Sound Intelligence could detect swearing. Is that part of this product? If so is it region specific, as colorful speech often has it localized variants.

Finally, you say that there were no false positives in the three days of open office deployment. But were there any false negatives? I know that when I'm troubleshooting a particularly uncooperative camera, I occasionally might indicate my dissatisfaction by sharing an expletive with the device. In rare cases I have even gone so far as to threaten components with a hot soldering iron.

So I guess I'm asking is, did you get a chance to yell in an aggressive manner, for whatever reason, and did it seem to trigger correctly?

Thanks!

The timeline images are not stored, but you can record the audio that preceded and followed an alarm.

The analytic is designed to use unfiltered audio streams and compression algorithms that don't filter out certain frequencies. So playing an mp3 over a speaker might work to trigger a detection, but isn't as ideal as a couple guys yelling at each other.

Swearing detection is an interesting idea. Certainly if you screamed a swear word you'd get a detection, but that's because of a stressed voice and not the word identification. So as long as people aren't softly whispering swear words at each other you'd still get a detection regardless of language being spoken.

I'd be curious to see if they soon further develop additional audio analytics includes keyword recognition, alerting and searching.

Does this product assume that audio is actually recorded or is it just for live monitoring? In some states audio can't be recorded legally without consent from both parties.

Louroe says: "The analytic isn't listening to words or languages but rather vocal stress and the associated change in frequencies."

That would help reduce the risk / concern. The analytic also does not record audio.

Ultimately, it is a legal issue but it does seem technically different than audio recording.

A novel use of this product would be to trigger the recording of audio only after and during detection, as such speech would not be considered interception of a private conversation, and would allow the capture of important audio details.

Thanks for the review. Regardless of test results, it is an innovative product from Louroe! You say you used an Axis Q1615 in the tests, but I don't understand where you're using the camera? Do you mean that you're just using the Q1615's audio feature?

Matt, the audio analytics app is running on the Axis camera and the mic is connected to the Axis camera. This is shown in the config video, which I've copied below:

Login to read this IPVM report.
Why do I need to log in?
IPVM conducts unique testing and research funded by member's payments enabling us to offer the most independent, accurate and in-depth information.

Related Reports

The IP Camera Lock-In Trend: Meraki and Verkada on Jan 18, 2019
Open systems and interoperability have not only been big buzzwords over the past decade, but they have also become core features of video...
Mobile Surveillance Trailers Guide on Jan 17, 2019
Putting cameras in a place for temporary surveillance where power and communications are not readily available can be complicated and expensive....
Avigilon Favorability Results 2019 on Jan 15, 2019
Since IPVM's 2017 Avigilon favorability results, the company was acquired by Motorola and has shifted from being an aggressive startup to a more...
Gorilla Technology AI Provider, Raises $15 Million, Profiled on Jan 15, 2019
Gorilla Technology is a Taiwanese video analytics manufacturer that recently announced a $15 million investment from SBI Group, saying this...
Pelco Favorability Results 2019 on Jan 11, 2019
Pelco had a significant favorability problem amongst integrators in our previous study (see 2016 Pelco results). Now, in the first edition of our...
CES 2019 Show Report on Jan 10, 2019
IPVM was at CES 2019 for the second year (see our CES 2018 Show Final Report) and is reporting on announcements and interesting news from the show...
NTP / Network Time Guide For Video Surveillance on Jan 10, 2019
Inaccurate time can lead to missing or inadmissible video, yet this topic is often overlooked, with cameras and servers left defaulted,...
H.265 / HEVC Codec Tutorial on Jan 08, 2019
H.265 support improved significantly in 2018, with H.265 camera/VMS compatibility increased compared to only a year ago, and most manufacturers...
IPVM Best New Products 2019 Opened - 70+ Entrants on Jan 07, 2019
The inaugural IPVM Best New Product Awards has been opened - the industry's first and only program where the awards are not pay-to-play and the...
CyberExtruder Face Recognition Profile on Jan 04, 2019
CyberExtruder offers 3D modeling face recognition software that they say provides quicker and more accurate matches than other 2D face recognition...

Most Recent Industry Reports

The IP Camera Lock-In Trend: Meraki and Verkada on Jan 18, 2019
Open systems and interoperability have not only been big buzzwords over the past decade, but they have also become core features of video...
NYPD Refutes False SCMP Hikvision Story on Jan 18, 2019
The NYPD has refuted the SCMP Hikvision story, the Voice of America has reported. On January 11, 2018, the SCMP alleged that the NYPD was using...
Mobile Surveillance Trailers Guide on Jan 17, 2019
Putting cameras in a place for temporary surveillance where power and communications are not readily available can be complicated and expensive....
Exacq Favorability Results 2019 on Jan 17, 2019
Exacq favorability amongst integrators has declined sharply, in new IPVM statistics, compared to 2017 IPVM statistics for Exacq. Now, over 5 since...
Testing Bandwidth Vs. Low Light on Jan 16, 2019
Nighttime bandwidth spikes are a major concern in video surveillance. Many calculate bandwidth as a single 24/7 number, but bit rates vary...
Access Control Records Maintenance Guide on Jan 16, 2019
Weeding out old entries, turning off unused credentials, and updating who carries which credentials is as important as to maintaining security as...
UK Fines Security Firms For Illegal Direct Marketing on Jan 16, 2019
Two UK security firms have paid over $200,000 in fines for illegally making hundreds of thousands of calls to people registered on a government...
Access Control Cabling Tutorial on Jan 15, 2019
Access Control is only as reliable as its cables. While this aspect lacks the sexiness of other components, it remains a vital part of every...
Avigilon Favorability Results 2019 on Jan 15, 2019
Since IPVM's 2017 Avigilon favorability results, the company was acquired by Motorola and has shifted from being an aggressive startup to a more...
Gorilla Technology AI Provider, Raises $15 Million, Profiled on Jan 15, 2019
Gorilla Technology is a Taiwanese video analytics manufacturer that recently announced a $15 million investment from SBI Group, saying this...

The world's leading video surveillance information source, IPVM provides the best reporting, testing and training for 10,000+ members globally. Dedicated to independent and objective information, we uniquely refuse any and all advertisements, sponsorship and consulting from manufacturers.

About | FAQ | Contact