BBC Featured Deep Learning Face Recognition Analyzed (Digital Barriers)

Published Aug 30, 2017 13:36 PM

The UK's largest broadcaster, BBC, recently featured Digital Barriers (also based in the UK) on a segment highlighting face recognition technology. The TV segment showed Digital Barriers' SmartVis Face [link no longer available] being used to monitor public spaces for appearances of "terrorist watchlist" suspects.

 

Digital Barriers claims the use of deep neural networks is extending the capabilities of their SmartVis Face product, allowing them to achieve new levels of performance and accuracy.

This report analyzes the SmartVis product, based on conversations with Digital Barriers, and outlines where the company says the product has made advancements, and where limitations still exist in facial recognition.

SmartVis ********

******** ** ******* ********' ***** ********* ******* line. ******** **** [**** ** ****** available] ** *** ******-*********** *********, ***** is ***** ** ********** ******* **************** **** ************** ** ****. ***** **** ***********, *** company says **** **** **-********** *** ******** of *** ********, ********* *********** * deep ******** ***** ** *** ****** to ****** ****** ***** ** ***-******* conditions.

******** **** ** * ******-***** ******, analyzing ******* **** ********* ******* ** real ****, *** ********** ****** ** logs ***** ** ************* ******** *** matching (** *** ********) * **** against * ******** ** ******** *****.

Deep ******** ***********

******* ******** ****** **** **** **** a **** ******** ******** ** ***** a ****** ******* ** ********* ***** by ******** *********** ******** **** ************* various *****. **** ** ** ******** to *********** ******* ***** ********* **** to ******* ********** *** ********* ******* various ********** ** * **** (*.*.: spacing ******* ****, ******** ******* **** and ****, ***.).

*** **** ******** ******** ****** **** to **** * *****, ** * lack ** * ***** ****** ** compared ** ******** ********. **** **** enables ****** ******** ***** **** *** view ** *** **** ** ************* different **** *** *****(*) **** ** enroll ** ** *** ********.

* ***** **** ** ******* ****** several ****** ** *****, ******* ** from **** * ****** "**** ****", which ******* ******** ****** **** *********** to ********* ******** ** *** ******, and *** ***** ** ********, ** the ****** ***** **** ******** ***** of *** ****, ********** ********* **** data *** *** ******** ********* ** analyze.

******* ******** ****** ***** ************ ***** the ******* **** ****** *** "***-*********" checking, ***** * ****** *** *** be ***** * ****** *********** ****** is ** ***, *** **** *** not ***** ** "******" **** *** system ** ******* ******** ** * camera ** ******* ******* * **** obvious ***** *****.

Field ** **** **************

*** ********* ***** * ******* ** 40 ****** ******* *** **** (~*** PPF / ~*** ***) *** ******** performance, *** **** *** ******** *********** ************ up ** ** ****** ******* *** ****, with ****** ******* (*.*.: ** ******) not ******** *** ********** ******* ** accuracy.

** ** * ***** ** *** field ** **** *** ** ************** analyzed.

***** ***** ** ** *********** ******* resolution *********, *** * **** ***** and ***** ************ ********* **** ** customers ***** ***** *******.

Enrolling *****

***** *** ** ******** ** *** database **** ****** ****** ******** ("********"), from * ****** ******** ** * PC, ** **** ** **** ***** segments. *******, ******** *****/****** ** *** face *** ******** *** ******* ******** in ******** *** ******/**** *****.

Server ************

******** **** **** ** ** *****/******* based ******** (** ** ** *********), and **** **** ** ********** *** deep ****** ******* *********. ******* **** are****** *** **** ******** *****

***** ****** ************ **** **** ***** on *** ****** ** ********, **** of *** ******** ***** ********, *** match/lookup ***** ******* ** *** ********. An ******* ****** **** * ****** GTX **** **** ** ********* *** * streams, **** * ******** ** ~*,*** faces *** ****** ***** ******* ** real-time (be ***** ** ****** **** ******* simultaneously, *** *** *** *** ************ may ******** *** ***** ********* ** applications **** **** **** ********** ** people.

*******

***** ****** **** ~$*,***-$*,*** *** *******/******, primarily ********* ** *** **** ** the ****** ******** *** ******** ***** to ********. *** $*,*** ***** ** most ******, ********* ** ******* ********, for ********* ******* ********** ** *** range ** ******* ******* ** ~*,*** faces.

Sales *******

**** **** ***** ******* *******, ******* ******** ***** *** ******** analytics ******* ******** ******** ********** ******** globally.

Software - **** ******** *** ** ********

***** ******** **** **** ******* * "functional" **** *********, ********* ** ******* Barriers, **** ********* *** *** ******* more ** ** ****** **** ****** a *** ***** **** *********, ***** could ** * ***, ** **** other ***********-******** ********. ******* ******** **** have ************ **** ***** **** ********* and *******, ** **** ** **** PSIM *********, *** *** **** ****** customers **** ****** ***********, ** ******, for ********** *****. *** ******* ****** that *** ** *** ******* ** their ******** **** ***** ****** ** no "********" *******, *** **** ***** should ****** **** ****** ** ****** integration ** **** ***********.

*** ******* **** **** *** ******* on * *** ******** ******* **** will **** **** ************ ** *** stand-alone ******* ** **** ** ********* to * ******* ******** ****.

Upcoming ********

******* ******** **** **** *** ******* to ********* ***-**** ******* ** ** development *** ****** **** ****-********. **** is **** ******** ** ***** ** an ********** **** * ****, *** more ** ** ***** ****** ** track **** *********** ** ****** **** appearances ** ** *****/******** ******, ** clothing **** ** ******, ***.

Face *********** **********

**** *********** ******* **** ***** **** challenges **** *** *****. **** *** technical: ************ *** *********** **** ****** than ******* ************ ****** ******* *****, requirements *** ******* ********* ** *******, inabilities ** **** **** **** ***** at ****. ***** ****** *** ******* concerns **** ********, ** ****** ****** like **** ** ******* ********* ***** and ********** *** ******* **** ******* a ***** **** ** *******. *******, this *** **** *** ******* ****** to ******, *** ********** *** ******** that * ****** ** * ********* will ******** ** ******** ** **** do ****** ** *****.

**** *** *** ****** *** **** recognition *******, ********* *** *** ******* outlined *****, *** ****** ******* ** how ******** **** *** ******* ** real-world *********** ** ***** *******, ** to *** *** ************ *****.

Other **** *********** *********

******* ******** ** *** ***** ** the **** *********** ******** *****. ********* such **********,*****,******* **** ********* ******** **** *********** capabilities ** **** ** ***** *********. Additionally, ***** [**** ** ****** *********] and ********* **** **** **** ********* face *********** ************ ********.

** *** **** *** ********** ******* with * ***** **** *********** *******, please ***** * ******* ********* **** experience.

 

 

Comments (15)
Avatar
Christian Laforte
Aug 30, 2017

I tested four or five solutions the past 10 years. They were disappointing. At best I got some of them to work reliably when doing a 1-to-1 matching, i.e. between an access card and the photo previously assigned to it. They never worked "in the wild". Mismatches were common.

This past month we have installed NEC Neoface in our office. I'm really impressed. It took a few weeks (part-time) to learn the system and tweak the many settings, mainly to accommodate some of our older, lower-resolution cameras, and running 10 cameras on an old spare server. It's now working reliably as long as subjects generally look toward a camera. NEC seems to err on the side of caution, i.e. it would rather not generate a match unless it's absolutely sure this is the right person. They have an impressive track record - some customers have databases with millions of people.

BTW organizations seriously considering face recognition should read the US NIST face recognition vendor test. 

https://www.nist.gov/programs-projects/face-recognition-vendor-test-frvt

 

 

(3)
JH
John Honovich
Aug 30, 2017
IPVM

Christian, good feedback!

It's now working reliably as long as subjects generally look toward a camera. 

Well, what do they recommend there? It's hard to make people look towards the camera in a real world, uncontrolled surveillance application?

Also, any sense of how well or poorly it scales up? Btw that I mean, it's easier to get good results with a small watchlist and a small number of people being surveilled but put it in a large scale application with tens of thousands of different people passing daily and hundreds of people on a watchlist and it's a lot harder.

(1)
UM
Undisclosed Manufacturer #2
Sep 05, 2017

Facial recognition accuracy has improved greatly over the past ten years, so it's no surprise that you're more pleased with a more recent offering.

 

While FRVT is important, a more appropriate test is the recent "Faces in Video Evaluation" test. https://www.nist.gov/programs-projects/face-video-evaluation-five

Avatar
Christian Laforte
Aug 30, 2017

> Well, what do they recommend there? It's hard to make people look towards the camera in a real world, uncontrolled surveillance application?

Two approaches that would be cheap and effective:

- Put a monitor with animated content right next to the camera, that the person is likely to look at. For instance, an animated menu (for a restaurant), a welcome message (for corporate reception), a live playback of the surveillance camera (in a retail environment to deter theft), a quote-of-the-day for a church, live sports (in a bar) etc.

- Put the camera next to an intercom, a door bell, or other device that a person will naturally face when interacting with it.

> Also, any sense of how well or poorly it scales up?

I haven't tested it yet but the system appears well designed to scale to millions of subjects. Their technical documentation is extensive and gives lots of details on how to achieve it. From a performance standpoint their software services architecture is quite similar to ours (that scaled to 100 servers for one big project). As for the accuracy, the NIST report is best suited to answer your question.

> Btw that I mean, it's easier to get good results with a small watchlist and a small number of people being surveilled but put it in a large scale application with tens of thousands of different people passing daily and hundreds of people on a watchlist and it's a lot harder.

Yes, that's what the US NIST was tasked with testing, e.g. for airports, border protection, etc. false alarm rates could easily get out of control and make a multi-million dollars systems useless. I've seen it in person 4 years ago at Dubai airport where a rack full of face recognition servers was permanently turned off. (I won't mention that vendor.)

Personally I'm only aware of NEC and Safran/Morpho having a solid track record in these kinds of large government contracts. Of course this is a somewhat controlled environment. A few months ago there was news that a police in UK was able to arrest a criminal through street surveillance. That was NEC, and that's why I became interested in trying it out myself.

(3)
JH
John Honovich
Aug 30, 2017
IPVM

Thanks for the feedback. Interesting to hear about the scaling.

- Put a monitor with animated content right next to the camera, that the person is likely to look at. For instance, an animated menu (for a restaurant), a welcome message (for corporate reception), a live playback of the surveillance camera (in a retail environment to deter theft), a quote-of-the-day for a church, live sports (in a bar) etc.

I laugh at this a little bit because I have heard this advice many times before but it's generally hard to accomplish that type of 'look at the birdie' technique. It, of course, depends who the subject / watchlist / adversary is but the type of professional criminal that are often on such lists tend to be savvy about not staring up at potential cameras, welcome messages, etc.

(1)
PS
Phil Shanahan
Sep 12, 2017

I would imagine that mounting cameras much lower than normal would help with this. I'm thinking a corridor or entrance door where the camera is mounted to a wall at about 1.2 meters high looking directly at the target. As people look in front of them the camera should easily see their face. Typically cameras are mounted on, or close to the ceiling which would be more like 2.4 meters. not sating mounting cameras at 1.2 meters is practical but I don't see any other way to get people to 'look' directly at the camera

(1)
JH
John Honovich
Sep 12, 2017
IPVM

Phil, agreed, mounting cameras lower will help. The #1 objection to that is vandalism (see Camera Vandalism Statistics).

The secondary issue is that when the camera is mounted lower people block others more easily. A higher mounted camera has the benefit to literally look 'over' people and that's a practical issue as well for face recognition if you have a scenario where multiple people will regularly be walking by simultaneously.

(1)
Avatar
Christian Laforte
Aug 30, 2017

> the type of professional criminal that are often on such lists tend to be savvy about not staring up at potential cameras, welcome messages, etc.

You're right, detecting hardened criminals in the wild isn't full-proof yet and won't be for many more years. But I think there are other business opportunities that can live with less-than-perfect accuracy, e.g. retail.

UM
Undisclosed Manufacturer #3
Sep 06, 2017

If you go through some uk airports these days, there are cameras installed in the passport control areas with an animated ring of leds around the lens. These are obviously designed to make you glance up and look directly at the camera, its basic human nature..  Anyone purposely evading could be easily pulled aside for 'further questioning'.

If only I could find a link to them..

(1)
JH
John Honovich
Sep 06, 2017
IPVM

uk airports these days, there are cameras installed in the passport control areas

I believe that and, in those environments, you have the law and authorities to enforce that immediately. Where it is harder is in less controlled environments like a mall or a store or a city street, etc.

(1)
UM
Undisclosed Manufacturer #3
Sep 06, 2017

Agree 100% with you John. But that's the difference between a soft and hard target.

UE
Undisclosed End User #1
Aug 30, 2017

Disclaimer, I worked for manufacturer for almost 10years that did Face Analytics and have cut my teeth in the school of Hard Knox, I once drank the Kool-Aide from the firehouse as well.

Welcome the world of Behavior Analytics in uncontrolled environments with uncooperative subjects.  I suggest you not use "real-time alerting" as a knock-out requirement it will never work as planned w/o specially placed sensors and subjects who are in lane of analysis while looking in the direction of the sensor for a minimum of 2-3 seconds.  Even with all of that there are so many variables.

Best value for FACIAL SURVEILLANCE is for forensic searching, you don't have real-time matching issues.  You accept all of the false matches is search and weed them out but cannot do the same in real-time.  Just like "G" Search for Best BrewPub, you take all the crap it gives you and filter out what you don't want to find the best cold beer, rank indexing.  Take it for what it is, a SEARCH TOOL!

Only place where it works well enough is where there are controlled environments and subjects.  Why does it work well in mantrap, well Johnny knows he must stare at the specially placed Face Camera until it matches him, he is a static subject and its easy for the algorithm to do its thing with less pixels and computing resources.

Why does other Face Recognition systems like those at DMV or pokey work well matching?  Simple answer when you limit all of the variables then the analysis is much easier

  • Static Subjects told to stand still until image is captured
  • Camera adjusted to same eye height for all subjects input
  • FoV is the same every time
  •  No hats/glasses/smiling
  • Environmental - there are lights and a backdrop
  • Pixel Density is the same for every image, often very high resolution

Want a tip to beat Face Detection, look down when you walk in the area of analysis.

 

(2)
(1)
JH
John Honovich
Aug 30, 2017
IPVM

Good points.

Want a tip to beat Face Detection, look down when you walk in the area of analysis.

And criminals generally know this, even without face recognition or deep learning, they know it's better to look down so cameras don't get a clear shot. And / or to wear a hat or sunglasses, etc. It does not take a criminal mastermind to figure that out.

(2)
UM
Undisclosed Manufacturer #2
Sep 05, 2017

Of course, if all criminals were masterminds, fingerprint identification systems would never have gained traction. Good point, though.

Avatar
Brandon Knutson
Oct 17, 2017
IPVMU Certified
(1)
(1)