Hikvision Partners With Intel Movidius For Artificial intelligence Cameras

By: IPVM Team, Published on Oct 25, 2016

The world's largest camera manufacturer is partnering with the worlds largest semiconductor company to create a series of intelligent cameras.

Hikvision is partnering with chipmaker Movidius (soon to be acquired by Intel) to add Deep Neural Network (learning) video analytics to their cameras.

For background, see Intel Movidius IPVM overview, especially if you have not heard of Movidius before. Inside this post, we share feedback from Movidius and analyze the potential impact against incumbents including Axis and Avigilon.

*** *****'* ******* ****** manufacturer ** ********** **** the ****** ******* ************* company ** ****** * series ** *********** *******.

********* ** ********** **** chipmaker ******** (**** ** be ******** ** *****) to *** **** ****** Network (********) ***** ********* to ***** *******.

*** **********, *** ***** ******** **** ********, ********** ** *** have *** ***** ** Movidius ******. ****** **** post, ** ***** ******** from ******** *** ******* the ********* ****** ******* incumbents ********* **** *** Avigilon.

[***************]

Hikvision ******** **** ****

********* **** ** ***** Movidius' ****** * ****** ********** Unit. *** ****** * VPU ** * ****** on **** (***) ******, which ******* **** ****** chips ** **** ** integrates **** ********, ********* that ****** *** **** available ** ********** ** cameras.

'Deep ********' ** *********** ***** *********

*********, **** **** *************, has ******* **** **** of ***** ********* *** years. ***** *** *********** 'rule *****' ********* *** performance *** **** *******, especially ** ********* ******* environments *** ***** ******** is ******** (*.*. *** ********* ***** ********* ******).

'**** ********' ****** ** radically ******** *** ******** / *********** ** ***** analytics ** ******** ** individual ************ ******* ** relying ** ***-*** **********.

Advanced ******** / ****** ** *******

****** '******' *********, ******** also **** ** ******* 3D ******* **** ******** imagers ******** ******** ************, as **** ****:

****** ** **** ****** Networks *** ****** ** sensing, ********* *** **** able ** ******* ** to **% ******** ** their ******** ****** ********* applications. **** ** ***** applications *******: *** ***** classification, ******** *********, ********** baggage *****, *** ******** detection.

** *** ***** ******** show, ********* ************ * models, *** *****-****** ****** shown ** ***** ** the ***** *** ** intelligent ******* ******.

Nvidia ** ********

***** ***, ****** *** been *** ******* **** in '**** ********' *** video ************ ********* *** Hikvision *** **** * key ******* (***: ********* ****** ************** ***********). ********* *** **** deploying ****** / *********** based ********* **** ******. While ****** ******* **** edge ***** ********* (***** Tegra *****), ***** **** and ********** **** **** make **** ***********.

******** ** ****** ** bring ******* **** ******** but ** * **** and ***** ***** **** is **** ******** *** edge / ****** ***** analytics. **** ** ************ using *** ********' ****** * ****** ********** Unit. ******** ****** **** their ***** / ******** will ** * ******** of *** ***** *** power *********** ** ******.

****** ******** ** ******* about ****.

Hikvision ****** ******

** ********* **** ******* Movidius ***** ***** *******, it ***** **** * significant ****** ******.

***, *** ****** ****** lacks **** ******* ***** analytic ******* **** ******* developments ** ********* *** years (*.*.,******** / ******* ********* remains ********** ************* *** ** *** ****** challengers). ** ********* *** release ****** ******* *********, this ***** ******* ****** needed *** ******* ********** since ******** *** ***** more ** **** ***** camera's ********* ** ** used ** ***** *** VMS.

***, ********* *** **** extremely **** ** *** low ** *** ***** of *** ****** *** is ***** ****** ** expand **** *** ****-***. If **** *** ******* high ******* *********, **** could ** ** ********* differentiator ** *** **** demanding / ******* ******** customers ** *** ***** products.

*****, ** ********* *** succeed **** ****, ** *********** their *********** *********** ******* both ******** *** ****, who *** **** *********** analytics ** **** **** of ***** ****** ******** (e.g., ********'* ******** ***** approach *** **** ****** 2 ***** ********* ********* this **** -*********************).

** ** *** ***** when ********* **** ******* such *******, *** ** believe **** **** * higher ****** ** ***** adoption than *** ****** ******-***** analytics (******** ********* ***********) given *** ******* ********* of ****** ********* **** specific ******* ** ****** rather **** ****** *** deploying ********* ****** ***** analytics.

Comments (5)

Is Hikvision just quicker to adapt to change than Axis/Avigilon or are they just more public with their announcements of products a year in advance of availability?

Movidius is now announcing releases with Dahua and Uniview too:

What happened to Dahua?

No mention of Dahua now, no press release, google link broken...

Has anybody seen the chip working.
The adoption of the chip does sound promising and I really would like to hear first hand what it can and cannot do.

Other than the videos shown on Movidius website that is....

'Deep Learning' claims to radically increase the accuracy / performance of video analytics by adapting to individual environments instead of relying of pre-set heuristics.

Deep learning approaches use a lot of training data from somewhere to train detectors of important features in that data. Important means (for example) something that indicates presence of a person (or whatever is of interest). This process of training feature detectors replaces the previous technology in which feature detectors were hand crafted (an illustrative but not necessarily accurate example might be a detector to find beards as a possible component of faces). It turns out that for given training and test input (such as might occur in a competition such as ImageNet) the learned feature detectors are much better than hand crafted detectors are for given input. On the other hand the extent to which learned detectors generalise to input different to that typified by the training input is limited and may be inferior to that of the hand crafted detectors.

Historically most success with deep learning has involved supervised approaches (this is changing, however). In supervised approaches humans teach the DL system, for example by providing the correct (ground truth) results corresponding to the training input. This forces the DL system to learn to generate these correct results (basically it keeps trying and improving until it gets there).

To use supervised learning in the scenario described would be quite problematic as quite a lot of supervision is required. For example a human would need to provide the DL system with several hundred (at least) examples of the thing to be detected and a similar number of examples in which that thing was definitely absent. And in the scenario described this would have to be done separately for each and every camera (in its individual environment)

So, people are interested in unsupervised learning where no such input is required (at least not up front). One of the simplest unsupervised DL systems is the (variational) auto-encoder. An auto-encoder is a network that reconstructs its input (so given a 2 megapixel input image it should produce a 2 megapixel output image that is similar to the input . This sounds trivial: why not just copy the input pixels? the trick is that the network send all data through an intermediate layer (and in deep networks many intermediate layers) that have far fewer elements than the input and output layers (e.g. 50,000 instead of 2,000,000). So the auto-encoder has been "handicapped" forcing it to learn something other than a trivial copy of input to output. In particular to do well the intermediate layers need to learn a representation of the input that has far fewer dimensions (equivalent to pixels) and is thus forced to learn the important or distinctive aspects of that input. Notice that this process could run for each camera without supervision, and each camera could thus learn an internal representation specialized to the scene it observes (a so called generative model of that data). So each camera has (in a sense) an understanding of its individual environment encapsulated in this internal representation. Of course the next problem is to relate that internal representation to something the camera's owner cares about (e.g. person crossing the line) and it may be that at this point human intervention is again required (but perhaps at a lower volume than was required for the supervised approach). Think of a small child pointing to things and asking his or her parent "is that a dog?"; generative models can generate output that is characteristic of the model so a trained generative model could produce some number of (different) representative outputs and ask a human operator which of them are of interest...

Now all of this (deep learning) involves a training phase (supervised or unsupervised plus a few other options like reinforcement learning) during which the learning occurs and a later "test" phase during which input is presented and the (now trained) model generates a response (sound the alarum!). The training phase with deep learning is extremely intensive in its use of hardware, typically needing one or more powerful GPUs with multiple gigabytes of memory and quite possibly running for many days or even weeks (this is because the models are "deep" with many layers and may also be "wide"*, and are thus "big" structures). The test phase simply "evaluates" the trained model with new input and is much less intensive (still quite intensive though). I suspect that for the most part the Movideus chip is aimed at test, with learning having occurred elsewhere (e.g. in a data center with lots of GPUs) and this rather complicates the deployment model if one truly wants approaches adapted to each camera's individual environment...

* Not yet "tall", but that could come too...

Login to read this IPVM report.
Why do I need to log in?
IPVM conducts unique testing and research funded by member's payments enabling us to offer the most independent, accurate and in-depth information.

Related Reports on Video Analytics

AI Video Surveillance (Finally) Goes Mainstream In 2020 on Sep 03, 2019
While video surveillance analytics has been promoted, hyped and lamented for nearly 20 years, next year, 2020, will be the year that it finally...
Scylla AI Video Analytics Company Profile on Aug 29, 2019
Scylla, an AI analytics startup, says they are targeting 1 Billion dollar valuation in 5 years and it "is not rocket science" to detect weapons and...
Anyvision Facial Recognition Tested on Aug 21, 2019
Anyvision is aiming for $1 billion in revenue by 2022, backed by $74 million in funding. But does their performance live up to the hype they have...
Verkada People And Face Analytics Tested on Aug 16, 2019
This week, Verkada released "People Analytics", including face analytics that they describe is a "game-changing feature" that "pushes the...
Dahua Analytics+ Tested on Aug 07, 2019
Dahua's analytics have performed poorly in past shootouts. But now, they claim their new Analytics+ "algorithms significantly improve accuracy and...
Honeywell Speaks On NDAA Ban, New Non-Banned Cameras and Cybersecurity on Aug 06, 2019
For years, Honeywell has depended on Dahua, a company with a poor cybersecurity track record and now banned by the US NDAA, for the development and...
History of Video Surveillance on Jul 19, 2019
The video surveillance market has changed significantly since 2000, going from VCRs to ab emerging AI cloud era.  The goal of this history is to...
Wyze AI Analytics Tested - Beats Axis and Hikvision on Jul 17, 2019
$20 camera disruptor Wyze has released free person detection deep learning analytics to all of their users, claiming users will "Only get notified...
Ivideon Russian VSaaS Profile on Jun 27, 2019
Ivideon was an early VSaaS entrant, initially focusing on the consumer market, claiming massive growth to IPVM in 2014. We spoke to Ivideon, to...
Directory of 60 Video Surveillance Startups on Jun 25, 2019
This directory provides a list of video surveillance startups to help you see and research what companies are new or not yet broadly known. 2019...

Most Recent Industry Reports

How Cobalt Robotics May Disrupt Security on Sep 13, 2019
While security robots have largely become a joke over the last few years, one organization, Cobalt Robotics, has raised $50+ million from top US...
Panasonic 4K Camera Tested (WV-S2570L) on Sep 13, 2019
Panasonic has released their latest generation 4K dome, the WV-S2570L, claiming "Extreme image quality allows evidence to be captured even under...
ASIS GSX 2019 Show Report Final on Sep 12, 2019
IPVM went to Chicago for ASIS GSX 2019, with many exhibitors disappointed about traffic and the exhibitor schedule changing next year. Inside we...
Installation Course - Last Chance - Register Now on Sep 12, 2019
Last Chance - Register Now - September 2019 Video Surveillance Install Course. Thursday, September 12th is your last chance to register for the...
Commend ID5 Intercom Tested on Sep 12, 2019
Commend touts the new ID5 intercom as 'timelessly elegant' and the slim body, glass front touchscreen indeed looks better than common, but ugly,...
US State Department: "Chinese Tech Giants" "Tools of the Chinese Communist Party" on Sep 12, 2019
The US State Department has called out "Chinese tech giants" for being "tools of the Chinese Communist Party" in a blunt new speech that makes...
Uniview OEM Directory on Sep 11, 2019
This directory lists 20+ companies that OEM products from Uniview, with a graphic and links to company websites below. It does not cover all...
Yi Home Camera 3 AI Analytics Tested on Sep 10, 2019
Yi Technology is claiming "new AI features" in its $50 Home Camera 3 "eliminates 'false positives' caused by flying insects, small pets, or light...
Hanwha Announces 32MP Camera + AI Line on Sep 10, 2019
In the first rise in maximum megapixel resolution in 5 years, Hanwha has announced a 32MP / 8K camera directly competing with Avigilon's H4 30MP /...
Fingerprints for Access Control Guide on Sep 09, 2019
Users can lose badges, but they never misplace a finger, right? The most common biometric used in access are fingerprints, and it has become one...

The world's leading video surveillance information source, IPVM provides the best reporting, testing and training for 10,000+ members globally. Dedicated to independent and objective information, we uniquely refuse any and all advertisements, sponsorship and consulting from manufacturers.

About | FAQ | Contact