Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 10 Next »

Image Quality

Understanding what constitutes quality images and how to optimize the quality of images used in 3VR systems is critical to developing viable solutions. Meeting specific requirements for 3VR facial surveillance is much more challenging than in any traditional CCTV deployment. Accordingly, many partners and users are unaccustomed to being concerned about these issues and can easily overlook them.

By understanding and optimizing image quality, you will be able to:

  • Better qualify what opportunities are strong fits for 3VR

  • Set the appropriate level of expectations with partners and users

  • Design a system that accommodates real world conditions

  • Deliver a solution that is optimally effective

Key Elements in Optimizing Images for Facial Surveillance

This report will examine and explain the many elements that are critical to using 3VR for facial surveillance. While fundamental principles apply to images from photographic cameras as well as CCTV cameras, the application will differ. The following summarizes the key elements so that the reader may have a quick reference for future use.

  • Any image, whether video or photo, requires sufficient detail. Detail is determined by

    • the level of resolution in the image

    • the size of the person’s face relative to the size of the image

  • The person’s face must look directly towards the camera. This affects both the class of imported photos that are acceptable and how video cameras must be positioned to capture faces.

  • Imported images have to meet the requirements listed above. With the exception of mugshots and passport photos, most photos do not meet these requirements.

  • Video cameras must be positioned specifically to capture faces. The cost may be inexpensive but the skill is not
    trivial. Care must be taken to precisely position cameras to capture faces consistently.

Imaging Background

Overview

Two aspects of imaging are most important in understanding and optimizing 3VR facial surveillance:

  1. Resolution

  2. Field of View

Without understanding the impact of these two aspects, it will not be possible to master use of 3VR for either importing images or capturing video. As such, prior to analyzing facial surveillance or its application to 3VR, these aspects shall be explained and relevant
introductory material shall be presented.

Resolution

Resolution, as applied to images used in the 3VR system, is defined as the level of visual detail in the image.

Resolution is commonly defined in two dimensions: horizontal and vertical. For instance, a frequent resolution level cited is 640 x 480 pixels. This means that there are 640 unique pixels across the image horizontally and 480 pixels down the image vertically.

The number of horizontal pixels is important for performing 3VR facial surveillance because it determines the amount of detail available for performing facial analysis. Sufficient horizontal pixels are required to perform facial recognition.

One standard metric used in describing video surveillance images is the Common Intermediate Format (CIF). This is a way to quickly
cite specific resolution levels that are commonly used in digital video. CIF specifies specific horizontal and vertical resolution levels.

The following table provides examples of different CIF levels.

CIF Level

Resolution

Example

Quarter CIF

176 x 44

CIF

352 x 288

Typical Internet streaming

2CIF

704 x 240

4CIF

704 x 576

NTSC camera max resolution

16CIF

1408 x 1152

1.5 Megapixel camera

Industry participants commonly cite different CIF levels. These CIF levels have a significant impact on whether imported images or captured video can be used for facial analysis.

To explain further with an example, an image of a person recorded with 4CIF resolution may be able to be used by the 3VR system for facial analysis, however, the same image recorded at CIF resolution may not contain enough detail to be used for facial analysis unless the face is extremely large (at least a quarter of the width of the field of view). See “Camera Placement” on the following page for more information and examples on field of view for facial analysis.

Field of View

Need to Multi Excerpt content from other pages.

Camera Placement for Facial Analysis - 4 Factors

Field of View Determines Size and Resolution of Face

Facial analysis requires a certain minimum resolution level to be effective. This resolution level is measured in pixels.

  • 3VR requires a minimum of 35 horizontal pixels between the eyes (or about 80 - 100 horizontal pixels across the head) to perform facial analysis

  • 3VR performs analysis of all analog NTSC video at 4CIF (704 x 576 pixels)

Given these facts and that the average width between eyes is 3”, and the width of a head is approximately 6 - 7”, an NTSC camera at
4CIF resolution can capture faces in a field of view (FOV) no more than about 4.5 feet.

Place a person standing in the foreground in optimal focus. When they hold their arms stretched out from side-to-side,
you should not be able to see their hands (the image should be cut off at their wrists). If you can see their hands in the
image when they are standing in focus in the foreground, the field of view is too wide for facial analysis.

If you are looking at prerecorded images from a camera already placed, measure the width of the head, and if it is smaller than 1/7 (about 15%) of the field of view, the field of view is too wide for facial analysis.

Poor FOV

Feet

Face

5.5

1/9 of the image

Max Acceptable FOV

Feet

Face

4.5

1/7 of the image

Excellent FOV

Feet

Face

3.5

1/6 of the image

  • No labels