text-detection-0003#

Use Case and High-Level Description#

Text detector based on PixelLink architecture with MobileNetV2-like as a backbone for indoor/outdoor scenes.

Example#

Specification#

Metric

Value

F-measure (Harmonic mean of precision and recall on ICDAR2015)

82.12%

GFlops

51.256

MParams

6.747

Source framework

TensorFlow*

Inputs#

Image, name: Placeholder, shape: 1, 768, 1280, 3 in the format B, H, W, C, where:

  • B - batch size

  • H - image height

  • W - image width

  • C - number of channels

Expected color order: BGR.

Outputs#

  1. name: model/link_logits_/add, shape: 1, 192, 320, 16 - logits related to linkage between pixels and their neighbors.

  2. name: model/segm_logits/add, shape: 1, 192, 320, 2 - logits related to text/no-text classification for each pixel.

Refer to PixelLink and demos for details.

Demo usage#

The model can be used in the following demos provided by the Open Model Zoo to show its capabilities: