text-detection-0003¶
Use Case and High-Level Description¶
Text detector based on PixelLink architecture with MobileNetV2-like as a backbone for indoor/outdoor scenes.
Example¶

Specification¶
Metric |
Value |
---|---|
F-measure (Harmonic mean of precision and recall on ICDAR2015) |
82.12% |
GFlops |
51.256 |
MParams |
6.747 |
Source framework |
TensorFlow* |
Inputs¶
Image, name: input
, shape: 1, 3, 768, 1280
in the format B, C, H, W
, where:
B
- batch sizeC
- number of channelsH
- image heightW
- image width
Expected color order: BGR
.
Outputs¶
name:
model/link\_logits\_/add
, shape:1, 16, 192, 320
- logits related to linkage between pixels and their neighbors.name:
model/segm\_logits/add
, shape:1, 2, 192, 320
- logits related to text/no-text classification for each pixel.
Refer to PixelLink and demos for details.
Legal Information¶
[*] Other names and brands may be claimed as the property of others.