Layout Analyzer
Bases: BaseJob
A class for analyzing the layout of documents, including table structure recognition.
This class processes document layouts and detects and recognizes table structures. It initializes
two components: layout_parser for general layout analysis, and table_structure_recognizer for
identifying table structures within documents.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
| `configs` | `dict` | A dictionary of configurations to override the default settings. | `{}` |
| `device` | `str` | The device to use for computation, e.g., "cuda" or "cpu". Defaults to "cuda". | `'cuda'` |
| `visualize` | `bool` | Whether to enable visualization during layout analysis. Defaults to False. | `False` |
| `license_key` | `str` | The license key for using specific features or services. Defaults to None. | `None` |
| `secret_key` | `str` | The secret key for authentication with external services. Defaults to None. | `None` |
| `device_token` | `str` | The device token for authentication with external services. Defaults to None. | `None` |
Attributes:
| Name | Type | Description |
|---|---|---|
| `layout_parser` | `LayoutParser` | An instance of the layout parser used for general layout analysis. |
| `table_structure_recognizer` | `TableStructureRecognizer` | An instance of the table structure recognizer used for detecting and analyzing table structures in documents. |
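The constructor parameters above can be collected as in the following minimal sketch. The helper names `build_analyzer_kwargs` and `create_layout_analyzer` are hypothetical, and the class name `LayoutAnalyzer` is assumed from the page title; only the module path (`yomitoku.layout_analyzer`) and the documented defaults come from this page.

```python
from typing import Any, Dict, Optional


def build_analyzer_kwargs(
    device: str = "cuda",
    visualize: bool = False,
    configs: Optional[dict] = None,
) -> Dict[str, Any]:
    """Collect constructor keyword arguments using the documented defaults.

    Hypothetical helper, not part of the library.
    """
    return {
        # Copy the dict so the caller's object is never shared or mutated.
        "configs": dict(configs) if configs is not None else {},
        "device": device,
        "visualize": visualize,
    }


def create_layout_analyzer(**overrides: Any):
    """Instantiate the analyzer with the documented defaults plus overrides.

    Hypothetical helper. The import is done lazily so this module can be
    loaded without the package installed. The class name LayoutAnalyzer is
    an assumption based on the page title.
    """
    from yomitoku.layout_analyzer import LayoutAnalyzer

    return LayoutAnalyzer(**build_analyzer_kwargs(**overrides))
```

For example, `create_layout_analyzer(device="cpu", visualize=True)` would request CPU inference with visualization enabled, assuming the package is installed and the class name matches.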
Source code in src/yomitoku/layout_analyzer.py
__call__(img)
Perform layout analysis on the given image.
This method processes the input image to detect and analyze the layout structure, including paragraphs, tables, and figures. It combines the results from the layout parser and the table structure recognizer to produce a comprehensive analysis.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
| `img` | `ndarray` | The input image in BGR format. | required |
Returns:
| Name | Type | Description |
|---|---|---|
| `tuple` | `tuple[LayoutAnalyzerSchema, ndarray \| None]` | A tuple containing: |
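Because the second element of the returned tuple may be `None`, callers need to unpack it defensively. The sketch below shows one way to do that. `run_layout_analysis` and `stub_analyzer` are hypothetical names used only for illustration, and the stub stands in for a real analyzer instance so the example runs without the library or a model.

```python
from typing import Any, Callable, Dict, Optional, Tuple

# Any callable that maps an image to (results, visualization-or-None).
AnalyzerFn = Callable[[Any], Tuple[Any, Optional[Any]]]


def run_layout_analysis(analyzer: AnalyzerFn, img: Any) -> Dict[str, Any]:
    """Call the analyzer and unpack its (results, visualization) tuple."""
    results, visualization = analyzer(img)
    return {
        "results": results,
        "visualization": visualization,
        # The documented return type allows None for the second element.
        "has_visualization": visualization is not None,
    }


def stub_analyzer(img: Any) -> Tuple[Any, Optional[Any]]:
    """Placeholder analyzer: returns dummy results and no visualization."""
    return "stub-results", None
```

With a real analyzer instance, `img` would be an image array in BGR channel order, which is the order OpenCV's `cv2.imread` produces. That the visualization is `None` when `visualize` is False is an assumption drawn from the return type, not something this page states.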